Close Menu
  • Home
  • UNSUBSCRIBE
  • News
  • Lifestyle
  • Tech
  • Entertainment
  • Sports
  • Travel
Facebook X (Twitter) WhatsApp
Trending
  • 2 rivers merged to form the Euphrates 3.6 million years ago, eventually leading to the Fertile Crescent
  • NASA confirms fireball meteor exploded over northeastern US with force of 230 tons of TNT
  • Astronauts could use lightning-like plasma jets to kill germs on the moon and Mars, demo hints
  • First whole-genome sequence of a Greenland shark holds clues to their extreme longevity
  • Heading a soccer ball just once is enough to raise levels of proteins associated with brain damage
  • OpenAI’s internal AI model just solved an 80-year-old math problem ‪—‬ and mathematicians verified it
  • Skeletal remains of Queen Elisenda, one of the most powerful rulers in medieval Europe, unearthed in Barcelona — along with several others who bore unexplained stab wounds
  • Tests that measure ‘biological age’ aren’t helpful for tracking your health, scientists say
Facebook X (Twitter) WhatsApp
Baynard Media
  • Home
  • UNSUBSCRIBE
  • News
  • Lifestyle
  • Tech
  • Entertainment
  • Sports
  • Travel
Baynard Media
Home»Tech»Google Gemini contractors reportedly forced to evaluate responses they don’t know about
Tech

Google Gemini contractors reportedly forced to evaluate responses they don’t know about

EditorBy EditorDecember 19, 2024No Comments2 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email

Like any genAI model, Google Gemini responses can sometimes be inaccurate, but in this case it might be because testers don’t have the expertise to fact-check them.

According to TechCrunch, the firm hired to improve accuracy for Gemini is now making its testers evaluate responses even if they don’t have the “domain knowledge.”

SEE ALSO:

Google adds Deep Research to Gemini for browsing the web on your behalf

The report raises questions about the rigor and standards Google says it applies to testing Gemini for accuracy. In the “Building responsibly” section of the Gemini 2.0 announcement, Google said it is “working with trusted testers and external experts and performing extensive risk assessments and safety and assurance evaluations.” There’s a reasonable focus on evaluating responses for sensitive and harmful content, but less attention is paid to responses that aren’t necessarily dangerous but just inaccurate.

Mashable Light Speed

Google seems to disregard the hallucination and error problem by simply adding a disclaimer that “Gemini can make mistakes, so double-check it,” which effectively absolves it from any responsibility. But that doesn’t account for the humans doing the work behind the scenes.

Previously GlobalLogic, a subsidiary of Hitachi, instructed its prompt engineers and analysts to skip a Gemini response they didn’t fully understand. “If you do not have critical expertise (e.g. coding, math) to rate this prompt, please skip this task,” said the guidelines viewed by the outlet.

But last week, GlobalLogic changed its instructions, saying, “You should not skip prompts that require specialized domain knowledge,” and to instead “rate the parts of the prompt you understand,” and note that they don’t have the required expertise in their analysis. Expertise, in other words, is not being treated as a prerequisite for this work.

Contractors can now only skip prompts that are “completely missing information,” according to TechCrunch, or those that contain sensitive content that requires a consent form.

Topics
Artificial Intelligence
Google



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleJohn Lennon’s Son Julian Details Undergoing Surgery for Cancerous Mole
Next Article Live Commentary – Tottenham vs Man Utd
Editor
  • Website

Related Posts

Tech

iPhone exploit DarkSword has been released in the wild

March 24, 2026
Tech

The U.S. router ban: Everything you need to know

March 24, 2026
Tech

Underage sexual content, self-harm info targeted by OpenAI’s new open-source prompts

March 24, 2026
Add A Comment

Comments are closed.

Categories
  • Entertainment
  • Lifestyle
  • News
  • Sports
  • Tech
  • Travel
Recent Posts
  • 2 rivers merged to form the Euphrates 3.6 million years ago, eventually leading to the Fertile Crescent
  • NASA confirms fireball meteor exploded over northeastern US with force of 230 tons of TNT
  • Astronauts could use lightning-like plasma jets to kill germs on the moon and Mars, demo hints
  • First whole-genome sequence of a Greenland shark holds clues to their extreme longevity
  • Heading a soccer ball just once is enough to raise levels of proteins associated with brain damage
calendar
June 2026
M T W T F S S
1234567
891011121314
15161718192021
22232425262728
2930  
« May    
Recent Posts
  • 2 rivers merged to form the Euphrates 3.6 million years ago, eventually leading to the Fertile Crescent
  • NASA confirms fireball meteor exploded over northeastern US with force of 230 tons of TNT
  • Astronauts could use lightning-like plasma jets to kill germs on the moon and Mars, demo hints
About

Welcome to Baynard Media, your trusted source for a diverse range of news and insights. We are committed to delivering timely, reliable, and thought-provoking content that keeps you informed
and inspired

Categories
  • Entertainment
  • Lifestyle
  • News
  • Sports
  • Tech
  • Travel
Facebook X (Twitter) Pinterest WhatsApp
  • Contact Us
  • About Us
  • Privacy Policy
  • Disclaimer
  • UNSUBSCRIBE
© 2026 copyrights reserved

Type above and press Enter to search. Press Esc to cancel.