AI Hallucinations Linked to Faulty Model Assessments in New OpenAI Research
OpenAI, a leading research organization in artificial intelligence (AI), has published a new research study that sheds light on a persistent issue in AI systems: hallucinations. These hallucinations, while capable of inspiring surreal art or imaginative leaps, pose significant challenges in day-to-day knowledge-based work.
The study likens AI systems to a clever but overconfident friend: capable of offering insightful information, yet unreliable enough that their statements demand fact-checking, especially when they sound too good to be true.
The problem, OpenAI explains, is not ignorance but incentive structures that reward polished guesses over calibrated restraint. This is particularly evident in how AI is currently evaluated: accuracy-only leaderboards reward models that bluff, so bluffing "wins" under the current rules.
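To make the arithmetic concrete, consider a hypothetical four-option multiple-choice question graded the way accuracy-only leaderboards grade today: one point for a correct answer, nothing otherwise. The short sketch below is illustrative and not taken from OpenAI's paper.

```python
# Expected leaderboard score under accuracy-only grading (hypothetical
# four-option multiple-choice question; values are illustrative).
p_correct_when_guessing = 0.25  # a blind guess among four options

# One point for a correct answer, zero otherwise.
expected_score_guessing = p_correct_when_guessing * 1 + (1 - p_correct_when_guessing) * 0
score_abstaining = 0  # "I don't know" earns nothing under accuracy-only rules

print(expected_score_guessing)  # 0.25
print(score_abstaining)         # 0 -> bluffing strictly beats honest abstention
```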
To address this, OpenAI proposes redesigning evaluations so that false answers are penalized, non-answers receive partial credit to discourage blind guessing, and models are encouraged to show the reasoning steps that justify their answers. The goal is not to eliminate hallucinations entirely, but to make it rational for a model to refrain from answering rather than hallucinate.
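Below is a minimal sketch of the kind of scoring rule this proposal points towards, with wrong answers penalized and abstentions given partial credit. The penalty of -1.0 and the abstention credit of 0.25 are assumptions chosen for illustration, not figures from OpenAI's paper.

```python
from typing import Optional

# Illustrative constants; OpenAI's paper does not prescribe these exact values.
WRONG_PENALTY = -1.0
ABSTAIN_CREDIT = 0.25

def score(answer: Optional[str], correct: str) -> float:
    """Score a single response: full credit when right, partial credit for
    declining to answer, and a negative score for a confident wrong answer."""
    if answer is None:  # the model chose not to answer
        return ABSTAIN_CREDIT
    return 1.0 if answer == correct else WRONG_PENALTY

# Under this rule, a blind guess on a four-option question is worth
# 0.25 * 1.0 + 0.75 * (-1.0) = -0.5 in expectation, while abstaining earns
# 0.25, so restraint now beats bluffing.
print(score("B", "B"))   # 1.0
print(score("A", "B"))   # -1.0
print(score(None, "B"))  # 0.25
```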
The research paper traces hallucinations back to the foundations of pretraining, where models learn by predicting the next word in massive datasets without exposure to examples labeled as "false." This lack of explicit guidance on what is false contributes to the models' tendency to hallucinate.
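For readers unfamiliar with that objective, the sketch below captures its essence in simplified form (generic pretraining logic, not code from the paper): the loss only asks how much probability the model assigned to the token that actually came next, and no truth label ever enters the calculation.

```python
import math

def next_token_loss(probs_of_observed_tokens: list[float]) -> float:
    """Average negative log-likelihood the model assigned to the tokens that
    actually followed in the training text."""
    return -sum(math.log(p) for p in probs_of_observed_tokens) / len(probs_of_observed_tokens)

# The calculation is identical whether the continuation is factually true or
# false; fluent text is rewarded either way.
print(next_token_loss([0.9, 0.8, 0.7]))
```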
In high-stakes contexts like healthcare or legal advice, a model that admits uncertainty is safer than one that invents answers. Smaller models can sometimes be more humble than larger ones: a small model that knows it has not learned Māori can simply admit ignorance.
Calibration, or knowing what you don't know, is not a brute-force problem that only trillion-parameter giants can solve. The deeper problem with AI hallucination, according to OpenAI, is misaligned incentives baked into the very fabric of AI evaluation.
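As a purely hypothetical illustration of calibration-driven restraint, the sketch below uses a simple confidence threshold (the 0.75 value is an arbitrary assumption) to decide whether to answer or abstain; nothing about the idea depends on model scale.

```python
CONFIDENCE_THRESHOLD = 0.75  # arbitrary illustrative cutoff

def respond(candidate_answer: str, confidence: float) -> str:
    """Answer only when the model's own confidence clears the threshold;
    otherwise decline rather than risk a hallucination."""
    if confidence >= CONFIDENCE_THRESHOLD:
        return candidate_answer
    return "I don't know."

print(respond("Option B", 0.95))  # confident enough to answer
print(respond("Option C", 0.40))  # declines instead of guessing
```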
OpenAI's new proposal for reducing AI hallucination is a step towards fixing this, an acknowledgement that the problem lies not just in the models but in the way we measure them. The organization hopes the research will pave the way for more reliable and trustworthy AI systems.
AI systems are powerful tools, but only when grounded in truth or transparent about uncertainty. As we continue to develop and rely on these systems, understanding and addressing issues like hallucinations will be crucial for their safe and effective use.