OpenAI gets caught vibe graphing

During its major GPT-5 presentation on Thursday, OpenAI unveiled several charts intended to highlight the model’s capabilities. However, upon closer inspection, some of these graphs contained inaccuracies.

One chart meant to illustrate GPT-5’s performance in “deception evals across models” had inconsistent scaling. For “coding deception,” for instance, it showed GPT-5 with a 50.0 percent deception rate, yet o3’s lower score of 47.4 percent was drawn with a larger bar.

In another chart, GPT-5’s score was lower than o3’s yet was drawn with a bigger bar. The same chart showed o3 and GPT-4o with different scores but identically sized bars. The chart was flawed enough that CEO Sam Altman commented on it, calling it a “mega chart screwup,” and an OpenAI marketing team member apologized for what they termed an “unintentional chart crime.”

OpenAI did not immediately reply to a request for comment. While it remains unclear whether GPT-5 was used to generate the charts, the errors cast a shadow over the company’s major launch event, particularly since it was promoting the “significant advances in reducing hallucinations” achieved with the new model.
