logo
welcome
Futurism

Futurism

OpenAI Research Finds That Even Its Best Models Give Wrong Answers a Wild Proportion of the Time

Futurism
Summary
Nutrition label

73% Informative

OpenAI 's latest AI models are shockingly bad at being right.

The AI company has revealed just how bad its latest models are at providing correct answers.

The company's o1-preview model, released last month , scored an abysmal 42.7 percent success rate on the new benchmark.

Competing models, like Anthropic 's, scored even lower on the benchmark.

VR Score

68

Informative language

65

Neutral language

27

Article tone

informal

Language

English

Language complexity

58

Offensive language

not offensive

Hate speech

not hateful

Attention-grabbing headline

not detected

Known propaganda techniques

not detected

Time-value

medium-lived

Source diversity

2

Affiliate links

no affiliate links