Seshat - Search News

AI Models Struggle with Historical Accuracy, GPT-4 Turbo Only Scores 46%

According to a new study, many AI models don't answer accurately about world history which is a very concerning matter. The researchers of the study developed some answer questions using benchmarks ...

Hosted on MSN18d

AI models struggle with expert-level global history knowledge

Researchers recently evaluated the ability of advanced artificial intelligence (AI) models to answer questions about global history using a benchmark derived from the Seshat Global History Databank.

earth19d

AI struggles to understand human history and fails miserably when tested

For over a decade, complexity scientist Peter Turchin and his collaborators have worked to compile an unparalleled database of human history – the Seshat Global History Databank. Recently, Turchin and ...

EurekAlert!20d

Can ChatGPT pass a Ph.D.-level history test?

Peter Turchin, from the Complexity Science Hub, and an international team of collaborators decided to evaluate the historical knowledge of advanced A.I. models such as ChatGPT-4, Llama, and ...

techxplore20d

Can AI pass a Ph.D.-level history test? New study says 'not yet'

For the past decade, complexity scientist Peter Turchin has been working with collaborators to bring together the most current and structured body of knowledge about human history in one place: the ...

azoai20d

AI Models Struggle to Master Expert-Level Historical Knowledge

This new assessment, the first of its kind, challenged these AI systems to answer questions at a graduate and expert level, similar to the ones answered in Seshat (using a multi-shot approach that ...

20don MSN

AI chatbots still can’t accurately answer high-level history questions: study

While artificial intelligence excels at tasks like coding and podcast generation, it struggles to accurately answer high-level history questions, according to a study. Researchers tested OpenAI’s ...

New York Post20d

AI chatbots still can’t accurately answer high-level history questions: study

While artificial intelligence excels at tasks like coding and podcast generation, it struggles to accurately answer high-level history questions, according to a study. Artificial intelligence ...

Yahoo News20d

Factbox-Who has Donald Trump threatened to prosecute as president?

(Reuters) -Donald Trump has vowed to investigate or prosecute political rivals, former intelligence officials, the country's former military chief, prosecutors and judges, tech moguls, members of ...

newsbytesapp.com21d

AI systems struggle with complex historical questions, new study reveals

A new study has found that artificial intelligence (AI) systems are failing to respond to complicated historical queries. The research was conducted by a team from the Complexity Science Hub (CSH), an ...

Yahoo Finance21d

AI isn’t very good at history, new paper finds

The benchmark, Hist-LLM, tests the correctness of answers according to the Seshat Global History Databank, a vast database of historical knowledge named after the ancient Egyptian goddess of wisdom.

21d

AI isn’t very good at history, new paper finds

Meta’s Llama, and Google’s Gemini — on historical questions. The benchmark, Hist-LLM, tests the correctness of answers according to the Seshat Global History Databank, a vast database of historical ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results