While artificial intelligence excels at tasks like coding and podcast generation, it struggles to accurately answer high-level history questions, according to a study. Researchers tested OpenAI’s ...
The study, which is the first of its kind, evaluates the historical knowledge of leading AI models such as ChatGPT-4, Llama, ...
AI might excel at certain tasks like coding or generating a podcast. But it struggles to pass a high-level history exam, a new paper has found.
For the past decade, complexity scientist Peter Turchin has been working with collaborators to bring together the most current and structured body of knowledge about human history in one place: the ...
According to a new study, many AI models fail to answer questions about world history accurately, a concerning finding. The ...
Peter Turchin, from the Complexity Science Hub, and an international team of collaborators decided to evaluate the historical ...
A team of researchers has developed a novel benchmark to evaluate the historical knowledge of leading large language models (LLMs) and found significant ...