Saturday, February 7, 2026

New chatbot ‘outperforms PhDs on literature reviews’ - Jack Grove, Times Higher Education

A new chatbot designed by scholars can outperform PhD students and postdocs in undertaking scientific literature reviews, according to a Nature study that says the large language model (LLM) is capable of producing reliable summaries for less than a penny. Evaluating a new model designed to stop ChatGPT’s frequent “hallucinations” when it conducts literature reviews, US researchers asked experts in computer science, physics, neuroscience and biomedicine to assess summaries written by OpenScholar and a spin-off version ScholarQABench against reviews written by PhD students. According to the study, published on 4 February, the domain-level experts – also PhDs and postdocs – preferred OpenScholar and ScholarQABench responses either 51 per cent or 70 per cent of the time respectively.

https://www.timeshighereducation.com/news/new-chatbot-outperforms-phds-literature-reviews