- Article
- Published:
- Eser Aygün1 na1,
- Anastasiya Belyaeva2 na1,
- Gheorghe Comanici1 na1,
- Marc Coram2 na1,
- Hao Cui ORCID: orcid.org/0009-0006-2456-083X2 na1,
- Jake Garrison3 na1,
- Renee Johnston2 na1,
- Anton Kast ORCID: orcid.org/0000-0002-0755-99962 na1,
- Cory Y. McLean ORCID: orcid.org/0000-0001-9928-82162 na1,
- Peter Norgaard2 na1,
- Zahra Shamsi2 na1,
- David Smalling1 na1,
- James Thompson2 na1,
- Subhashini Venugopalan ORCID: orcid.org/0000-0003-3729-84562 na1,
- Brian P. Williams ORCID: orcid.org/0000-0002-2839-01062 na1,
- Chujun He2,4,
- Sarah Martinson ORCID: orcid.org/0009-0004-4636-50612,5,
- Martyna Plomecka2,6,
- Lai Wei2,
- Yuchen Zhou2,
- Qian-Ze Zhu2,5,
- Matthew Abraham2,
- Erica Brand2,
- Anna Bulanova1,
- Jeffrey A. Cardille2,7,
- Chris Co2,
- Scott Ellsworth2,
- Grace Joseph2,
- Malcolm Kane2,
- Ryan Krueger2,5,
- Johan Kartiwa2,
- Dan Liebling2,
- Jan-Matthis Lueckmann ORCID: orcid.org/0000-0003-4320-46632,
- Paul Raccuglia ORCID: orcid.org/0000-0002-2434-28522,
- Xuefei Julie Wang2,8,
- Katherine Chou2,
- James Manyika2,
- Yossi Matias ORCID: orcid.org/0000-0003-3960-60022,
- John C. Platt2,
- Lizzie Dorfman2,
- Shibl Mourad ORCID: orcid.org/0009-0002-2040-70111 &
- …
- Michael P. Brenner ORCID: orcid.org/0000-0002-5673-79472,5
Nature (2026) Cite this article
We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.
Subjects
Abstract
The cycle of scientific discovery is frequently bottlenecked by the slow, manual creation of software to support computational experiments1. To address this, we present Empirical Research Assistance (ERA), an AI system that creates expert-level scientific software whose goal is to maximize a quality metric. The system uses a Large Language Model (LLM) and Tree Search (TS)2 to systematically improve the quality metric and intelligently navigate the large space of possible solutions. ERA achieves expert-level results when it explores and integrates complex research ideas from external sources. The effectiveness of tree search is demonstrated across a diverse range of tasks. In bioinformatics, ERA discovered 40 novel methods for single-cell data analysis that outperformed the top human-developed methods on a public leaderboard. In epidemiology, ERA generated 14 models that outperformed the CDC ensemble and all other individual models for forecasting COVID-19 hospitalizations. ERA also produced expert-level software for geospatial analysis, neural activity prediction in zebrafish, and numerical solution of integrals, and a novel rule-based construction for time series forecasting. By devising and implementing novel solutions to diverse tasks, ERA represents a significant step towards accelerating scientific progress.
You have full access to this article via your institution.
Similar content being viewed by others
Supplementary information
Rights and permissions
About this article
Cite this article
Aygün, E., Belyaeva, A., Comanici, G. et al. An AI system to help scientists write expert-level empirical software. Nature (2026). https://doi.org/10.1038/s41586-026-10658-6
-
Received:
-
Accepted:
-
Published:
-
DOI: https://doi.org/10.1038/s41586-026-10658-6