Define the package
Identify the paper, dataset, notebooks, intermediate artifacts, lab context, and access limits.
Scientific data infrastructure
We are laying the data foundation for scientific research agents. AI models need richer, better-contextualized research data to become useful to practicing scientists, but most high-quality scientific data is trapped under institutional complexity. We partner with universities and academic labs to transform this data into packages for training scientific agents, while helping academic labs structure their data for future research.
What we do
The most valuable scientific data is rarely just a cleaned table. It lives across lab notebooks, intermediate results, negative results, analysis code, instruments, and the judgment behind a published paper. New Atlantic works with originating labs and universities to turn that context into a well-described research asset that can support future scientific agents.
For PIs and labs
We help labs identify the data and context around a strong paper, organize it into a coherent package, and translate it into tasks that scientific agents can learn from or be evaluated on. The goal is to preserve the work in a form that is useful beyond the original publication while keeping the PI and originating lab connected to any downstream value.
For universities and TTOs
We start with a narrow non-commercial pilot so the university can define the data package, access boundaries, and review process before negotiating commercial terms. New Atlantic handles packaging, environment design, and buyer readiness, while the institution and lab retain a clear role in permissions, licensing, and revenue share.
Commercial path
Frontier AI labs need training and evaluation environments that reflect real scientific work, not just public benchmarks. New Atlantic turns selected lab packages into legally licensable, agent-ready environments with stable tasks, feedback, held-out splits, and reward-integrity protections.
Pilot model
Identify the paper, dataset, notebooks, intermediate artifacts, lab context, and access limits.
Organize the materials into agent tasks with documentation, feedback signals, and evaluation splits.
Convert successful pilots into commercial environments with university and lab revenue share.
What gets built
Start a conversation