Tjitse Van Der Molen, Luka Cheney, Kamran Hussain, Ojas Brahme, Ash Robbins, Max Lim, Alex Spaeth, Jinghui Geng, David Parks, Kenneth Kosik, Mircea Teodorescu, David Haussler, Tal Sharf
Download Resource
Back
Access Resource
Abstract
Details
Large language models have the potential to transform scientific research and analysis, but without domain-specific structure they produce silent methodological errors, unreported decisions, and irreproducible results. Here we present SpikeLab, a text-to-analysis framework for neural spike data that combines composable data structures with a skill-based agentic system enforcing bounded autonomy: mandatory use of expert-vetted methods, correctness over efficiency, and clarification-seeking on ambiguous requests. In a controlled benchmark on electrophysiology data, Sonnet 4.6 with SpikeLab produced correct and reproducible results across all tasks, outperforming both the unassisted Sonnet and the more capable Opus 4.6, which exhibited deterministic failures including ad hoc method invention, silent data reduction, and inconsistent experimental designs. We demonstrate versatility across in vivo mouse, human, and in vitro brain organoid recordings, and apply the framework to a pharmacological dose-response study spanning single-unit dynamics, pairwise network structure, burst-level temporal sequences, and latent population states, all through natural language prompts without writing analysis code.