Experiments in Agentic AI for Science
This arXiv paper (2605.26305) introduces two agentic AI frameworks for scientific workflows. DeepTS/DeepCollector automates large-scale curation, extraction, and deduplication of time-series datasets. DeepScribe is an autonomous presentation analyzer that converts visually dense physics lectures into structured scientific reports. Both use a hybrid Local Body, Remote Brain architecture via Google Colab, with Python orchestrators invoking LLM cloud backends. Key engineering contributions include granular attribute extraction (Cellular RAG), remote data inspection, and distributed concurrency controls to overcome LLM context and reasoning limits. The authors generalize DeepTS to support deep knowledge graphs and discuss application to high-energy physics (DeepQCD). The paper is available on arXiv, published May 27, 2026.
Enables autonomous AI agents to handle complex scientific data curation and analysis tasks.