Today's digest · Thursday, May 28

The 43 things in AI/dev today.

LiveNext issue at 7:00 CET
#1 / TODAY
arXiv cs.AI·2 min·6d agoFREE

Intelligence as Managed Autonomy: Failure, Escalation, and Governance for Agentic AI Systems

arXiv paper 2605.27628 introduces a theory of "managed autonomy" for agentic AI systems, addressing failures from unbounded autonomy where agents operate despite rising uncertainty. The SMARt model, a four-layer framework (Stable, Meta-cognitive, Assisted, Regulated states), instantiates this theory. By using a timed, guarded Petri net formulation, the research establishes theoretically bounded properties, demonstrating how architectural design can formally mandate escalation, constrain invalid outputs, and ensure governance reachability for safer, more reliable AI agents.

Developers gain a formal framework to design agentic AI systems with built-in mechanisms for failure detection, recovery, and controlled surrender, enhancing reliability and safety.

aiagentsautonomygovernanceaisafety
arxiv.org
Intelligence as Managed Autonomy: Failure, Escalation, and Governance for Agentic AI Systems
Why LLMs Fail at Causal Discovery and How Interventional Agents Escape
#2 / TOP STORY
arXiv cs.AIFREE

Why LLMs Fail at Causal Discovery and How Interventional Agents Escape

Researchers prove that LLMs cannot reliably perform causal discovery due to a fundamental limitation in distinguishing causal graphs from observational data. They propose A-CBO, which uses a frozen LLM as an interventional oracle with an external Bayesian loop to achieve efficient causal graph identification.

Laguna M.1/XS.2 Technical Report
#3 / TOP STORY
arXiv cs.AIFREE

Laguna M.1/XS.2 Technical Report

Poolside has introduced Laguna M.1 and XS.2, two Mixture-of-Experts foundation models designed for long-horizon, agentic coding. M.1 features 225.8 billion total parameters, while XS.2 has 33.4 billion. Both models were trained using an internal "Model Factory" system. Laguna XS.2's weights are now openly available under the Apache 2.0 license on Hugging Face, offering developers a new competitive option for agentic software engineering tasks and terminal benchmarks. This release provides a smaller, capable model for integration into developer workflows.

aigest · daily

Get this every morning.

One email. The signal. Built for builders.

Free · Unsubscribe in one click · No trackers

// Today40 stories

Enables AI agents to query their own traces and logs directly in the browser.

agentsjavascriptquery-engineopen-source
arXiv cs.AI6d ago1mFREE

This tool simplifies building privacy-preserving, local-first AI agents by leveraging SQLite for data storage and offering multi-LLM support.

sqliteagentsllmsprivacy
Simon Willison6d ago1mFREE

This architectural perspective could simplify AI system design, foster innovation through component reuse, and improve the maintainability of complex AI applications for developers.

aiarchitecturemodularityopensourceaidataengineering
Simon Willison7d ago1mFREE
// Yesterday41 stories