Today's digest · Monday, May 25

The 40 things in AI/dev today.

LiveNext issue at 7:00 CET
Stories
40
3 top · 37 rest
#1 / TODAY
arXiv cs.AI·1 min·38h agoFREE

Inductive Deductive Synthesis: Enabling AI to Generate Formally Verified Systems

Researchers introduce Inductive Deductive Synthesis (IDS), an LLM agent system that jointly synthesizes implementation and formal proof for distributed systems. IDS achieves 7/7 verified specifications in ~6.8 hours at $106 per spec, roughly 200x faster than expert effort and 17% cheaper than state-of-the-art agents.

IDS makes formal verification of distributed systems practical for developers, reducing time and cost dramatically.

llmformal-verificationdistributed-systemsai-agents
arxiv.org
Inductive Deductive Synthesis: Enabling AI to Generate Formally Verified Systems
SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research
#2 / TOP STORY
arXiv cs.AIFREE

SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research

SciAtlas is a large-scale knowledge graph integrating 43M papers across 26 disciplines, with 157M entities and 3B triplets. It provides a structured topological substrate for AI agents, enabling neuro-symbolic retrieval with tri-path recall and graph reranking to reduce logical hallucinations and inference costs.

Energy per Successful Goal: Goal-Level Energy Accounting for Agentic AI Systems
#3 / TOP STORY
arXiv cs.AIFREE

Energy per Successful Goal: Goal-Level Energy Accounting for Agentic AI Systems

A new arXiv paper introduces A-LEMS (Agentic LLM Energy Measurement System), a framework redefining AI energy accounting from per-inference to Energy per Successful Goal (EpG). EpG aggregates total workflow energy across all execution attempts, including failures and retries, normalized by successfully completed goals. This system addresses the limitations of current benchmarks for multi-step agentic AI, where single invocation counts misrepresent true costs. A-LEMS provides a clearer understanding of the energy footprint for complex agentic workflows, enabling more accurate cost optimization.

aigest · daily

Get this every morning.

One email. The signal. Built for builders.

Free · Unsubscribe in one click · No trackers

// Today37 stories

Developers can apply these Constraint Programming and MIP modeling techniques to optimize complex real-world scheduling and resource allocation problems across various industries.

schedulingoptimizationconstraintprogrammingmipprogramming
arXiv cs.AI38h ago2mFREE

Program verification benchmarks may need redesigning as agentic provers exceed current difficulty levels.

claudeagentsprogram-verificationlean4
arXiv cs.AI38h ago1mFREE

Developers can test upcoming features and provide feedback, directly influencing the stable 1.0 release of Datasette.

datasettesqlitedata-publishingalpha-release
Simon Willison43h ago2mFREE

Developers can now experiment with integrating AI agents into their Datasette projects, enabling more interactive and automated data exploration.

datasetteai-agentsdata-toolspython
Simon Willison43h ago1mFREE

This simplifies setting up Datasette projects with consistent data, improving reproducibility and streamlining development workflows.

datasetteplugindata-managementdevelopment-tools
Simon Willison45h ago2mFREE

Reminds developers to prioritize simplicity in tooling to avoid long-term maintenance issues.

simplicitysoftware-designflask
Simon Willison2d ago1mFREE
// Yesterday34 stories