Today's digest · Wednesday, June 24

The 19 things in AI/dev today.

LiveNext issue at 7:00 CET
#1 / TODAY
Simon Willison·1 min·37h agoFREE

Prompt Injection as Role Confusion

Researchers Charles Ye, Jasmine Cui, and Dylan Hadfield-Menell found that LLMs can be confused by text styled like internal role tags (e.g., <system>, <think>), overriding training. 'Destyling' text to look less like role formats reduced attack success from 61% to 10%. They call this 'role confusion' and warn that injection defense may remain a whack-a-mole game.

Role confusion undermines current prompt injection defenses, making LLM security a perpetual whack-a-mole game.

prompt-injectionllm-securityjailbreakingrole-confusion
simonwillison.net
Prompt Injection as Role Confusion
Porting the Moebius 0.2B image inpainting model to run in the browser with Claude Code
#2 / TOP STORY
Simon WillisonFREE

Porting the Moebius 0.2B image inpainting model to run in the browser with Claude Code

Simon Willison ported the Moebius 0.2B image inpainting model to run in a browser using WebGPU, with help from Claude Code. The model, originally requiring PyTorch and NVIDIA CUDA, was converted to ONNX and deployed on Hugging Face. The demo is available at simonw.github.io/moebius-web/. The project was a side effort while working on a Datasette feature.

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness
#3 / TOP STORY
Hugging FaceFREE

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

IBM Research introduced CUGA, a lightweight harness designed to simplify the development of agentic AI applications. CUGA provides a structured environment with two dozen working examples covering various agentic patterns, from task automation to multi-agent collaborations. This toolkit aims to lower the barrier to entry for developers, enabling quicker prototyping and deployment of agentic solutions.

aigest · daily

Get this every morning.

One email. The signal. Built for builders.

Free · Unsubscribe in one click · No trackers

// Worth acting on7 stories
// Worth knowing8 stories

The provided source text does not detail any specific consequences for developers regarding the OPFS and Pyodide test harness.

opfspyodideweb-developmentpython
Simon Willison18h ago1mFREE
More selected · 1
// Yesterday2 stories