arXiv cs.AIWednesday · May 27, 2026FREE

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation

llm-agentscudakernel-generationfeedback

Researchers introduced CUDAnalyst, a unified analysis layer for studying how LLM agents make feedback-to-plan decisions during CUDA kernel generation. The framework uses trajectory freezing and selective feedback injection to isolate the effects of different feedback components. Key findings include: explicit planning is beneficial only when feedback is aligned; effective planning emerges from structured multi-feedback interactions; and high-level plans from stronger reasoning models can partially transfer to weaker ones. These results hold across reference backbones and workloads. The work addresses the opacity of planning decisions in iterative self-evolving agents, providing a method for stable generation-level evaluation and coalitional-style attribution.

// why it matters

Developers can now better understand and improve LLM agent planning for code generation tasks.

Sources

Primary · arXiv cs.AI

▸ Read original at arxiv.org

Diagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profiles SKILLC: Learning Autonomous Skill Internalization in LLM Agents via Contrastive Credit Assignment Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation

Sources

Related

Like this? Get the next digest.