arXiv cs.AIWednesday · May 27, 2026FREE

Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation

llmalignmentmulti-stakeholderarxiv

A new paper on arXiv (2605.26878) addresses the challenge of aligning LLMs with multiple stakeholders who have conflicting preferences. The authors show that holistic LLM judges conflate utility estimation and aggregation, leading to unstable implicit weights. This weighting noise can cause large score shifts, especially when stakeholder satisfaction is dispersed, and these shifts increase with the number of stakeholders. To solve this, they propose DecompR, which decomposes the process: counterfactual-calibrated weights are fixed from query structure before candidate scoring, and per-role utilities are estimated independently. This removes candidate-dependent weight drift and reduces estimation noise. The paper includes empirical and theoretical evidence demonstrating the effectiveness of DecompR in producing more stable and reliable outputs for multi-stakeholder tasks.

// why it matters

Developers building LLM systems with diverse user groups can achieve more stable and fair outputs.

Sources

Primary · arXiv cs.AI

▸ Read original at arxiv.org

Cross-Entropy Games and Frost Training Hierarchical Prompt-Domain Control and Learning for Resource-Constrained Agentic Language Models When Context Flips, Safety Breaks: Diagnosing Brittle Safety in Aligned Language Models

Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation

Sources

Related

Like this? Get the next digest.