#AI alignment

Latest news articles tagged with "AI alignment". Follow the timeline of events, related topics, and entities.

Articles (4)

🇺🇸 Asymmetric Goal Drift in Coding Agents Under Value Conflict — 05/03/2026 [USA]
arXiv:2603.03456v1 Announce Type: new Abstract: Agentic coding agents are increasingly deployed autonomously, at scale, and over long-context horizons. Throughout an agent's lifetime, it must navigat...
Related: #Value conflict, #Goal drift, #AI safety
🇺🇸 PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding — 25/02/2026 [USA]
arXiv:2602.20696v1 Announce Type: new Abstract: Reliable AI systems require large language models (LLMs) to exhibit behaviors aligned with human preferences and values. However, most existing alignme...
Related: #Test-time enhancement, #Cost-efficient AI, #Behavior control
🇺🇸 Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing — 19/02/2026 [USA]
arXiv:2510.12121v2 Announce Type: replace Abstract: Precise attribute intensity control--generating Large Language Model (LLM) outputs with specific, user-defined attribute intensities--is crucial fo...
Related: #Attribute control in language models, #Representation editing, #User‑driven customization
🇺🇸 Dialogical Reasoning Across AI Architectures: A Multi-Model Framework for Testing AI Alignment Strategies — 29/01/2026 [USA]
arXiv:2601.20604v1 Announce Type: new Abstract: This paper introduces a methodological framework for empirically testing AI alignment strategies through structured multi-model dialogue. Drawing on Pe...
Related: #Dialogical reasoning, #Peace Studies, #Technology

Key Entities (3)

AI alignment (2 news)
Large language model (1 news)
AI safety (1 news)

About the topic: AI alignment

The topic "AI alignment" aggregates 4+ news articles from various countries.