#AI alignment
Latest news articles tagged with "AI alignment". Follow the timeline of events, related topics, and entities.
Articles (4)
-
๐บ๐ธ Asymmetric Goal Drift in Coding Agents Under Value Conflict
[USA]
arXiv:2603.03456v1 Announce Type: new Abstract: Agentic coding agents are increasingly deployed autonomously, at scale, and over long-context horizons. Throughout an agent's lifetime, it must navigat...
Related: #Value conflict, #Goal drift, #AI safety -
๐บ๐ธ PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding
[USA]
arXiv:2602.20696v1 Announce Type: new Abstract: Reliable AI systems require large language models (LLMs) to exhibit behaviors aligned with human preferences and values. However, most existing alignme...
Related: #Test-time enhancement, #Cost-efficient AI, #Behavior control -
๐บ๐ธ Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing
[USA]
arXiv:2510.12121v2 Announce Type: replace Abstract: Precise attribute intensity control--generating Large Language Model (LLM) outputs with specific, user-defined attribute intensities--is crucial fo...
Related: #Attribute control in language models, #Representation editing, #Userโdriven customization -
๐บ๐ธ Dialogical Reasoning Across AI Architectures: A Multi-Model Framework for Testing AI Alignment Strategies
[USA]
arXiv:2601.20604v1 Announce Type: new Abstract: This paper introduces a methodological framework for empirically testing AI alignment strategies through structured multi-model dialogue. Drawing on Pe...
Related: #Dialogical reasoning, #Peace Studies, #Technology
Key Entities (3)
- AI alignment (2 news)
- Large language model (1 news)
- AI safety (1 news)
About the topic: AI alignment
The topic "AI alignment" aggregates 4+ news articles from various countries.