PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding
#PromptCD #Polarity-Prompt Contrastive Decoding #AI alignment #Test-time behavior enhancement #Large language models #Vision-language models #3H alignment objectives
📌 Key Takeaways
- PromptCD operates at test time without requiring additional training data
- The method uses paired positive and negative prompts to enhance AI behavior
- Significant improvements demonstrated on '3H' alignment objectives for LLMs
- Enhances VQA performance for vision-language models through visual attention reinforcement
📖 Full Retelling
🏷️ Themes
AI alignment, Test-time enhancement, Cost-efficient AI, Behavior control
📚 Related People & Topics
Large language model
Type of machine learning model
A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially language generation. The largest and most capable LLMs are generative pre-trained transformers (GPTs) that provide the c...
AI alignment
Conformance of AI to intended objectives
In the field of artificial intelligence (AI), alignment aims to steer AI systems toward a person's or group's intended goals, preferences, or ethical principles. An AI system is considered aligned if it advances the intended objectives. A misaligned AI system pursues unintended objectives.
Entity Intersection Graph
Connections for Large language model: