#Language Model Evaluation
Latest news articles tagged with "Language Model Evaluation". Follow the timeline of events, related topics, and entities.
Articles (3)
- 🇺🇸 Pressure Reveals Character: Behavioural Alignment Evaluation at Depth
[USA]
arXiv:2602.20813v1 Announce Type: new Abstract: Evaluating alignment in language models requires testing how they behave under realistic pressure, not just what they claim they would do. While alignm...
Related: #AI Safety, #Alignment Research
- 🇺🇸 Three Concrete Challenges and Two Hopes for the Safety of Unsupervised Elicitation
[USA]
arXiv:2602.20400v1 Announce Type: cross Abstract: To steer language models towards truthful outputs on tasks which are beyond human capability, previous work has suggested training models on easy tas...
Related: #Machine Learning Safety, #Unsupervised Learning
- 🇺🇸 Redefining Evaluation Standards: A Unified Framework for Evaluating the Korean Capabilities of Language Models
[USA]
arXiv:2503.22968v5 Announce Type: replace-cross Abstract: Recent advancements in Korean large language models (LLMs) have driven numerous benchmarks and evaluation methods, yet inconsistent protocols...
Related: #Korean NLP, #Research Standards