#Artificial Intelligence Safety
Latest news articles tagged with "Artificial Intelligence Safety". Follow the timeline of events, related topics, and entities.
Articles (2)
-
πΊπΈ CAGE: A Framework for Culturally Adaptive Red-Teaming Benchmark Generation
[USA]
arXiv:2602.20170v1 Announce Type: cross Abstract: Existing red-teaming benchmarks, when adapted to new languages via direct translation, fail to capture socio-technical vulnerabilities rooted in loca...
Related: #Cultural Adaptation, #Benchmark Development -
πΊπΈ Latent Veracity Inference for Identifying Errors in Stepwise Reasoning
[USA]
arXiv:2505.11824v3 Announce Type: replace-cross Abstract: Chain-of-Thought (CoT) reasoning has advanced the capabilities and transparency of language models (LMs); however, reasoning chains can conta...
Related: #Model Transparency and Explainability, #Error Detection in Stepwise Reasoning, #Search Algorithms for Verification