#AI Benchmarking
Latest news articles tagged with "AI Benchmarking". Follow the timeline of events, related topics, and entities.
Articles (2)
-
πΊπΈ ConstraintBench: Benchmarking LLM Constraint Reasoning on Direct Optimization
[USA]
arXiv:2602.22465v1 Announce Type: new Abstract: Large language models are increasingly applied to operational decision-making where the underlying structure is constrained optimization. Existing benc...
Related: #Constraint Reasoning, #Optimization -
πΊπΈ HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models
[USA]
arXiv:2506.03922v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have demonstrated significant potential to advance a broad range of domains. However, current benchm...
Related: #Interdisciplinary Research, #Multimodal AI