#Game‑Based Evaluation
Latest news articles tagged with "Game‑Based Evaluation". Follow the timeline of events, related topics, and entities.
Articles (1)
- 🇺🇸 Playing With AI: How Do State-Of-The-Art Large Language Models Perform in the 1977 Text-Based Adventure Game Zork?
[USA]
arXiv:2602.15867v1 (cross-listed). Abstract: In this positioning paper, we evaluate the problem-solving and reasoning capabilities of contemporary Large Language Models (LLMs) through their perf...
Related: #Large Language Models, #Natural Language Understanding, #Problem‑Solving & Reasoning