#LLM Agents

Latest news articles tagged with "LLM Agents". Follow the timeline of events, related topics, and entities.

Articles (6)

🇺🇸 On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents — 13/03/2026 [USA]
arXiv:2603.12109v1 Announce Type: new Abstract: Reinforcement learning (RL) with outcome-based rewards has achieved significant success in training large language model (LLM) agents for complex reaso...
Related: #Reinforcement Learning, #Reasoning
🇺🇸 Hindsight Credit Assignment for Long-Horizon LLM Agents — 11/03/2026 [USA]
arXiv:2603.08754v1 Announce Type: cross Abstract: Large Language Model (LLM) agents often face significant credit assignment challenges in long-horizon, multi-step tasks due to sparse rewards. Existi...
Related: #AI Research
🇺🇸 Memory for Autonomous LLM Agents: Mechanisms, Evaluation, and Emerging Frontiers — 10/03/2026 [USA]
arXiv:2603.07670v1 Announce Type: new Abstract: Large language model (LLM) agents increasingly operate in settings where a single context window is far too small to capture what has happened, what wa...
Related: #AI Memory
🇺🇸 Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation — 09/03/2026 [USA]
arXiv:2603.06064v1 Announce Type: new Abstract: Task planning, the problem of sequencing actions to reach a goal from an initial state, is a core capability requirement for autonomous robotic systems...
Related: #AI Planning
🇺🇸 EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection — 06/03/2026 [USA]
arXiv:2603.04900v1 Announce Type: new Abstract: LLM-based agents depend on effective tool-use policies to solve complex tasks, yet optimizing these policies remains challenging due to delayed supervi...
Related: #AI Optimization
🇺🇸 CaveAgent: Transforming LLMs into Stateful Runtime Operators — 19/02/2026 [USA]
arXiv:2601.01569v2 Announce Type: replace Abstract: LLM-based agents are increasingly capable of complex task execution, yet current agentic systems remain constrained by text-centric paradigms that ...
Related: #Runtime Operator Paradigm, #Dual‑Stream Architecture, #Long‑Horizon Task Execution, #Text‑Centric Limitations

Key Entities (3)

Planning Domain Definition Language (1 news)
Large language model (1 news)
Reinforcement learning (1 news)

About the topic: LLM Agents

The topic "LLM Agents" aggregates 6+ news articles from various countries.