#LLM Agents
Latest news articles tagged with "LLM Agents". Follow the timeline of events, related topics, and entities.
Articles (6)
-
🇺🇸 On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents
[USA]
arXiv:2603.12109v1 Announce Type: new Abstract: Reinforcement learning (RL) with outcome-based rewards has achieved significant success in training large language model (LLM) agents for complex reaso...
Related: #Reinforcement Learning, #Reasoning -
🇺🇸 Hindsight Credit Assignment for Long-Horizon LLM Agents
[USA]
arXiv:2603.08754v1 Announce Type: cross Abstract: Large Language Model (LLM) agents often face significant credit assignment challenges in long-horizon, multi-step tasks due to sparse rewards. Existi...
Related: #AI Research -
🇺🇸 Memory for Autonomous LLM Agents:Mechanisms, Evaluation, and Emerging Frontiers
[USA]
arXiv:2603.07670v1 Announce Type: new Abstract: Large language model (LLM) agents increasingly operate in settings where a single context window is far too small to capture what has happened, what wa...
Related: #AI Memory -
🇺🇸 Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation
[USA]
arXiv:2603.06064v1 Announce Type: new Abstract: Task planning, the problem of sequencing actions to reach a goal from an initial state, is a core capability requirement for autonomous robotic systems...
Related: #AI Planning -
🇺🇸 EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection
[USA]
arXiv:2603.04900v1 Announce Type: new Abstract: LLM-based agents depend on effective tool-use policies to solve complex tasks, yet optimizing these policies remains challenging due to delayed supervi...
Related: #AI Optimization -
🇺🇸 CaveAgent: Transforming LLMs into Stateful Runtime Operators
[USA]
arXiv:2601.01569v2 Announce Type: replace Abstract: LLM-based agents are increasingly capable of complex task execution, yet current agentic systems remain constrained by text-centric paradigms that ...
Related: #Runtime Operator Paradigm, #Dual‑Stream Architecture, #Long‑Horizon Task Execution, #Text‑Centric Limitations
Key Entities (3)
- Planning Domain Definition Language (1 news)
- Large language model (1 news)
- Reinforcement learning (1 news)
About the topic: LLM Agents
The topic "LLM Agents" aggregates 6+ news articles from various countries.