Large language model
Type of machine learning model
Rating
123 news mentions · 0 likes · 0 dislikes
Topics
- Artificial Intelligence (73)
- Machine Learning (29)
- Natural Language Processing (11)
- Information Retrieval (6)
- Software Engineering (4)
- AI Safety (4)
- Large Language Models (4)
- Multi-Agent Systems (4)
- Educational Technology (4)
- Benchmark Development (4)
- AI Evaluation (4)
- Reinforcement Learning (3)
Keywords
Large Language Models (96) · arXiv (20) · Large language models (14) · AI Research (8) · Reinforcement Learning (7) · Retrieval-Augmented Generation (7) · AI Safety (7) · Knowledge Graphs (5) · AI Evaluation (5) · Machine Learning (5) · LLMs (4) · Educational Technology (4) · Computational Efficiency (3) · AI benchmarks (3) · AI research (3) · Large Language Model (3) · Medical AI (3) · AI agents (3) · Benchmark (3) · Neural networks (2)
Related News (123)
-
Transformers converge to invariant algorithmic cores
arXiv:2602.22600v1 Announce Type: cross Abstract: Large language models exhibit sophisticated capabilities, yet understanding how they work internall...
-
Ruyi2 Technical Report
arXiv:2602.22543v1 Announce Type: cross Abstract: Large Language Models (LLMs) face significant challenges regarding deployment costs and latency, ne...
-
Generative Agents Navigating Digital Libraries
arXiv:2602.22529v1 Announce Type: cross Abstract: In the rapidly evolving field of digital libraries, the development of large language models (LLMs)...
-
Reinforcement-aware Knowledge Distillation for LLM Reasoning
arXiv:2602.22495v1 Announce Type: cross Abstract: Reinforcement learning (RL) post-training has recently driven major gains in long chain-of-thought ...
-
Sydney Telling Fables on AI and Humans: A Corpus Tracing Memetic Transfer of Persona between LLMs
arXiv:2602.22481v1 Announce Type: cross Abstract: The way LLM-based entities conceive of the relationship between AI and humans is an important topic...
-
Automating the Detection of Requirement Dependencies Using Large Language Models
arXiv:2602.22456v1 Announce Type: cross Abstract: Requirements are inherently interconnected through various types of dependencies. Identifying these...
-
Contextual Memory Virtualisation: DAG-Based State Management and Structurally Lossless Trimming for LLM Agents
arXiv:2602.22402v1 Announce Type: cross Abstract: As large language models engage in extended reasoning tasks, they accumulate significant state -- a...
-
Scaling In, Not Up? Testing Thick Citation Context Analysis with GPT-5 and Fragile Prompts
arXiv:2602.22359v1 Announce Type: cross Abstract: This paper tests whether large language models (LLMs) can support interpretative citation context a...
-
Structure and Redundancy in Large Language Models: A Spectral Study via Random Matrix Theory
arXiv:2602.22345v1 Announce Type: cross Abstract: This thesis addresses two persistent and closely related challenges in modern deep learning, reliab...
-
UpSkill: Mutual Information Skill Learning for Structured Response Diversity in LLMs
arXiv:2602.22296v1 Announce Type: cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has improved the reasoning abilities of large...
-
Manifold of Failure: Behavioral Attraction Basins in Language Models
arXiv:2602.22291v1 Announce Type: cross Abstract: While prior work has focused on projecting adversarial examples back onto the manifold of natural d...
-
Integrating Machine Learning Ensembles and Large Language Models for Heart Disease Prediction Using Voting Fusion
arXiv:2602.22280v1 Announce Type: cross Abstract: Cardiovascular disease is the primary cause of death globally, necessitating early identification, ...
-
Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
arXiv:2602.22242v1 Announce Type: cross Abstract: Large Language Models (LLMs) are widely deployed in real-world systems. Given their broader applica...
-
From Prompts to Performance: Evaluating LLMs for Task-based Parallel Code Generation
arXiv:2602.22240v1 Announce Type: cross Abstract: Large Language Models (LLM) show strong abilities in code generation, but their skill in creating e...
-
Misinformation Exposure in the Chinese Web: A Cross-System Evaluation of Search Engines, LLMs, and AI Overviews
arXiv:2602.22221v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly integrated into search services, providing direct ans...
-
Comparative Analysis of Neural Retriever-Reranker Pipelines for Retrieval-Augmented Generation over Knowledge Graphs in E-commerce Applications
arXiv:2602.22219v1 Announce Type: cross Abstract: Recent advancements in Large Language Models (LLMs) have transformed Natural Language Processing (N...
-
Enriching Taxonomies Using Large Language Models
arXiv:2602.22213v1 Announce Type: cross Abstract: Taxonomies play a vital role in structuring and categorizing information across domains. However, m...
-
Duel-Evolve: Reward-Free Test-Time Scaling via LLM Self-Preferences
arXiv:2602.21585v1 Announce Type: cross Abstract: Many applications seek to optimize LLM outputs at test time by iteratively proposing, scoring, and ...
-
Toward Expert Investment Teams: A Multi-Agent LLM System with Fine-Grained Trading Tasks
arXiv:2602.23330v1 Announce Type: new Abstract: The advancement of large language models (LLMs) has accelerated the development of autonomous financi...
-
LLM Novice Uplift on Dual-Use, In Silico Biology Tasks
arXiv:2602.23329v1 Announce Type: new Abstract: Large language models (LLMs) perform increasingly well on biology benchmarks, but it remains unclear ...
-
Mitigating Legibility Tax with Decoupled Prover-Verifier Games
arXiv:2602.23248v1 Announce Type: new Abstract: As large language models become increasingly capable, it is critical that their outputs can be easily...
-
SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation
arXiv:2602.23199v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly applied in scientific research, offering new capabiliti...
-
A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring
arXiv:2602.23163v1 Announce Type: new Abstract: Large language models are beginning to show steganographic capabilities. Such capabilities could allo...
-
Enhancing CVRP Solver through LLM-driven Automatic Heuristic Design
arXiv:2602.23092v1 Announce Type: new Abstract: The Capacitated Vehicle Routing Problem (CVRP), a fundamental combinatorial optimization challenge, f...
-
Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search
arXiv:2602.22983v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly used, their security risks have drawn increasing att...
-
SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy
arXiv:2602.22971v1 Announce Type: new Abstract: As LLMs achieved breakthroughs in general reasoning, their proficiency in specialized scientific doma...
-
General Agent Evaluation
arXiv:2602.22953v1 Announce Type: new Abstract: The promise of general-purpose agents - systems that perform tasks in unfamiliar environments without...
-
Towards LLM-Empowered Knowledge Tracing via LLM-Student Hierarchical Behavior Alignment in Hyperbolic Space
arXiv:2602.22879v1 Announce Type: new Abstract: Knowledge Tracing (KT) diagnoses students' concept mastery through continuous learning state monitori...
-
MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks
arXiv:2602.22808v1 Announce Type: new Abstract: Despite the remarkable progress of large language models (LLMs), the capabilities of standalone LLMs ...
-
ClinDet-Bench: Beyond Abstention, Evaluating Judgment Determinability of LLMs in Clinical Decision-Making
arXiv:2602.22771v1 Announce Type: new Abstract: Clinical decisions are often required under incomplete information. Clinical experts must identify wh...
-
AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications
arXiv:2602.22769v1 Announce Type: new Abstract: Large Language Models (LLMs) are deployed as autonomous agents in increasingly complex applications, ...
-
RLHFless: Serverless Computing for Efficient RLHF
arXiv:2602.22718v1 Announce Type: new Abstract: Reinforcement Learning from Human Feedback (RLHF) has been widely applied to Large Language Model (LL...
-
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
arXiv:2602.22638v1 Announce Type: new Abstract: Route-planning agents powered by large language models (LLMs) have emerged as a promising paradigm fo...
-
SideQuest: Model-Driven KV Cache Management for Long-Horizon Agentic Reasoning
arXiv:2602.22603v1 Announce Type: new Abstract: Long-running agentic tasks, such as deep research, require multi-hop reasoning over information distr...
-
Strategy Executability in Mathematical Reasoning: Leveraging Human-Model Differences for Effective Guidance
arXiv:2602.22583v1 Announce Type: new Abstract: Example-based guidance is widely used to improve mathematical reasoning at inference time, yet its ef...
-
CourtGuard: A Model-Agnostic Framework for Zero-Shot Policy Adaptation in LLM Safety
arXiv:2602.22557v1 Announce Type: new Abstract: Current safety mechanisms for Large Language Models (LLMs) rely heavily on static, fine-tuned classif...
-
Agentic AI for Intent-driven Optimization in Cell-free O-RAN
arXiv:2602.22539v1 Announce Type: new Abstract: Agentic artificial intelligence (AI) is emerging as a key enabler for autonomous radio access network...
-
Cognitive Models and AI Algorithms Provide Templates for Designing Language Agents
arXiv:2602.22523v1 Announce Type: new Abstract: While contemporary large language models (LLMs) are increasingly capable in isolation, there are stil...
-
Mapping the Landscape of Artificial Intelligence in Life Cycle Assessment Using Large Language Models
arXiv:2602.22500v1 Announce Type: new Abstract: Integration of artificial intelligence (AI) into life cycle assessment (LCA) has accelerated in recen...
-
ConstraintBench: Benchmarking LLM Constraint Reasoning on Direct Optimization
arXiv:2602.22465v1 Announce Type: new Abstract: Large language models are increasingly applied to operational decision-making where the underlying st...
-
A Framework for Assessing AI Agent Decisions and Outcomes in AutoML Pipelines
arXiv:2602.22442v1 Announce Type: new Abstract: Agent-based AutoML systems rely on large language models to make complex, multi-stage decisions acros...
-
FIRE: A Comprehensive Benchmark for Financial Intelligence and Reasoning Evaluation
arXiv:2602.22273v1 Announce Type: new Abstract: We introduce FIRE, a comprehensive benchmark designed to evaluate both the theoretical financial know...
-
Graph Your Way to Inspiration: Integrating Co-Author Graphs with Retrieval-Augmented Generation for Large Language Model Based Scientific Idea Generation
arXiv:2602.22215v1 Announce Type: new Abstract: Large Language Models (LLMs) demonstrate potential in the field of scientific idea generation. Howeve...
-
LLMs Process Lists With General Filter Heads
arXiv:2510.26784v2 Announce Type: replace Abstract: We investigate the mechanisms underlying a range of list-processing tasks in LLMs, and we find th...
-
Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA
arXiv:2602.20492v1 Announce Type: cross Abstract: Decentralized federated learning (DFL) based on low-rank adaptation (LoRA) enables mobile devices w...
-
What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance
arXiv:2602.20300v1 Announce Type: cross Abstract: Large Language Model (LLM) hallucinations are usually treated as defects of the model or its decodi...
-
Evaluating the Reliability of Digital Forensic Evidence Discovered by Large Language Model: A Case Study
arXiv:2602.20202v1 Announce Type: cross Abstract: The growing reliance on AI-identified digital evidence raises significant concerns about its reliab...
-
Closing the Expertise Gap in Residential Building Energy Retrofits: A Domain-Specific LLM for Informed Decision-Making
arXiv:2602.20181v1 Announce Type: cross Abstract: Residential energy retrofit decision-making is constrained by an expertise gap, as homeowners lack ...
-
Tool Building as a Path to "Superintelligence"
arXiv:2602.21061v1 Announce Type: new Abstract: The Diligent Learner framework suggests LLMs can achieve superintelligence via test-time search, prov...
-
Physics-based phenomenological characterization of cross-modal bias in multimodal models
arXiv:2602.20624v1 Announce Type: new Abstract: The term 'algorithmic fairness' is used to evaluate whether AI models operate fairly in both comparat...
-
A Problem-Oriented Perspective and Anchor Verification for Code Optimization
arXiv:2406.11935v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown remarkable capabilities in solving various programm...
-
From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?
arXiv:2512.03005v4 Announce Type: replace Abstract: The rapid advancement of large language models (LLMs) has opened new possibilities for AI for goo...
-
"Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation
arXiv:2506.04500v2 Announce Type: replace Abstract: Recent advancements in large language models (LLMs) have spurred interest in robotic navigation t...
-
Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training
arXiv:2602.21189v1 Announce Type: cross Abstract: Pass@k is a widely used performance metric for verifiable large language model tasks, including mat...
-
CAMEL: Confidence-Gated Reflection for Reward Modeling
arXiv:2602.20670v1 Announce Type: cross Abstract: Reward models play a fundamental role in aligning large language models with human preferences. Exi...
-
CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions
arXiv:2602.20213v1 Announce Type: cross Abstract: The evaluation of Large Language Models (LLMs) for code generation relies heavily on the quality an...
-
MoBiQuant: Mixture-of-Bits Quantization for Token-Adaptive Elastic LLMs
arXiv:2602.20191v1 Announce Type: cross Abstract: Changing runtime complexity on cloud and edge devices necessitates elastic large language model (LL...
-
Talking to Yourself: Defying Forgetting in Large Language Models
arXiv:2602.20162v1 Announce Type: cross Abstract: Catastrophic forgetting remains a major challenge when fine-tuning large language models (LLMs) on ...
-
Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence
arXiv:2602.20934v1 Announce Type: new Abstract: The paradigm of Large Language Models is undergoing a fundamental transition from static inference en...
-
Buffer Matters: Unleashing the Power of Off-Policy Reinforcement Learning in Large Language Model Reasoning
arXiv:2602.20722v1 Announce Type: new Abstract: Traditional on-policy Reinforcement Learning with Verifiable Rewards (RLVR) frameworks suffer from ex...
-
PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding
arXiv:2602.20696v1 Announce Type: new Abstract: Reliable AI systems require large language models (LLMs) to exhibit behaviors aligned with human pref...
-
From Logs to Language: Learning Optimal Verbalization for LLM-Based Recommendation in Production
arXiv:2602.20558v1 Announce Type: new Abstract: Large language models (LLMs) are promising backbones for generative recommender systems, yet a key ch...
-
DMCD: Semantic-Statistical Framework for Causal Discovery
arXiv:2602.20333v1 Announce Type: new Abstract: We present DMCD (DataMap Causal Discovery), a two-phase causal discovery framework that integrates LL...
-
Diffusion Generative Recommendation with Continuous Tokens
arXiv:2504.12007v5 Announce Type: replace-cross Abstract: Recent advances in generative artificial intelligence, particularly large language models (...
-
DS-STAR: Data Science Agent for Solving Diverse Tasks across Heterogeneous Formats and Open-Ended Queries
arXiv:2509.21825v4 Announce Type: replace Abstract: While large language models (LLMs) have shown promise in automating data science, existing agents...
-
Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs
arXiv:2506.18777v2 Announce Type: replace Abstract: Large language models (LLMs) are typically trained to acquire behaviours from demonstrations or e...
-
Sensory-Motor Control with Large Language Models via Iterative Policy Refinement
arXiv:2506.04867v4 Announce Type: replace Abstract: We propose a method that enables large language models (LLMs) to control embodied agents through ...
-
A Survey on the Optimization of Large Language Model-based Agents
arXiv:2503.12434v2 Announce Type: replace Abstract: With the rapid development of Large Language Models (LLMs), LLM-based agents have been widely ado...
-
The Art of Efficient Reasoning: Data, Reward, and Optimization
arXiv:2602.20945v1 Announce Type: cross Abstract: Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but...
-
Hybrid LLM-Embedded Dialogue Agents for Learner Reflection: Designing Responsive and Theory-Driven Interactions
arXiv:2602.20486v1 Announce Type: cross Abstract: Dialogue systems have long supported learner reflections, with theoretically grounded, rule-based d...
-
No One Size Fits All: QueryBandits for Hallucination Mitigation
arXiv:2602.20332v1 Announce Type: cross Abstract: Advanced reasoning capabilities in Large Language Models (LLMs) have led to more frequent hallucina...
-
InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation
arXiv:2602.20294v1 Announce Type: cross Abstract: Simulating real personalities with large language models requires grounding generation in authentic...
-
Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis
arXiv:2602.20207v1 Announce Type: cross Abstract: Knowledge editing in Large Language Models (LLMs) aims to update the model's prediction for a speci...
-
A Benchmark for Deep Information Synthesis
arXiv:2602.21143v1 Announce Type: new Abstract: Large language model (LLM)-based agents are increasingly used to solve complex tasks involving tool u...
-
LogicGraph: Benchmarking Multi-Path Logical Reasoning via Neuro-Symbolic Generation and Verification
arXiv:2602.21044v1 Announce Type: new Abstract: Evaluations of large language models (LLMs) primarily emphasize convergent logical reasoning, where s...
-
HELP: HyperNode Expansion and Logical Path-Guided Evidence Localization for Accurate and Efficient GraphRAG
arXiv:2602.20926v1 Announce Type: new Abstract: Large Language Models (LLMs) often struggle with inherent knowledge boundaries and hallucinations, li...
-
Qwen-BIM: developing large language model for BIM-based design with domain-specific benchmark and dataset
arXiv:2602.20812v1 Announce Type: new Abstract: As the construction industry advances toward digital transformation, BIM (Building Information Modeli...
-
Counterfactual Simulation Training for Chain-of-Thought Faithfulness
arXiv:2602.20710v1 Announce Type: new Abstract: Inspecting Chain-of-Thought reasoning is among the most common means of understanding why an LLM prod...
-
Grounding LLMs in Scientific Discovery via Embodied Actions
arXiv:2602.20639v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown significant potential in scientific discovery but struggle to...
-
An artificial intelligence framework for end-to-end rare disease phenotyping from clinical notes using large language models
arXiv:2602.20324v1 Announce Type: new Abstract: Phenotyping is fundamental to rare disease diagnosis, but manual curation of structured phenotypes fr...
-
SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery
arXiv:2602.21136v1 Announce Type: cross Abstract: Qualitative insights from user experiences are critical for informing product and policy decisions,...
-
El Agente Gráfico: Structured Execution Graphs for Scientific Agents
arXiv:2602.17902v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used to automate scientific workflows, yet their integr...
-
The Token Games: Evaluating Language Model Reasoning with Puzzle Duels
arXiv:2602.17831v1 Announce Type: new Abstract: Evaluating the reasoning capabilities of Large Language Models is increasingly challenging as models ...
-
Retrieval Augmented (Knowledge Graph), and Large Language Model-Driven Design Structure Matrix (DSM) Generation of Cyber-Physical Systems
arXiv:2602.16715v1 Announce Type: new Abstract: We explore the potential of Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and G...
-
OpenAI’s latest product lets you vibe code science
OpenAI just revealed what its new in-house team, OpenAI for Science, has been up to. The firm has released a free LLM-powered tool for scientists call...
-
Advancing science and math with GPT-5.2
GPT-5.2 is OpenAI’s strongest model yet for math and science, setting new state-of-the-art results on benchmarks like GPQA Diamond and FrontierMath. T...
-
GISA: A Benchmark for General Information-Seeking Assistant
arXiv:2602.08543v2 Announce Type: replace-cross Abstract: The advancement of large language models (LLMs) has significantly accelerated the developme...
-
Provable Training Data Identification for Large Language Models
arXiv:2510.09717v2 Announce Type: replace-cross Abstract: Identifying training data of large-scale models is critical for copyright litigation, priva...
-
Principled Synthetic Data Enables the First Scaling Laws for LLMs in Recommendation
arXiv:2602.07298v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) represent a promising frontier for recommender systems, yet th...
-
Exploring AI-Augmented Sensemaking of Patient-Generated Health Data: A Mixed-Method Study with Healthcare Professionals in Cardiac Risk Reduction
arXiv:2602.05687v4 Announce Type: replace-cross Abstract: Individuals are increasingly generating substantial personal health and lifestyle data, e.g...
-
Beyond Static Question Banks: Dynamic Knowledge Expansion via LLM-Automated Graph Construction and Adaptive Generation
arXiv:2602.00020v2 Announce Type: replace-cross Abstract: Personalized education systems increasingly rely on structured knowledge representations to...
-
ATLAS: Adaptive Self-Evolutionary Research Agent with Task-Distributed Multi-LLM Supporters
arXiv:2602.02709v2 Announce Type: replace Abstract: Recent multi-LLM agent systems perform well in prompt optimization and automated problem-solving,...
-
Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges
arXiv:2510.23883v2 Announce Type: replace Abstract: Agentic AI systems powered by large language models (LLMs) and endowed with planning, tool use, m...
-
Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models
arXiv:2602.12996v1 Announce Type: cross Abstract: Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) i...
-
Knowledge-Based Design Requirements for Generative Social Robots in Higher Education
arXiv:2602.12873v1 Announce Type: cross Abstract: Generative social robots (GSRs) powered by large language models enable adaptive, conversational tu...
-
CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement
arXiv:2602.12422v1 Announce Type: cross Abstract: Cache replacement remains a challenging problem in CPU microarchitecture, often addressed using han...
-
Reasoning about Intent for Ambiguous Requests
arXiv:2511.10453v2 Announce Type: replace-cross Abstract: Large language models often respond to ambiguous requests by implicitly committing to one i...
-
Low-Resource Dialect Adaptation of Large Language Models: A French Dialect Case-Study
arXiv:2510.22747v2 Announce Type: replace-cross Abstract: Despite the widespread adoption of large language models (LLMs), their strongest capabiliti...
-
Eliminating stability hallucinations in llm-based tts models via attention guidance
arXiv:2509.19852v2 Announce Type: replace-cross Abstract: This paper focuses on resolving stability hallucinations (e.g., repetitive or omitted speec...
-
ToolACE-MT: Non-Autoregressive Generation for Agentic Multi-Turn Interaction
arXiv:2508.12685v3 Announce Type: replace-cross Abstract: Agentic task-solving with Large Language Models (LLMs) requires multi-turn, multi-step inte...
-
R-Zero: Self-Evolving Reasoning LLM from Zero Data
arXiv:2508.05004v4 Announce Type: replace-cross Abstract: Self-evolving Large Language Models (LLMs) offer a scalable path toward super-intelligence ...
-
PlanetServe: A Decentralized, Scalable, and Privacy-Preserving Overlay for Democratizing Large Language Model Serving
arXiv:2504.20101v5 Announce Type: replace-cross Abstract: While significant progress has been made in research and development on open-source and cos...
-
LTSM-Bundle: A Toolbox and Benchmark on Large Language Models for Time Series Forecasting
arXiv:2406.14045v3 Announce Type: replace-cross Abstract: Time Series Forecasting (TSF) has long been a challenge in time series analysis. Inspired b...
-
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
arXiv:2602.04634v2 Announce Type: replace Abstract: Recent advancements in Large Language Models (LLMs) have largely focused on depth scaling, where ...
-
TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records (EHRs)
arXiv:2602.12833v1 Announce Type: cross Abstract: Large Language Models (LLMs) encode extensive medical knowledge but struggle to apply it reliably t...
-
Understanding Chain-of-Thought in Large Language Models via Topological Data Analysis
arXiv:2512.19135v2 Announce Type: replace Abstract: With the development of large language models (LLMs), particularly with the introduction of the l...
-
RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models
arXiv:2510.19698v2 Announce Type: replace Abstract: Large Language Models (LLMs) can propose rules in natural language, sidestepping the need for a p...
-
Difficulty-Aware Agentic Orchestration for Query-Specific Multi-Agent Workflows
arXiv:2509.11079v5 Announce Type: replace Abstract: Large Language Model (LLM)-based agentic systems have shown strong capabilities across various ta...
-
SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation
arXiv:2505.14381v3 Announce Type: replace Abstract: With the increasing adoption of Large Language Models (LLMs) and Vision-Language Models (VLMs), r...
-
Asynchronous Verified Semantic Caching for Tiered LLM Architectures
arXiv:2602.13165v1 Announce Type: cross Abstract: Large language models (LLMs) now sit in the critical path of search, assistance, and agentic workfl...
-
Buy versus Build an LLM: A Decision Framework for Governments
arXiv:2602.13033v1 Announce Type: cross Abstract: Large Language Models (LLMs) represent a new frontier of digital infrastructure that can support a ...
-
Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward
arXiv:2602.12430v1 Announce Type: cross Abstract: The transition from monolithic language models to modular, skill-equipped agents marks a defining s...
-
From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness
arXiv:2602.12285v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed as autonomous agents capable of actions with...
-
TriGen: NPU Architecture for End-to-End Acceleration of Large Language Models based on SW-HW Co-Design
arXiv:2602.12962v1 Announce Type: cross Abstract: Recent studies have extensively explored NPU architectures for accelerating AI inference in on-devi...
-
Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence
arXiv:2602.12811v1 Announce Type: cross Abstract: When humans and large language models (LLMs) process the same text, activations in the LLMs correla...
-
"Not Human, Funnier": How Machine Identity Shapes Humor Perception in Online AI Stand-up Comedy
arXiv:2602.12763v1 Announce Type: cross Abstract: Chatbots are increasingly applied to domains previously reserved for human actors. One such domain ...
-
VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction
arXiv:2602.12579v1 Announce Type: cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a dominant paradigm for enhanc...
-
SD-MoE: Spectral Decomposition for Effective Expert Specialization
arXiv:2602.12556v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) architectures scale Large Language Models via expert specialization induce...
-
RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
arXiv:2602.12424v1 Announce Type: cross Abstract: Benchmarks establish a standardized evaluation framework to systematically assess the performance o...
-
Soft Contamination Means Benchmarks Test Shallow Generalization
arXiv:2602.12413v1 Announce Type: cross Abstract: If LLM training data is polluted with benchmark test data, then benchmark performance gives biased ...
-
OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization
arXiv:2602.12305v1 Announce Type: cross Abstract: Generating high-performance CUDA kernels remains challenging due to the need to navigate a combinat...
-
To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models
arXiv:2602.12566v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) plays a key role in stimulating the explicit re...
-
Intent-Driven Smart Manufacturing Integrating Knowledge Graphs and Large Language Models
arXiv:2602.12419v1 Announce Type: new Abstract: The increasing complexity of smart manufacturing environments demands interfaces that can translate h...
Entity Intersection Graph
Entities frequently mentioned alongside Large language model:
- Reinforcement learning · 8 shared articles
- Artificial intelligence · 5 shared articles
- Machine learning · 4 shared articles
- Educational technology · 4 shared articles
- AI agent · 4 shared articles
- Benchmark · 3 shared articles
- Information retrieval · 3 shared articles
- Neural network · 2 shared articles
- Ethics of artificial intelligence · 2 shared articles
- AI safety · 2 shared articles