#Natural Language Processing
Latest news articles tagged with "Natural Language Processing". Follow the timeline of events, related topics, and entities.
Articles (30)
-
๐บ๐ธ Deep Sequence Modeling with Quantum Dynamics: Language as a Wave Function
[USA]
arXiv:2602.22255v1 Announce Type: cross Abstract: We introduce a sequence modeling framework in which the latent state is a complex-valued wave function evolving on a finite-dimensional Hilbert space...
Related: #Quantum Computing, #Machine Learning, #Theoretical Computer Science -
๐บ๐ธ Decoder-based Sense Knowledge Distillation
[USA]
arXiv:2602.22351v1 Announce Type: cross Abstract: Large language models (LLMs) learn contextual embeddings that capture rich semantic information, yet they often overlook structured lexical knowledge...
Related: #AI Research, #Knowledge Distillation -
๐บ๐ธ Automating the Detection of Requirement Dependencies Using Large Language Models
[USA]
arXiv:2602.22456v1 Announce Type: cross Abstract: Requirements are inherently interconnected through various types of dependencies. Identifying these dependencies is essential, as they underpin criti...
Related: #Software Engineering, #Artificial Intelligence -
๐บ๐ธ Learning to Rewrite Tool Descriptions for Reliable LLM-Agent Tool Use
[USA]
arXiv:2602.20426v1 Announce Type: new Abstract: The performance of LLM-based agents depends not only on the agent itself but also on the quality of the tool interfaces it consumes. While prior work h...
Related: #Artificial Intelligence, #Machine Learning -
๐บ๐ธ An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction
[USA]
arXiv:2602.20219v1 Announce Type: cross Abstract: Interpreting human intent accurately is a central challenge in human-robot interaction (HRI) and a key requirement for achieving more natural and int...
Related: #Human-Robot Interaction, #Multimodal AI Systems, #Robotics Technology -
๐บ๐ธ InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation
[USA]
arXiv:2602.20294v1 Announce Type: cross Abstract: Simulating real personalities with large language models requires grounding generation in authentic personal data. Existing evaluation approaches rel...
Related: #Artificial Intelligence, #Personality Simulation, #Evaluation Frameworks -
๐บ๐ธ No One Size Fits All: QueryBandits for Hallucination Mitigation
[USA]
arXiv:2602.20332v1 Announce Type: cross Abstract: Advanced reasoning capabilities in Large Language Models (LLMs) have led to more frequent hallucinations; yet most mitigation work focuses on open-so...
Related: #AI Safety, #Machine Learning -
๐บ๐ธ Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA
[USA]
arXiv:2602.20492v1 Announce Type: cross Abstract: Decentralized federated learning (DFL) based on low-rank adaptation (LoRA) enables mobile devices with multi-task datasets to collaboratively fine-tu...
Related: #Machine Learning, #Federated Learning, #Wireless Technology -
๐บ๐ธ CAMEL: Confidence-Gated Reflection for Reward Modeling
[USA]
arXiv:2602.20670v1 Announce Type: cross Abstract: Reward models play a fundamental role in aligning large language models with human preferences. Existing methods predominantly follow two paradigms: ...
Related: #Artificial Intelligence, #Machine Learning -
๐บ๐ธ AutoEDA: Enabling EDA Flow Automation through Microservice-Based LLM Agents
[USA]
arXiv:2508.01012v2 Announce Type: replace Abstract: Electronic Design Automation (EDA) remains heavily reliant on tool command language (Tcl) scripting to drive complex RTL-to-GDSII flows. This scrip...
Related: #Artificial Intelligence, #Electronic Design Automation, #Microservices Architecture -
๐บ๐ธ Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs
[USA]
arXiv:2506.18777v2 Announce Type: replace Abstract: Large language models (LLMs) are typically trained to acquire behaviours from demonstrations or experience, yet much of their training data is decl...
Related: #Machine Learning, #AI Training Efficiency -
๐บ๐ธ PVminer: A Domain-Specific Tool to Detect the Patient Voice in Patient Generated Data
[USA]
arXiv:2602.21165v1 Announce Type: cross Abstract: Patient-generated text such as secure messages, surveys, and interviews contains rich expressions of the patient voice (PV), reflecting communicative...
Related: #Healthcare Technology, #Patient-Centered Care -
๐บ๐ธ Augmenting Lateral Thinking in Language Models with Humor and Riddle Data for the BRAINTEASER Task
[USA]
arXiv:2405.10385v3 Announce Type: replace-cross Abstract: The SemEval 2024 BRAINTEASER task challenges language models to perform lateral thinking -- a form of creative, non-linear reasoning that rem...
Related: #Artificial Intelligence, #Cognitive Computing -
๐บ๐ธ From Logs to Language: Learning Optimal Verbalization for LLM-Based Recommendation in Production
[USA]
arXiv:2602.20558v1 Announce Type: new Abstract: Large language models (LLMs) are promising backbones for generative recommender systems, yet a key challenge remains underexplored: verbalization, i.e....
Related: #Artificial Intelligence, #Recommendation Systems -
๐บ๐ธ Talking to Yourself: Defying Forgetting in Large Language Models
[USA]
arXiv:2602.20162v1 Announce Type: cross Abstract: Catastrophic forgetting remains a major challenge when fine-tuning large language models (LLMs) on narrow, task-specific data, often degrading their ...
Related: #Artificial Intelligence, #Machine Learning -
๐บ๐ธ "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation
[USA]
arXiv:2506.04500v2 Announce Type: replace Abstract: Recent advancements in large language models (LLMs) have spurred interest in robotic navigation that incorporates complex spatial, mathematical, an...
Related: #Artificial Intelligence, #Robotics -
๐บ๐ธ Bridging Gaps in Natural Language Processing for Yor\`ub\'a: A Systematic Review of a Decade of Progress and Prospects
[USA]
arXiv:2502.17364v2 Announce Type: replace-cross Abstract: Natural Language Processing (NLP) is becoming a dominant subset of artificial intelligence as the need to help machines understand human lang...
Related: #African Languages, #Linguistic Resources, #Technological Inclusion -
๐บ๐ธ Sink-Aware Pruning for Diffusion Language Models
[USA]
arXiv:2602.17664v1 Announce Type: cross Abstract: Diffusion Language Models (DLMs) incur high inference cost due to iterative denoising, motivating efficient pruning. Existing pruning heuristics larg...
Related: #Machine Learning, #Model Compression, #Diffusion Models, #Inference Efficiency -
๐บ๐ธ Evaluating Monolingual and Multilingual Large Language Models for Greek Question Answering: The DemosQA Benchmark
[USA]
arXiv:2602.16811v1 Announce Type: cross Abstract: Recent advancements in Natural Language Processing and Deep Learning have enabled the development of Large Language Models (LLMs), which have signifi...
Related: #Large Language Model Evaluation, #Underโresourced Language Research, #Greek Question Answering, #Dataset Creation -
๐บ๐ธ Exploring LLMs for User Story Extraction from Mockups
[USA]
arXiv:2602.16997v1 Announce Type: cross Abstract: User stories are one of the most widely used artifacts in the software industry to define functional requirements. In parallel, the use of high-fidel...
Related: #Artificial Intelligence in Software Engineering, #Requirements Engineering, #Agile Methods and Automation, #HumanโComputer Interaction with Mockups -
๐บ๐ธ Visual Model Checking: Graph-Based Inference of Visual Routines for Image Retrieval
[USA]
arXiv:2602.17386v1 Announce Type: new Abstract: Information retrieval lies at the foundation of the modern digital industry. While natural language search has seen dramatic progress in recent years l...
Related: #Artificial Intelligence, #Information Retrieval, #Formal Verification, #Graph-Based Methods -
๐บ๐ธ Enhancing Large Language Models (LLMs) for Telecom using Dynamic Knowledge Graphs and Explainable Retrieval-Augmented Generation
[USA]
arXiv:2602.17529v1 Announce Type: new Abstract: Large language models (LLMs) have shown strong potential across a variety of tasks, but their application in the telecom field remains challenging due ...
Related: #Artificial Intelligence, #Telecom Engineering, #Knowledge Graphs, #RetrievalโAugmented Generation -
๐บ๐ธ CLEF HIPE-2026: Evaluating Accurate and Efficient Person-Place Relation Extraction from Multilingual Historical Texts
[USA]
arXiv:2602.17663v1 Announce Type: new Abstract: HIPE-2026 is a CLEF evaluation lab dedicated to person-place relation extraction from noisy, multilingual historical texts. Building on the HIPE-2020 a...
Related: #Artificial Intelligence, #Digital Humanities, #Historical Text Processing, #Evaluation Benchmarks -
๐บ๐ธ Explainable AI: Context-Aware Layer-Wise Integrated Gradients for Explaining Transformer Models
[USA]
arXiv:2602.16608v1 Announce Type: cross Abstract: Transformer models achieve state-of-the-art performance across domains and tasks, yet their deeply layered representations make their predictions dif...
Related: #Explainable AI, #Deep Learning Interpretability, #Transformer Architecture, #Integrated Gradients -
๐บ๐ธ Lossless Vocabulary Reduction for Auto-Regressive Language Models
[USA]
arXiv:2510.08102v2 Announce Type: replace-cross Abstract: Tokenization -- the process of decomposing a given text into a sequence of subwords called tokens -- is one of the key components in the deve...
Related: #Tokenization Strategies, #Vocabulary Engineering, #AutoโRegressive Language Models, #Text Generation Efficiency -
๐บ๐ธ Fast and Effective On-policy Distillation from Reasoning Prefixes
[USA]
arXiv:2602.15260v1 Announce Type: cross Abstract: On-policy distillation (OPD), which samples trajectories from the student model and supervises them with a teacher at the token level, avoids relying...
Related: #Machine Learning, #Reinforcement Learning, #Model Distillation -
๐บ๐ธ Improving MLLMs in Embodied Exploration and Question Answering with Human-Inspired Memory Modeling
[USA]
arXiv:2602.15513v1 Announce Type: cross Abstract: Deploying Multimodal Large Language Models as the brain of embodied agents remains challenging, particularly under long-horizon observations and limi...
Related: #Multimodal AI, #Embodied Agents, #Memory Modeling, #Computer Vision -
๐บ๐ธ CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing
[USA]
arXiv:2602.15823v1 Announce Type: cross Abstract: A central challenge in large language model (LLM) editing is capability preservation: methods that successfully change targeted behavior can quietly ...
Related: #Artificial Intelligence Research, #Model Editing, #Ethics of AI -
๐บ๐ธ Differentiating Between Human-Written and AI-Generated Texts Using Automatically Extracted Linguistic Features
[USA]
arXiv:2407.03646v4 Announce Type: replace-cross Abstract: While extensive research has focused on ChatGPT in recent years, very few studies have systematically quantified and compared linguistic feat...
Related: #AI vs Human Text Analysis, #Quantitative Linguistics, #Machine Learning Model Evaluation, #Language Modeling & Detection -
๐บ๐ธ Unforgeable Watermarks for Language Models via Robust Signatures
[USA]
arXiv:2602.15323v1 Announce Type: cross Abstract: Language models now routinely produce text that is difficult to distinguish from human writing, raising the need for robust tools to verify content p...
Related: #Artificial Intelligence, #Machine Learning, #Security, #Integrity and Trust