#LLM Vulnerabilities

Latest news articles tagged with "LLM Vulnerabilities". Follow the timeline of events, related topics, and entities.

Articles (6)

🇺🇸 The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail? — 09/04/2026 [USA]
arXiv:2604.06436v1 Announce Type: cross Abstract: We prove that no continuous, utility-preserving wrapper defense-a function $D: X\to X$ that preprocesses inputs before the model sees them-can make a...
Related: #AI Security, #Theoretical Computer Science
🇺🇸 Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models — 18/03/2026 [USA]
arXiv:2603.16342v1 Announce Type: cross Abstract: The proliferation of large-scale IoT networks has been both a blessing and a curse. Not only has it revolutionized the way organizations operate by i...
Related: #AI Security
🇺🇸 Targeted Bit-Flip Attacks on LLM-Based Agents — 12/03/2026 [USA]
arXiv:2603.10042v1 Announce Type: cross Abstract: Targeted bit-flip attacks (BFAs) exploit hardware faults to manipulate model parameters, posing a significant security threat. While prior work targe...
Related: #AI Security
🇺🇸 Multi-Stream Perturbation Attack: Breaking Safety Alignment of Thinking LLMs Through Concurrent Task Interference — 12/03/2026 [USA]
arXiv:2603.10091v1 Announce Type: cross Abstract: The widespread adoption of thinking mode in large language models (LLMs) has significantly enhanced complex task processing capabilities while introd...
Related: #AI Security
🇺🇸 The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs — 10/03/2026 [USA]
arXiv:2603.08234v1 Announce Type: new Abstract: With the rapid advancement of large language models (LLMs), the safety of LLMs has become a critical concern. Despite significant efforts in safety ali...
Related: #AI Safety
🇺🇸 Depth Charge: Jailbreak Large Language Models from Deep Safety Attention Heads — 09/03/2026 [USA]
arXiv:2603.05772v1 Announce Type: cross Abstract: Currently, open-sourced large language models (OSLLMs) have demonstrated remarkable generative performance. However, as their structure and weights a...
Related: #AI Safety

Key Entities (2)

Depth charge (1 news)
Large language model (1 news)

About the topic: LLM Vulnerabilities

The topic "LLM Vulnerabilities" aggregates 6+ news articles from various countries.