#LLM Vulnerabilities
Latest news articles tagged with "LLM Vulnerabilities". Follow the timeline of events, related topics, and entities.
Articles (6)
-
πΊπΈ The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?
[USA]
arXiv:2604.06436v1 Announce Type: cross Abstract: We prove that no continuous, utility-preserving wrapper defense-a function $D: X\to X$ that preprocesses inputs before the model sees them-can make a...
Related: #AI Security, #Theoretical Computer Science -
πΊπΈ Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models
[USA]
arXiv:2603.16342v1 Announce Type: cross Abstract: The proliferation of large-scale IoT networks has been both a blessing and a curse. Not only has it revolutionized the way organizations operate by i...
Related: #AI Security -
πΊπΈ Targeted Bit-Flip Attacks on LLM-Based Agents
[USA]
arXiv:2603.10042v1 Announce Type: cross Abstract: Targeted bit-flip attacks (BFAs) exploit hardware faults to manipulate model parameters, posing a significant security threat. While prior work targe...
Related: #AI Security -
πΊπΈ Multi-Stream Perturbation Attack: Breaking Safety Alignment of Thinking LLMs Through Concurrent Task Interference
[USA]
arXiv:2603.10091v1 Announce Type: cross Abstract: The widespread adoption of thinking mode in large language models (LLMs) has significantly enhanced complex task processing capabilities while introd...
Related: #AI Security -
πΊπΈ The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs
[USA]
arXiv:2603.08234v1 Announce Type: new Abstract: With the rapid advancement of large language models (LLMs), the safety of LLMs has become a critical concern. Despite significant efforts in safety ali...
Related: #AI Safety -
πΊπΈ Depth Charge: Jailbreak Large Language Models from Deep Safety Attention Heads
[USA]
arXiv:2603.05772v1 Announce Type: cross Abstract: Currently, open-sourced large language models (OSLLMs) have demonstrated remarkable generative performance. However, as their structure and weights a...
Related: #AI Safety
Key Entities (2)
- Depth charge (1 news)
- Large language model (1 news)
About the topic: LLM Vulnerabilities
The topic "LLM Vulnerabilities" aggregates 6+ news articles from various countries.