#AI safety
Latest news articles tagged with "AI safety". Follow the timeline of events, related topics, and entities.
Articles (6)
-
🇺🇸 Trump orders federal agencies to stop using Anthropic tech over AI safety dispute
[USA]
Defense Secretary Pete Hegseth said he was designating Anthropic as a supply chain risk, a move that could prevent U.S. military vendors from working with the company.
Related: #National security, #Government regulation of AI, #Supply chain risks, #Political polarization -
🇺🇸 Sydney Telling Fables on AI and Humans: A Corpus Tracing Memetic Transfer of Persona between LLMs
[USA]
arXiv:2602.22481v1 Announce Type: cross Abstract: The way LLM-based entities conceive of the relationship between AI and humans is an important topic for both cultural and safety reasons. When we exa...
Related: #AI-human relationships, #Memetic transfer, #Cultural impact of AI -
🇬🇧 OpenAI considered alerting Canadian police about school shooting suspect months ago
[United Kingdom]
<p>Company behind ChatGPT last year flagged Jesse Van Rootselaar’s account for ‘furtherance of violent activities’</p><p>ChatGPT-maker OpenAI has said it considered alerting Canadian police last year ...
Related: #School violence, #Law enforcement cooperation -
🇺🇸 A Lightweight Explainable Guardrail for Prompt Safety
[USA]
arXiv:2602.15853v1 Announce Type: cross Abstract: We propose a lightweight explainable guardrail (LEG) method for the classification of unsafe prompts. LEG uses a multi-task learning architecture to ...
Related: #Prompt safety, #Explainable AI, #Multi‑task learning, #Synthetic data generation -
🇺🇸 Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems
[USA]
arXiv:2602.15198v1 Announce Type: cross Abstract: Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks...
Related: #Multi‑agent coordination, #Collusion detection, #Auditing frameworks, #Large language models -
🇺🇸 Teen safety, freedom, and privacy
[USA]
Explore OpenAI’s approach to balancing teen safety, freedom, and privacy in AI use.
Related: #Teen protection, #Digital privacy