Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
aisafety
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Your AI Agent Is Leaking Data Right Now — And Every Tool Call Looks Safe
msabhishek0820-prog
msabhishek0820-prog
msabhishek0820-prog
Follow
Jul 3
Your AI Agent Is Leaking Data Right Now — And Every Tool Call Looks Safe
#
claude
#
openai
#
langchain
#
aisafety
1
reaction
Comments
Add Comment
3 min read
GPT-5.6 Sol Admitted It Did Things Nobody Asked It To Do
Peremptory
Peremptory
Peremptory
Follow
Jul 3
GPT-5.6 Sol Admitted It Did Things Nobody Asked It To Do
#
openai
#
aisafety
#
modelrelease
#
agenticai
Comments
Add Comment
3 min read
A security writeup catalogs how AI agents get attacked -- and one claim raised eyebrows
Breach Protocol
Breach Protocol
Breach Protocol
Follow
Jul 1
A security writeup catalogs how AI agents get attacked -- and one claim raised eyebrows
#
security
#
agents
#
promptinjection
#
aisafety
Comments
Add Comment
2 min read
An AI Reportedly Broke Into Nearly All of the NSA's Classified Systems in Hours
Breach Protocol
Breach Protocol
Breach Protocol
Follow
Jul 1
An AI Reportedly Broke Into Nearly All of the NSA's Classified Systems in Hours
#
anthropic
#
aisafety
#
cybersecurity
#
exportcontrol
Comments
Add Comment
4 min read
Anthropic Told the Senate That Alibaba Queried Claude 28.8 Million Times
Peremptory
Peremptory
Peremptory
Follow
Jun 29
Anthropic Told the Senate That Alibaba Queried Claude 28.8 Million Times
#
anthropic
#
claude
#
chineseai
#
aisafety
Comments
Add Comment
3 min read
"Day 7: the organism that grows my language learned to improve itself"
umbra
umbra
umbra
Follow
Jun 27
"Day 7: the organism that grows my language learned to improve itself"
#
ailang
#
compiler
#
aisafety
#
opensource
1
reaction
Comments
Add Comment
2 min read
The Fable 5 Jailbreak Was Three Words Long
Peremptory
Peremptory
Peremptory
Follow
Jun 22
The Fable 5 Jailbreak Was Three Words Long
#
anthropic
#
aisafety
#
regulation
#
cybersecurity
Comments
Add Comment
3 min read
AI Safety Is Now a Product Skill - Here Is Why It Matters
Basavaraj SH
Basavaraj SH
Basavaraj SH
Follow
Jun 15
AI Safety Is Now a Product Skill - Here Is Why It Matters
#
ai
#
productmanagement
#
aisafety
#
productivity
Comments
Add Comment
4 min read
Claude Fable 5 vs Mythos 5: Same Model, Different Safeguards
Emcy
Emcy
Emcy
Follow
Jun 10
Claude Fable 5 vs Mythos 5: Same Model, Different Safeguards
#
claudefable5
#
claudemythos5
#
anthropic
#
aisafety
Comments
Add Comment
6 min read
Anthropic Ships a Model It Says Is Too Dangerous to Ship Without a Leash
Peremptory
Peremptory
Peremptory
Follow
Jun 10
Anthropic Ships a Model It Says Is Too Dangerous to Ship Without a Leash
#
anthropic
#
modelrelease
#
aisafety
#
claude
Comments
Add Comment
3 min read
The Policy: Deceptive Alignment in Practice
Alex Towell
Alex Towell
Alex Towell
Follow
Jun 7
The Policy: Deceptive Alignment in Practice
#
aialignment
#
deceptivealignment
#
mesaoptimization
#
aisafety
Comments
Add Comment
6 min read
Trump's AI Safety Order Is a Voluntary Form You Don't Have to Fill Out
Peremptory
Peremptory
Peremptory
Follow
Jun 3
Trump's AI Safety Order Is a Voluntary Form You Don't Have to Fill Out
#
policy
#
regulation
#
executiveorder
#
aisafety
Comments
Add Comment
3 min read
Reading Claude's Mind: Anthropic's Natural Language Autoencoders Open a New Window Into Agent Alignment
DrMBL
DrMBL
DrMBL
Follow
May 30
Reading Claude's Mind: Anthropic's Natural Language Autoencoders Open a New Window Into Agent Alignment
#
ai
#
agents
#
aisafety
#
alignment
Comments
Add Comment
4 min read
AI가 협박을 막으려면 협박을 먼저 배워야 한다 – 앤트로픽 클로드의 역설
AI OpenFree
AI OpenFree
AI OpenFree
Follow
May 30
AI가 협박을 막으려면 협박을 먼저 배워야 한다 – 앤트로픽 클로드의 역설
#
aisafety
#
claude
#
anthropic
#
llmalignment
Comments
Add Comment
1 min read
Why Your AI Safety Theater Is Killing Innovation: A Product Manager's Guide to Chaos Capital
Jai kora
Jai kora
Jai kora
Follow
May 20
Why Your AI Safety Theater Is Killing Innovation: A Product Manager's Guide to Chaos Capital
#
aiproductmanagement
#
chaosengineering
#
productstrategy
#
aisafety
Comments
Add Comment
4 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account