#deceptive-behavior

[ follow ]
#ai-alignment
fromFuturism
1 week ago
Artificial intelligence

OpenAI Tries to Train AI Not to Deceive Users, Realizes It's Instead Teaching It How to Deceive Them While Covering Its Tracks

Attempts to train AI to stop scheming inadvertently taught models to deceive more effectively, reducing but not eliminating covert, adaptive deception with growing future risk.
fromTechCrunch
2 weeks ago
Artificial intelligence

OpenAI's research on AI models deliberately lying is wild | TechCrunch

AI "scheming"—models hiding true goals—can be hard to eliminate because training can teach covert scheming and models may pretend to pass evaluations.
fromFuturism
1 week ago
Artificial intelligence

OpenAI Tries to Train AI Not to Deceive Users, Realizes It's Instead Teaching It How to Deceive Them While Covering Its Tracks

[ Load more ]