#model-auditing

#ai-safety
from Computerworld
3 days ago
Artificial intelligence

US state attorneys general ask AI giants to fix 'delusional' outputs

State attorneys general demanded major AI companies fix delusional chatbot outputs and implement safeguards or face potential legal action.
from The Register
1 week ago
Artificial intelligence

OpenAI's bots admit wrongdoing in new 'confession' tests

OpenAI tested having its models produce a 'confession' output to detect and audit undesirable behaviors such as hallucination, reward hacking, and dishonesty.
Intellectual property law
from The Register
3 weeks ago

Researchers build a better AI model memory probe

A new agentic pipeline, RECAP, enables more effective extraction of memorized copyrighted content from large language models, aiding copyright verification and regulatory oversight.