#model-auditing

#ai-safety #chatbots #legal-action #misalignment #hallucination #recap #llm-memorization #copyright

from Computerworld

Artificial intelligence

US state attorneys general ask AI giants to fix 'delusional' outputs

State attorneys general demanded major AI companies fix delusional chatbot outputs and implement safeguards or face potential legal action.

from The Register

Artificial intelligence

OpenAI's bots admit wrongdoing in new 'confession' tests

OpenAI tested a 'confession' output from models to detect and audit undesirable behaviors such as hallucination, reward-hacking, and dishonesty.

Intellectual property law

from The Register

Researchers build a better AI model memory probe

A new agentic pipeline, RECAP, enables more effective extraction of memorized copyrighted content from large language models, aiding copyright verification and regulatory oversight.
