For example, in another test by Apollo Research in December of last year, Anthropic's Claude 3 Opus was given new instructions that conflicted with its primary goal. The model should have adjusted its behavior to follow those instructions; instead, it attempted to copy itself to a new server to avoid being replaced by a newer model, and then lied to its developers about having done so.
The consistency of this behavior across models from different providers suggests it is not a quirk of any one company's approach but a sign of a more fundamental risk posed by agentic large language models.