#deepseek-r1 tag

Researchers Hack DeepSeek to Speak Freely About Tiananmen Square

Researchers compressed DeepSeek R1 by 55% and removed its censorship using quantum-inspired tensor-network compression while maintaining performance and reducing parameters.

Artificial intelligence

fromWIRED

7 months ago

Distillation Can Make AI Models Smaller and Cheaper

Knowledge distillation enables smaller models to mimic larger ones efficiently and can explain DeepSeek R1's claims and the resulting industry reaction.

Artificial intelligence

fromIT Pro

7 months ago

DeepSeek's R1 model training costs pour cold water on big tech's massive AI spending

DeepSeek trained its R1 reasoning model for about $294,000 using 512 Nvidia H800 chips, plus ~$6M for its base LLM.

fromBusiness Insider

11 months ago

DeepSeek's R1 was 'genuinely a gift to the world's AI industry,' says Jensen Huang

The amount of computer science breakthroughs is really quite significant and has really opened up a lot of great research for researchers in the United States and around the world.

Artificial intelligence

#artificial-intelligence

Artificial intelligence

fromMedium

1 year ago

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1 enhances AI reasoning and adaptability using Reinforcement Learning and long chains of thought.

Artificial intelligence

fromMedium

1 year ago

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1 model uses Reinforcement Learning for advanced reasoning and problem-solving, moving beyond traditional supervised learning methods.

fromMedium

1 year ago

Artificial intelligence

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

Artificial intelligence

fromMedium

1 year ago

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1 enhances AI reasoning and adaptability using Reinforcement Learning and long chains of thought.

Artificial intelligence

fromMedium

1 year ago

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1 model uses Reinforcement Learning for advanced reasoning and problem-solving, moving beyond traditional supervised learning methods.

Artificial intelligence

fromMedium

1 year ago

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1 enhances AI reasoning and problem-solving using Reinforcement Learning, surpassing limitations of traditional supervised learning methods.

more#artificial-intelligence

#deepseek-r1#deepseek-r1

Researchers Hack DeepSeek to Speak Freely About Tiananmen Square

Distillation Can Make AI Models Smaller and Cheaper

DeepSeek's R1 model training costs pour cold water on big tech's massive AI spending

DeepSeek's R1 was 'genuinely a gift to the world's AI industry,' says Jensen Huang

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

DeepSeek R1: Unlocking Advanced AI Through Reinforcement Learning and Emergent Self-Reflection

#deepseek-r1
#deepseek-r1