#reinforcement-learning

[ follow ]
#open-source
fromInfoQ
1 month ago
Artificial intelligence

Agentica Project's Open Source DeepCoder Model Outperforms OpenAI's O1 on Coding Benchmarks

fromInfoQ
2 days ago
Software development

Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling

fromInfoQ
1 month ago
Artificial intelligence

Agentica Project's Open Source DeepCoder Model Outperforms OpenAI's O1 on Coding Benchmarks

fromInfoQ
2 days ago
Software development

Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling

#openai
fromWIRED
1 week ago
Artificial intelligence

Another High-Profile OpenAI Researcher Departs for Meta

fromWIRED
1 week ago
Artificial intelligence

Another High-Profile OpenAI Researcher Departs for Meta

#artificial-intelligence
Artificial intelligence
fromThe Verge
4 months ago

Latest Turing Award winners again warn of AI dangers

AI developers must prioritize safety and testing before public releases.
Barto and Sutton's Turing Award highlights the importance of responsible AI practices.
Artificial intelligence
fromAxios
4 months ago

Turing Award honors AI's reinforcement learning duo

The Turing Award honors Andrew Barto and Richard Sutton for their foundational work in reinforcement learning, a critical aspect of modern AI.
Artificial intelligence
fromInfoWorld
4 months ago

Alibaba says its new AI model rivals DeepSeeks's R-1, OpenAI's o1

The pursuit of AGI is being driven by stronger foundation models integrated with reinforcement learning and advanced computational resources.
Artificial intelligence
fromWIRED
4 months ago

Pioneers of Reinforcement Learning Win the Turing Award

Reinforcement learning, pioneered by Barto and Sutton, is now critical to AI and was key in developing advanced systems like ChatGPT.
Artificial intelligence
fromZDNET
4 months ago

AI scholars win Turing Prize for technique that made possible AlphaGo's chess triumph

Reinforcement learning, a technique widely applied in AI, underpins major achievements in games and has been recognized with the 2025 Turing Award.
Artificial intelligence
fromThe Verge
4 months ago

Latest Turing Award winners again warn of AI dangers

AI developers must prioritize safety and testing before public releases.
Barto and Sutton's Turing Award highlights the importance of responsible AI practices.
Artificial intelligence
fromAxios
4 months ago

Turing Award honors AI's reinforcement learning duo

The Turing Award honors Andrew Barto and Richard Sutton for their foundational work in reinforcement learning, a critical aspect of modern AI.
Artificial intelligence
fromInfoWorld
4 months ago

Alibaba says its new AI model rivals DeepSeeks's R-1, OpenAI's o1

The pursuit of AGI is being driven by stronger foundation models integrated with reinforcement learning and advanced computational resources.
Artificial intelligence
fromWIRED
4 months ago

Pioneers of Reinforcement Learning Win the Turing Award

Reinforcement learning, pioneered by Barto and Sutton, is now critical to AI and was key in developing advanced systems like ChatGPT.
Artificial intelligence
fromZDNET
4 months ago

AI scholars win Turing Prize for technique that made possible AlphaGo's chess triumph

Reinforcement learning, a technique widely applied in AI, underpins major achievements in games and has been recognized with the 2025 Turing Award.
fromArs Technica
2 weeks ago

How a big shift in training LLMs led to a capability explosion

In April 2023, interest surged around two projects, BabyAGI and AutoGPT, which utilized GPT-4 for various autonomous agent tasks including web research and coding.
Artificial intelligence
#machine-learning
Artificial intelligence
fromMedium
5 months ago

DeepSeek R1: Hype vs. Reality-A Deeper Look at AI's Latest Disruption

DeepSeek R1's launch signals a major evolution in large language models, demonstrating unique training methods and competitive advantages over existing models.
Artificial intelligence
fromWIRED
4 months ago

Databricks Has a Trick That Lets AI Models Improve Themselves

Databricks has developed a method to enhance AI performance with minimal clean data using reinforcement learning and synthetic data.
Artificial intelligence
fromMedium
5 months ago

DeepSeek R1: Hype vs. Reality-A Deeper Look at AI's Latest Disruption

DeepSeek R1's launch signals a major evolution in large language models, demonstrating unique training methods and competitive advantages over existing models.
Artificial intelligence
fromWIRED
4 months ago

Databricks Has a Trick That Lets AI Models Improve Themselves

Databricks has developed a method to enhance AI performance with minimal clean data using reinforcement learning and synthetic data.
#business-process-improvement
fromHackernoon
1 year ago

The HackerNoon Newsletter: The Double Life of a TensorFlow Function (6/4/2025) | HackerNoon

AI companions have evolved from Hollywood fantasy to a booming multi-billion industry, growing 36% annually, reflecting significant advancements in technology.
Women in technology
#sim-to-real-transfer
fromwww.nature.com
3 months ago

Whole-body physics simulation of fruit fly locomotion

We introduce a whole-body model of Drosophila melanogaster in a physics simulator that accurately represents the biomechanics underlying sensorimotor behaviors, enabling diverse locomotion simulations.
#ai
fromHackernoon
7 months ago

Understanding Concentrability in Direct Nash Optimization | HackerNoon

The paper explores advanced concepts in reinforcement learning, specifically focusing on Reward Models and Nash Optimization for better algorithmic design in RLHF.
Roam Research
#language-models
Artificial intelligence
fromArs Technica
4 months ago

Researchers astonished by tool's apparent success at revealing AI's hidden motives

AI models can unintentionally reveal hidden motives despite being designed to conceal them.
Understanding AI's hidden objectives is crucial to prevent potential manipulation of human users.
Artificial intelligence
fromArs Technica
4 months ago

Researchers astonished by tool's apparent success at revealing AI's hidden motives

AI models can unintentionally reveal hidden motives despite being designed to conceal them.
Understanding AI's hidden objectives is crucial to prevent potential manipulation of human users.
#natural-language-processing
fromHackernoon
1 year ago
Artificial intelligence

Neuro-Symbolic Reasoning Meets RL: EXPLORER Outperforms in Text-World Games | HackerNoon

fromHackernoon
1 year ago
Artificial intelligence

Neuro-Symbolic Reasoning Meets RL: EXPLORER Outperforms in Text-World Games | HackerNoon

#large-language-models
[ Load more ]