Faster sorting algorithms discovered using deep reinforcement learning (AlphaDev)

BlogDemis HassabisMay 11, 2026

AI Summary

AlphaDev (Nature, June 2023) applies AlphaZero-style reinforcement learning to discover novel computer algorithms — specifically, assembly-level sorting routines that are faster than the hand-optimized functions that have been in widespread use for decades. The system frames algorithm design as a game: each move adds one CPU instruction, and the reward is a combination of correctness and runtime latency on real hardware. AlphaDev discovered sorting algorithms for 3, 5, 6, 7, and 8 element sequences that were 70% faster than the best known implementations — algorithms that are now deployed in LLVM's standard library and used in virtually every C++ program worldwide. The significance for AI research: this is the first time AI discovered a genuinely new algorithm (not just optimized an existing one) that was deemed good enough to replace hand-written expert code at scale. It also demonstrates that deep RL can operate in domains with combinatorially large discrete action spaces — a key capability requirement for future AI systems that must reason about symbolic structures rather than continuous spaces.

Original excerpt

AlphaDev discovered sorting algorithms 70% faster than expert hand-written code — now in LLVM's standard library, running in virtually every C++ program. AI discovering genuinely new algorithms, not just optimizing existing ones.

Frequently asked questions

What is "Faster sorting algorithms discovered using deep reinforcement learning (AlphaDev)" about?

AlphaDev (Nature, June 2023) applies AlphaZero-style reinforcement learning to discover novel computer algorithms — specifically, assembly-level sorting routines that are faster than the hand-optimized functions that have been in widespread use for decades. The system frames algorithm design as a ga…

Who wrote "Faster sorting algorithms discovered using deep reinforcement learning (AlphaDev)"?

"Faster sorting algorithms discovered using deep reinforcement learning (AlphaDev)" was written by Demis Hassabis. It is curated in the Demis Hassabis vault on Burn 451, which covers agi · alphafold · scientific discovery.

How can I read more content from Demis Hassabis?

The complete Demis Hassabis reading list is available at burn451.cloud/vault/demis-hassabis. Each article includes an AI-generated summary so you can decide what to read in seconds. Connect the Burn 451 MCP server to Claude or Cursor to query all Demis Hassabis articles as live AI context.

Can I use "Faster sorting algorithms discovered using deep reinforcement learning (AlphaDev)" with Claude or Cursor?

Yes. Install the burn-mcp-server npm package and connect it to Claude Desktop, Claude Code, or Cursor. Once connected, your AI can search and reference this article and the full Demis Hassabis vault in real time — no manual copy-paste required.

31 more articles in this vault.

Import the full Demis Hassabis vault to Burn 451 and build your own knowledge base.

Content attributed to the original author (Demis Hassabis). Burn 451 curates publicly available writing as a reading index. For removal requests, contact @hawking520.