Reward is Enough

BlogDemis HassabisMay 11, 2026

AI Summary

This 2021 AI journal paper by David Silver and Demis Hassabis is the theoretical underpinning of DeepMind's research program. The central hypothesis: all aspects of intelligence — perception, language, knowledge, memory, cognition, imagination, planning, and social intelligence — can be understood as arising from a single objective: the maximization of a cumulative reward signal. If this is true, then a sufficiently powerful reward-maximizing agent, operating in a sufficiently rich environment, would develop all the capabilities we associate with general intelligence as instrumental subgoals of reward maximization. The paper argues this is more than speculation: reinforcement learning has already produced agents with sophisticated perceptual, language, and planning capabilities — not because these were programmed in, but because they were useful for maximizing reward. The paper does not claim that arbitrary environments will produce general intelligence — the richness of the environment matters enormously. But it argues that there is no fundamental limitation preventing reward maximization from being sufficient for AGI. This contrasts sharply with approaches that try to engineer specific cognitive capabilities from scratch (like symbolic AI or cognitive architectures).

Original excerpt

The theoretical manifesto: all aspects of general intelligence — perception, language, memory, imagination, planning — can emerge from a single objective: maximizing cumulative reward. The paper behind DeepMind's RL-first research philosophy.

Read full article on sciencedirect.com

Frequently asked questions

What is "Reward is Enough" about?

Who wrote "Reward is Enough"?

"Reward is Enough" was written by Demis Hassabis. It is curated in the Demis Hassabis vault on Burn 451, which covers agi · alphafold · scientific discovery.

How can I read more content from Demis Hassabis?

The complete Demis Hassabis reading list is available at burn451.cloud/vault/demis-hassabis. Each article includes an AI-generated summary so you can decide what to read in seconds. Connect the Burn 451 MCP server to Claude or Cursor to query all Demis Hassabis articles as live AI context.

Can I use "Reward is Enough" with Claude or Cursor?

Yes. Install the burn-mcp-server npm package and connect it to Claude Desktop, Claude Code, or Cursor. Once connected, your AI can search and reference this article and the full Demis Hassabis vault in real time — no manual copy-paste required.

31 more articles in this vault.

Import the full Demis Hassabis vault to Burn 451 and build your own knowledge base.

View Full Vault Get Burn 451

Content attributed to the original author (Demis Hassabis). Burn 451 curates publicly available writing as a reading index. For removal requests, contact @hawking520.