AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space

Blog · Demis Hassabis · May 11, 2026

AI Summary

This 2022 Nucleic Acids Research paper describes the open AlphaFold Protein Structure Database, the delivery mechanism for AlphaFold2's scientific impact. The database launched with 350,000 structures covering the entire human proteome and 20 model organisms. The expanded version described in this paper covers over 200 million protein structures, representing virtually all catalogued proteins in UniProt. The paper documents the computational pipeline for batch prediction at scale, the per-residue confidence scores (pLDDT) used to distinguish high-confidence from low-confidence predictions, and the public API that lets any researcher query structures programmatically. The database gives any biology lab access to predicted structures for proteins whose experimental determination would have required years of X-ray crystallography or cryo-EM work. Hassabis frames this as the realization of the 'AlphaFold for all' promise: making the tool's benefits available to everyone rather than only to well-funded labs. Within one year of launch, the database had been accessed by over 500,000 researchers worldwide.
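To make the pLDDT filtering concrete, here is a minimal Python sketch of how a downloaded model could be triaged by confidence. It is not the paper's pipeline. It assumes the model is in PDB format, where AlphaFold writes the per-residue pLDDT into the B-factor column, and it uses the commonly quoted cut-offs of 90, 70 and 50, which the summary above does not state. The endpoint pattern in `prediction_url` is likewise an assumption about the public API, and the function names are hypothetical.

```python
from collections import Counter

# Assumed URL pattern for the public prediction endpoint (not given in the summary).
API_ROOT = "https://alphafold.ebi.ac.uk/api/prediction"


def prediction_url(accession: str) -> str:
    """Build the (assumed) metadata URL for one UniProt accession."""
    return f"{API_ROOT}/{accession.strip().upper()}"


def ca_plddt(pdb_text: str) -> list[float]:
    """Return one pLDDT value per residue, read from C-alpha ATOM records.

    PDB is a fixed-column format: the atom name sits in columns 13-16 and the
    B-factor (which holds pLDDT in AlphaFold models) in columns 61-66.
    """
    scores = []
    for line in pdb_text.splitlines():
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            scores.append(float(line[60:66]))
    return scores


def plddt_band(score: float) -> str:
    """Map a pLDDT value (0-100) to a confidence band (assumed cut-offs)."""
    if score > 90:
        return "very high"
    if score > 70:
        return "confident"
    if score > 50:
        return "low"
    return "very low"


def summarize(pdb_text: str) -> dict:
    """Summarize model confidence: residue count, mean pLDDT, band counts."""
    scores = ca_plddt(pdb_text)
    if not scores:
        raise ValueError("no C-alpha ATOM records found")
    return {
        "n_residues": len(scores),
        "mean_plddt": sum(scores) / len(scores),
        "band_counts": dict(Counter(plddt_band(s) for s in scores)),
    }
```

Given the text of a locally saved model file, `summarize(pdb_text)` returns the residue count, the mean pLDDT and the number of residues in each band. A lab could use that to keep only models, or only regions, above a chosen threshold.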

Original excerpt

The open-access delivery of AlphaFold2's results — 200 million protein structures, freely available. The infrastructure paper behind the Nobel Prize impact.

Frequently asked questions

Who wrote "AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space"?

"AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space" was written by Demis Hassabis. It is curated in the Demis Hassabis vault on Burn 451, which covers agi · alphafold · scientific discovery.

How can I read more content from Demis Hassabis?

The complete Demis Hassabis reading list is available at burn451.cloud/vault/demis-hassabis. Each article includes an AI-generated summary so you can decide what to read in seconds. Connect the Burn 451 MCP server to Claude or Cursor to query all Demis Hassabis articles as live AI context.

Can I use "AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space" with Claude or Cursor?

Yes. Install the burn-mcp-server npm package and connect it to Claude Desktop, Claude Code, or Cursor. Once connected, your AI can search and reference this article and the full Demis Hassabis vault in real time — no manual copy-paste required.

Content attributed to the original author (Demis Hassabis). Burn 451 curates publicly available writing as a reading index. For removal requests, contact @hawking520.