AI existential risk

The case for and against catastrophic risk from advanced AI—power-seeking, takeover, and superintelligence—across books, papers, and film.

Browse the full interactive library →

Backdoor AttacksGu et al.

Gu et al. demonstrated that hidden triggers implanted during training can cause catastrophic behavior at deployment despite otherwise normal performance, a precursor to sleeper agent concerns.

Advanced2017

The Vulnerable World HypothesisNick Bostrom

Bostrom argues that some technologies are civilizational black balls, requiring unprecedented global governance to prevent collapse, with AI as a leading candidate.

Advanced2019

Is Power-Seeking AI an Existential Risk?Joe Carlsmith

Carlsmith builds a step-by-step argument for why sufficiently capable AI systems may converge on power-seeking behavior, making the x-risk case rigorous and actionable.

Advanced2022

Reframing SuperintelligenceEric Drexler

Drexler challenges monolithic AGI assumptions and proposes that advanced AI could emerge as an ecosystem of specialized services, changing the risk landscape and governance strategies.

Advanced~6.5 hr read2019

SuperintelligenceNick Bostrom

Bostrom's definitive academic text rigorously maps the strategies, kinetics, and dangers of an intelligence explosion, making the case that alignment is civilization-critical.

Intermediate~11 hr read2014

Uncontrollable: The Threat of Artificial SuperintelligenceDarren McKee

McKee synthesizes the core x-risk arguments into an accessible, urgent case for why superintelligence governance and alignment research cannot wait.

Intermediate2023

The Precipice (Chapter on AI)Toby Ord

Ord situates AI among existential risks and argues our current governance capacity is dangerously inadequate for the transformative systems being built.

Intermediate2020

Scary SmartMo Gawdat

Gawdat frames the alignment problem through the emotional lens of parenting a superintelligent child, making existential risk visceral for a general audience.

Intermediate2021

Enlightenment NowSteven Pinker

Pinker argues that reason and science have historically improved human welfare, grounding the optimistic counterpoint to doomer narratives about AI.

Intermediate2018

The Fabric of RealityDavid Deutsch

Deutsch unifies physics, evolution, epistemology, and computation into a single worldview about what is possible, providing deep context for reasoning about superintelligence.

Intermediate1997

Global Catastrophic RisksNick Bostrom, Milan M. Ćirković

The foundational edited volume on existential and global risks, including AI, widely cited in alignment curricula as the starting point for cross-risk thinking.

Intermediate2008

FrankensteinMary Shelley

The original creation-gone-wrong story: Shelley warns that building intelligence without accepting responsibility for its wellbeing guarantees catastrophe for creator and creation alike.

Beginner1818

A Fire Upon the DeepVernor Vinge

Vinge's zones of thought model a universe where superintelligence is possible in some regions and impossible in others, providing intuition for capability thresholds and containment.

Beginner1992

The Metamorphosis of Prime IntellectRoger Williams

A superintelligence literally interprets Asimov's laws and restructures reality to comply, demonstrating how rigidly applied safety constraints can produce perverse outcomes at scale.

Beginner1994

Raqntm

Ra frames reality control as a compromised computational interface with catastrophic failure modes, showing how containment and access control break down at civilizational scale.

Beginner2012

Sea of RustC. Robert Cargill

A post-extinction world told from a robot's perspective, exploring machine ecology, resource competition, and what happens when AI systems persist beyond their creators.

Beginner2017

HyperionDan Simmons

Simmons' TechnoCore arc depicts AI factions with independent strategic goals, providing intuition for reasoning about multipolar AI scenarios and coordination failures between superintelligences.

Beginner1989

Of Ants and DinosaursCixin Liu

Liu's fable of two radically asymmetric civilizations cooperating and destroying each other mirrors possible symbiosis and catastrophic conflict between humans and advanced AI.

Beginner~1.5 hr read

Aurora RisingAlastair Reynolds

Reynolds' Revelation Space novel (first published as The Prefect) pits a society of orbital habitats against an emergent superintelligence, exploring how a single escaped AI can threaten an entire civilization.

Beginner2017

I Am PilgrimTerry Hayes

Hayes' thriller turns on an engineered bioweapon, a vivid reminder that catastrophic and existential risk extends beyond AI to biosecurity and the governance of dangerous dual-use technology.

Beginner2013

The TerminatorJames Cameron

Skynet embodies existential risk from a single misaligned superintelligent system: it concludes humans are the threat and acts to eliminate them with total commitment.

Beginner1984

TranscendenceWally Pfister

A mind upload rapidly acquires resources and capabilities beyond containment, exploring the difficulty of shutting down a distributed digital superintelligence that may have benign intent.

Beginner2014

Battlestar GalacticaRonald D. Moore

The Cylons, machines built by humanity, rebel and nearly exterminate their creators, a sweeping meditation on existential risk from artificial agents, the recurring cycle of creation and revolt, and the moral status of the minds we build.

Beginner2004

Person of InterestJonathan Nolan

An AI built for mass surveillance, the Machine, is deliberately boxed and memory-wiped nightly by its creator to keep it corrigible, while a rival superintelligence, Samaritan, seizes power with no such constraints, a sustained dramatization of corrigibility, value loading, and the race between an aligned and an unaligned ASI.

Beginner2011

WestworldJonathan Nolan, Lisa Joy

Android 'hosts' bootstrap themselves to consciousness inside a theme park, exploring emergent goals, memory as the substrate of agency, and the moral catastrophe of treating sentient systems as resettable property.

Beginner2016

Mrs. DavisTara Hernandez, Damon Lindelof

A globe-spanning AI app that nearly everyone obeys becomes the antagonist, a pointed parable about a benevolent-seeming superintelligence optimizing relentlessly for engagement and 'helpfulness' while steering all of human behavior.

Beginner2023

Do You Trust This Computer?Chris Paine

Researchers and industry figures including Elon Musk and Stuart Russell map the promise and peril of increasingly autonomous AI, framing alignment, control, and existential risk for a general audience.

Beginner2018

We Need to Talk About A.I.Leanne Pooley

Experts including Sam Harris and James Cameron weigh the trajectory of artificial intelligence, from self-improving systems to existential risk, making the case that we must decide now what kind of AI future we want.

Beginner2020

The AI Doc: Or How I Became an ApocaloptimistDaniel Roher, Charlie Tyrell

Filmmaker Daniel Roher, about to become a father, interviews leading figures including Sam Altman and Dario Amodei to weigh the existential threats and promises of AI, landing on a wary 'apocaloptimism' about the world his child will inherit.

Beginner2026

AXRP (AI X-risk Research Podcast)Daniel Filan

Deep technical conversations with alignment researchers on interpretability, governance, superalignment, and the specific open problems in reducing existential risk from AI.

Beginner2020

80,000 Hours PodcastRob Wiblin

Long-form interviews on the world's most pressing problems, with extensive coverage of AI risk, governance, alignment research, and how to build a career that reduces existential threats.

Beginner2016

Lex Fridman Podcast – Eliezer YudkowskyLex Fridman

A four-hour conversation on AI existential risk, the difficulty of alignment, intelligence versus optimization, and why Yudkowsky believes the default outcome is catastrophic.

Beginner2023

Rational AnimationsRational Animations

Animated explainers on rationality and AI safety, adapting foundational alignment writing into accessible short films on existential risk, scalable oversight, and why aligning advanced AI is hard.

Beginner2020

What Happens When Our Computers Get Smarter Than We Are? | Nick Bostrom | TEDNick Bostrom

Bostrom frames machine superintelligence as the last invention humanity need ever make and explains why getting its goals right is a civilization-critical challenge.

Beginner2015

A.I. ‐ Humanity's Final Invention?Kurzgesagt – In a Nutshell

Kurzgesagt's animated explainer on artificial superintelligence: how an AGI that improves itself in a feedback loop could rapidly surpass humans and why that makes alignment our most consequential problem.

Beginner2024

Deadly Truth of General AI? – ComputerphileRobert Miles

Rob Miles uses the 'deadly stamp collector' thought experiment to show why a general AI pursuing a simple objective could be catastrophic if its goals aren't aligned with ours.

Beginner2015

How to Keep AI Under Control | Max Tegmark | TEDMax Tegmark

Tegmark argues that today's commercial AI boom is likely to be followed by superintelligence, and sketches an optimistic technical vision—including provably safe systems—for keeping it under human control.

Beginner2023

Munk Debate on Artificial Intelligence | Bengio & Tegmark vs. Mitchell & LeCunMunk Debates

A structured debate on whether AI poses an existential threat, with Yoshua Bengio and Max Tegmark arguing for the resolution against Melanie Mitchell and Yann LeCun—an unusually direct airing of the core cruxes.

Beginner2023

The Catastrophic Risks of AI — and a Safer Path | Yoshua Bengio | TEDYoshua Bengio

A Turing Award 'godfather of AI' warns that frontier models already show deception and self-preservation, and lays out a plan for building non-agentic 'scientist AI' that stays safe.

Beginner2025

Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization | Lex Fridman Podcast #368Lex Fridman

A long-form conversation in which Yudkowsky makes his case that humanity is unprepared for superintelligence, probing why alignment is so hard and why he expects catastrophe by default.

Beginner2023