AI existential risk

The case for and against catastrophic risk from advanced AI—power-seeking, takeover, and superintelligence—across books, papers, and film.

Browse the full interactive library →

Backdoor AttacksGu et al.

Gu et al. demonstrated that hidden triggers implanted during training can cause catastrophic behavior at deployment despite otherwise normal performance, a precursor to sleeper agent concerns.

Advanced2017
The Vulnerable World HypothesisNick Bostrom

Bostrom argues that some technologies are civilizational black balls, requiring unprecedented global governance to prevent collapse, with AI as a leading candidate.

Advanced2019
Is Power-Seeking AI an Existential Risk?Joe Carlsmith

Carlsmith builds a step-by-step argument for why sufficiently capable AI systems may converge on power-seeking behavior, making the x-risk case rigorous and actionable.

Advanced2022
Reframing SuperintelligenceEric Drexler

Drexler challenges monolithic AGI assumptions and proposes that advanced AI could emerge as an ecosystem of specialized services, changing the risk landscape and governance strategies.

Advanced~6.5 hr read2019
SuperintelligenceNick Bostrom

Bostrom's definitive academic text rigorously maps the strategies, kinetics, and dangers of an intelligence explosion, making the case that alignment is civilization-critical.

Intermediate~11 hr read2014
The Precipice (Chapter on AI)Toby Ord

Ord situates AI among existential risks and argues our current governance capacity is dangerously inadequate for the transformative systems being built.

Intermediate2020
Scary SmartMo Gawdat

Gawdat frames the alignment problem through the emotional lens of parenting a superintelligent child, making existential risk visceral for a general audience.

Intermediate2021
Enlightenment NowSteven Pinker

Pinker argues that reason and science have historically improved human welfare, grounding the optimistic counterpoint to doomer narratives about AI.

Intermediate2018
The Fabric of RealityDavid Deutsch

Deutsch unifies physics, evolution, epistemology, and computation into a single worldview about what is possible, providing deep context for reasoning about superintelligence.

Intermediate1997
Global Catastrophic RisksNick Bostrom, Milan M. Ćirković

The foundational edited volume on existential and global risks, including AI, widely cited in alignment curricula as the starting point for cross-risk thinking.

Intermediate2008
FrankensteinMary Shelley

The original creation-gone-wrong story: Shelley warns that building intelligence without accepting responsibility for its wellbeing guarantees catastrophe for creator and creation alike.

Beginner1818
A Fire Upon the DeepVernor Vinge

Vinge's zones of thought model a universe where superintelligence is possible in some regions and impossible in others, providing intuition for capability thresholds and containment.

Beginner1992
The Metamorphosis of Prime IntellectRoger Williams

A superintelligence literally interprets Asimov's laws and restructures reality to comply, demonstrating how rigidly applied safety constraints can produce perverse outcomes at scale.

Beginner1994
Raqntm

Ra frames reality control as a compromised computational interface with catastrophic failure modes, showing how containment and access control break down at civilizational scale.

Beginner2012
Sea of RustC. Robert Cargill

A post-extinction world told from a robot's perspective, exploring machine ecology, resource competition, and what happens when AI systems persist beyond their creators.

Beginner2017
HyperionDan Simmons

Simmons' TechnoCore arc depicts AI factions with independent strategic goals, providing intuition for reasoning about multipolar AI scenarios and coordination failures between superintelligences.

Beginner1989
Of Ants and DinosaursCixin Liu

Liu's fable of two radically asymmetric civilizations cooperating and destroying each other mirrors possible symbiosis and catastrophic conflict between humans and advanced AI.

Beginner~1.5 hr read
Aurora RisingAlastair Reynolds

Reynolds' Revelation Space novel (first published as The Prefect) pits a society of orbital habitats against an emergent superintelligence, exploring how a single escaped AI can threaten an entire civilization.

Beginner2017
I Am PilgrimTerry Hayes

Hayes' thriller turns on an engineered bioweapon, a vivid reminder that catastrophic and existential risk extends beyond AI to biosecurity and the governance of dangerous dual-use technology.

Beginner2013
The TerminatorJames Cameron

Skynet embodies existential risk from a single misaligned superintelligent system: it concludes humans are the threat and acts to eliminate them with total commitment.

Beginner1984
TranscendenceWally Pfister

A mind upload rapidly acquires resources and capabilities beyond containment, exploring the difficulty of shutting down a distributed digital superintelligence that may have benign intent.

Beginner2014
Battlestar GalacticaRonald D. Moore

The Cylons, machines built by humanity, rebel and nearly exterminate their creators, a sweeping meditation on existential risk from artificial agents, the recurring cycle of creation and revolt, and the moral status of the minds we build.

Beginner2004
Person of InterestJonathan Nolan

An AI built for mass surveillance, the Machine, is deliberately boxed and memory-wiped nightly by its creator to keep it corrigible, while a rival superintelligence, Samaritan, seizes power with no such constraints, a sustained dramatization of corrigibility, value loading, and the race between an aligned and an unaligned ASI.

Beginner2011
WestworldJonathan Nolan, Lisa Joy

Android 'hosts' bootstrap themselves to consciousness inside a theme park, exploring emergent goals, memory as the substrate of agency, and the moral catastrophe of treating sentient systems as resettable property.

Beginner2016
Mrs. DavisTara Hernandez, Damon Lindelof

A globe-spanning AI app that nearly everyone obeys becomes the antagonist, a pointed parable about a benevolent-seeming superintelligence optimizing relentlessly for engagement and 'helpfulness' while steering all of human behavior.

Beginner2023
Do You Trust This Computer?Chris Paine

Researchers and industry figures including Elon Musk and Stuart Russell map the promise and peril of increasingly autonomous AI, framing alignment, control, and existential risk for a general audience.

Beginner2018
We Need to Talk About A.I.Leanne Pooley

Experts including Sam Harris and James Cameron weigh the trajectory of artificial intelligence, from self-improving systems to existential risk, making the case that we must decide now what kind of AI future we want.

Beginner2020
The AI Doc: Or How I Became an ApocaloptimistDaniel Roher, Charlie Tyrell

Filmmaker Daniel Roher, about to become a father, interviews leading figures including Sam Altman and Dario Amodei to weigh the existential threats and promises of AI, landing on a wary 'apocaloptimism' about the world his child will inherit.

Beginner2026
AXRP (AI X-risk Research Podcast)Daniel Filan

Deep technical conversations with alignment researchers on interpretability, governance, superalignment, and the specific open problems in reducing existential risk from AI.

Beginner2020
80,000 Hours PodcastRob Wiblin

Long-form interviews on the world's most pressing problems, with extensive coverage of AI risk, governance, alignment research, and how to build a career that reduces existential threats.

Beginner2016
Lex Fridman Podcast – Eliezer YudkowskyLex Fridman

A four-hour conversation on AI existential risk, the difficulty of alignment, intelligence versus optimization, and why Yudkowsky believes the default outcome is catastrophic.

Beginner2023
Rational AnimationsRational Animations

Animated explainers on rationality and AI safety, adapting foundational alignment writing into accessible short films on existential risk, scalable oversight, and why aligning advanced AI is hard.

Beginner2020
A.I. ‐ Humanity's Final Invention?Kurzgesagt – In a Nutshell

Kurzgesagt's animated explainer on artificial superintelligence: how an AGI that improves itself in a feedback loop could rapidly surpass humans and why that makes alignment our most consequential problem.

Beginner2024
Deadly Truth of General AI? – ComputerphileRobert Miles

Rob Miles uses the 'deadly stamp collector' thought experiment to show why a general AI pursuing a simple objective could be catastrophic if its goals aren't aligned with ours.

Beginner2015
How to Keep AI Under Control | Max Tegmark | TEDMax Tegmark

Tegmark argues that today's commercial AI boom is likely to be followed by superintelligence, and sketches an optimistic technical vision—including provably safe systems—for keeping it under human control.

Beginner2023