Non-fiction Books

Non-fiction books on AI safety, alignment, and related topics—from primers to foundational texts.

Browse this category in the interactive library →

Bostrom's definitive academic text rigorously maps the strategies, kinetics, and dangers of an intelligence explosion, making the case that alignment is civilization-critical.

2014

The Singularity is NearRay Kurzweil

Kurzweil presents a maximalist case for merging with machines backed by decades of exponential trend data, shaping how the public and policymakers think about AI timelines.

2005

The Age of EmRobin Hanson

Hanson applies rigorous economics to a world of brain emulations, modeling how AI-era wages, wars, and social structures could actually function.

2016

Human CompatibleStuart Russell

Russell argues the standard AI paradigm of optimizing fixed objectives is fundamentally dangerous, proposing instead that machines should defer to uncertain human preferences.

2019

The Alignment ProblemBrian Christian

Christian traces the technical and historical roots of alignment, showing why objective misspecification keeps recurring across every AI paradigm from expert systems to deep learning.

2020

Life 3.0Max Tegmark

Tegmark maps concrete governance and alignment choices that determine whether advanced AI expands human agency or permanently concentrates power.

2017

Introduction to AI Safety, Ethics, and SocietyDan Hendrycks

Hendrycks' textbook surveys technical failure modes, governance constraints, and ethical trade-offs in deploying advanced AI, suitable as a first course in the field.

2024

Uncontrollable: The Threat of Artificial SuperintelligenceDarren McKee

McKee synthesizes the core x-risk arguments into an accessible, urgent case for why superintelligence governance and alignment research cannot wait.

2023

You Look Like a Thing and I Love YouJanelle Shane

Shane uses concrete and often hilarious ML failures to explain why AI systems can be impressive yet brittle, biased, and dangerously easy to mis-specify.

2019

Hello World: Being Human in the Age of AlgorithmsHannah Fry

Fry examines real algorithmic decision systems in justice, medicine, and transport to show where AI improves outcomes and where accountability structures fail.

2018

AI SuperpowersKai-Fu Lee

Lee maps the US-China AI race and explains how geopolitical competition can accelerate deployment well before safety institutions are ready.

2018

The Precipice (Chapter on AI)Toby Ord

Ord situates AI among existential risks and argues our current governance capacity is dangerously inadequate for the transformative systems being built.

2020

The Ethical AlgorithmMichael Kearns, Aaron Roth

Kearns and Roth give technical foundations for fairness, privacy, and accountability in algorithms, prerequisites for any credible AI safety framework.

2019

Army of None: Autonomous Weapons and the Future of WarPaul Scharre

Scharre details how military AI autonomy changes escalation dynamics and why human-in-the-loop control mechanisms consistently lag behind battlefield capability.

2018

The Age of Spiritual MachinesRay Kurzweil

Kurzweil's early timeline forecasts shaped modern discourse on AI trajectories and remain a key reference point for evaluating long-horizon predictions.

1999

Deep LearningIan Goodfellow, Yoshua Bengio, Aaron Courville

The standard technical reference for deep learning, essential context for understanding the architectures and training methods that alignment research targets.

2016

Artificial Intelligence: A Guide for Thinking HumansMelanie Mitchell

Mitchell offers a grounded, skeptical look at current AI capabilities, countering hype with hard limits and clarifying what today's systems actually can and cannot do.

2019

Scary SmartMo Gawdat

Gawdat frames the alignment problem through the emotional lens of parenting a superintelligent child, making existential risk visceral for a general audience.

2021

The Coming WaveMustafa Suleyman

Suleyman argues that containing omni-use technologies like AI is the defining geopolitical challenge of the century, proposing a containment framework from inside the industry.

2023

SuperforecastingPhilip Tetlock

Tetlock teaches the cognitive tools needed to predict technological risks with better-than-random accuracy, directly useful for AI timeline and governance forecasting.

2015

The Scout MindsetJulia Galef

Galef explains how to seek truth over comfort, a critical psychological stance for honestly confronting AI risks without retreating into denial or panic.

2021

Thinking, Fast and SlowDaniel Kahneman

Kahneman reveals the cognitive biases that prevent humans from intuitively grasping exponential growth, tail risks, and the kind of strategic thinking AI safety demands.

2011

Co-IntelligenceEthan Mollick

Mollick offers a practical guide for working alongside current LLMs while understanding their jagged capability frontiers and failure modes.

2024

Gödel, Escher, BachDouglas Hofstadter

Hofstadter explores how consciousness and meaning can emerge from formal systems that look meaningless locally, the deepest conceptual puzzle behind machine intelligence.

1979

A Brief History of IntelligenceMax Bennett

Bennett traces the evolution of intelligence from single-celled organisms to modern brains, clarifying what makes aligned cognition biologically difficult and computationally treacherous.

2024

The Beginning of InfinityDavid Deutsch

Deutsch argues that knowledge creation is unbounded and all problems are solvable in principle, grounding the optimistic case that alignment is achievable.

2011

Genius MakersCade Metz

Metz provides the definitive narrative history of the deep learning revolution and the personalities, rivalries, and safety concerns that shaped it.

2021

CyberneticsNorbert Wiener

Wiener founded the study of feedback and control systems, anticipating by decades the governance problems that arise when intelligent machines act on their own models of the world.

1948

Mind ChildrenHans Moravec

Moravec predicts a future in which robotic descendants supersede humans through technological evolution, an early and influential take on the human obsolescence scenario.

1988

The Society of MindMarvin Minsky

Minsky proposes that intelligence emerges from many small non-intelligent processes coordinated at scale, a framework that anticipated multi-agent AI architectures.

1986

On IntelligenceJeff Hawkins

Hawkins argues that hierarchical prediction is the core organizing principle of biological intelligence, offering a lens for evaluating how artificial systems differ.

2004

Homo DeusYuval Noah Harari

Harari explores the transition toward data-driven authority where algorithms may know us better than we know ourselves, eroding the basis for human autonomy.

2015

Enlightenment NowSteven Pinker

Pinker argues that reason and science have historically improved human welfare, grounding the optimistic counterpoint to doomer narratives about AI.

2018

The Fabric of RealityDavid Deutsch

Deutsch unifies physics, evolution, epistemology, and computation into a single worldview about what is possible, providing deep context for reasoning about superintelligence.

1997

Simulation and SimulacraJean Baudrillard

Baudrillard explains how representations can displace reality entirely, a prescient lens for understanding generative AI media saturation and epistemic erosion.

1981

Finite and Infinite GamesJames Carse

Carse distinguishes short-horizon winning from preserving the long game, a useful framing for AI governance where the goal is keeping options open, not racing to win.

1986

Complexity: A Guided TourMelanie Mitchell

Mitchell explains how complex behavior emerges from simple rules, foundational for understanding why adaptive AI systems resist top-down control.

2009

Out of ControlKevin Kelly

Kelly argues that the most powerful systems must be cultivated rather than rigidly engineered, anticipating challenges in controlling emergent AI behavior.

1994

Whole Earth DisciplineStewart Brand

Brand argues for responsible stewardship of high-powered technologies rather than blanket rejection, a pragmatic stance applicable to AI governance.

2009

Profiles of the FutureArthur C. Clarke

Clarke's forecasting framework, including his famous three laws, remains a classic guide to thinking clearly about radical technological change.

1962

Global Catastrophic RisksNick Bostrom, Milan M. Ćirković

The foundational edited volume on existential and global risks, including AI, widely cited in alignment curricula as the starting point for cross-risk thinking.

2008