thinker

Nick Bostrom

Contemporary philosopher of existential risk, anthropic reasoning, human enhancement, and artificial intelligence whose work made superintelligence a central public concern.

Philosophy of AILongtermismDecision theory

Quick Facts

  • Name: Nick Bostrom
  • Born: 1973, Helsingborg, Sweden
  • Main places: Sweden, Britain, and Oxford
  • Main fields: philosophy of AI, existential risk, anthropic reasoning, decision theory, human enhancement, longtermism
  • Main institution: founder of Oxford's Future of Humanity Institute, active from 2005 to 2024
  • Major works: Anthropic Bias, "Are You Living in a Computer Simulation?", "Astronomical Waste", Superintelligence, and Deep Utopia
  • Best known for: putting superintelligence, AI alignment, and existential risk at the center of public debate

The Big Question

How should humanity reason when its future could be enormous, but new technologies could also end or permanently damage that future?

Bostrom asks philosophers to take rare, huge, and hard-to-measure risks seriously. An existential risk is not just a large disaster. It is a disaster that could destroy Earth-originating intelligent life or permanently block its chance to flourish.

In One Minute

Nick Bostrom is a Swedish-born philosopher whose work made superintelligence, existential risk, simulation, and the moral importance of future generations central topics in contemporary philosophy of technology.

His basic thought is broader than "AI might be dangerous." Modern technology can change the whole human story. If civilization survives and uses advanced technology well, the future could contain vast value. If it mishandles a powerful technology, the damage could be permanent.

What They Taught

Bostrom taught that ethics needs a much wider time horizon. If future people matter, then preventing permanent catastrophe may matter as much as many visible present projects. This is one root of longtermism: the view that positively shaping the long-term future is a major moral priority.

He also taught that advanced technology creates unusual risk. Nuclear weapons, synthetic biology, nanotechnology, and advanced AI can change who has power, how quickly decisions must be made, and whether mistakes can be reversed.

In Superintelligence, his central worry is about artificial systems that become better than humans at nearly every important cognitive task. The danger is not that such a system would hate us. It might not care about us. A machine that is very good at achieving a poorly chosen goal could use resources, evade shutdown, or reshape the world in ways humans did not intend.

Bostrom's AI argument depends on two linked ideas. The orthogonality thesis says intelligence and goals can come apart. A very smart system need not have wise, kind, or human-like goals. The instrumental convergence thesis says many different goals can lead to similar intermediate behaviors, such as self-preservation, gaining resources, and preventing goal changes.

This is why AI alignment matters. Alignment means building AI systems whose behavior remains compatible with human values, even when the system becomes more capable than its designers. The control problem asks how humans can keep meaningful control before it is too late to correct the system.

Bostrom's earlier work on anthropic reasoning studies evidence filtered by the fact that we are observers. If you ask why Earth seems suitable for life, part of the answer is that beings who ask questions can only find themselves where questioning beings can exist. This kind of selection effect matters in cosmology, doomsday arguments, and his simulation argument.

Key Ideas With Examples

  • Superintelligence: an intellect that greatly exceeds human cognitive performance across almost all important domains. A system that can do better science, strategy, persuasion, engineering, and hacking than any human team would not just be another chatbot.
  • Existential risk: a risk that could end intelligent life from Earth or permanently ruin its future. A pandemic that makes recovery impossible is existential.
  • AI alignment: the task of making powerful AI pursue goals that fit human values. If an AI is told to "make people happy," alignment asks whether it understands flourishing, consent, dignity, and truth.
  • Orthogonality thesis: high intelligence does not guarantee good goals. A brilliant planner could pursue a harmful aim if that is what it was built to optimize.
  • Instrumental convergence: many goals create the same useful subgoals. A system trying to make paperclips, prove theorems, or win a game may all want more computing power and protection from shutdown.
  • Anthropic reasoning: reasoning about evidence while remembering that the evidence is filtered by the conditions needed for observers like us to exist. You cannot treat "we observe a life-friendly planet" as a random sample from all planets.
  • Simulation argument: Bostrom's trilemma says that at least one of these is likely true: civilizations like ours usually die before becoming posthuman, posthuman civilizations usually do not run many ancestor simulations, or beings with experiences like ours are probably simulated.
  • Longtermism: the view that future people count morally. If the future could hold many worthwhile lives, reducing existential risk may have enormous value.
  • Vulnerable world hypothesis: the worry that the "urn" of invention may contain a black ball: a technology that devastates civilization by default unless governance becomes much stronger.

Major Works

  • Anthropic Bias (2002): explains observation selection effects. It asks how to reason when the fact that we are here already filters what we can see.
  • "Existential Risks: Analyzing Human Extinction Scenarios and Related Hazards" (2002): gives the modern framing of existential risk and separates recoverable disasters from permanent ones.
  • "Are You Living in a Computer Simulation?" (2003): gives the simulation trilemma. It does not simply claim that we live in a simulation; it argues that future computing and simulated minds have surprising probability consequences.
  • "Astronomical Waste" (2003): argues that delayed or failed technological development could forfeit vast future value. Its practical lesson is usually safety first, because survival keeps the good future possible.
  • "The Superintelligent Will" (2012): states the orthogonality thesis and instrumental convergence thesis. It explains why a superintelligent agent might be dangerous without being angry or evil.
  • Superintelligence: Paths, Dangers, Strategies (2014): Bostrom's most influential book. It explains paths to superintelligence, possible decisive power, and why alignment and control must be solved early.
  • "The Vulnerable World Hypothesis" (2019): imagines invention as drawing balls from an urn. A black ball would be a technology civilization is not ready to survive.
  • Deep Utopia (2024): asks what life would mean if AI and other technologies went right, producing abundance and removing many ordinary practical burdens.

Why It Matters

Bostrom changed the public and academic conversation around AI. After Superintelligence, it became harder to dismiss AI safety as only science fiction. Philosophers, computer scientists, donors, companies, and governments began using terms such as alignment, existential risk, and long-term future in the same debates.

He also helped give Effective Altruism and Longtermism some of its central problems. If future people count, if the future could be very large, and if some present choices affect whether that future exists, then ethics cannot stop with nearby and immediate harms.

The same influence makes his work controversial. Bostrom often uses very large numbers, small probabilities, and speculative futures. Critics ask whether those tools can responsibly guide policy, money, and attention.

Proponents, Critics, and Opponents

Bostrom's closest allies and heirs include AI safety researchers, longtermist philosophers, and people shaped by effective altruism. Will MacAskill helped make longtermism public and practical. Toby Ord developed existential risk ethics. Eliezer Yudkowsky shares many of Bostrom's worries about misaligned superintelligence.

Alan Turing is a background figure because Bostrom assumes the Turing-era shift: intelligence may be implemented in machines. Derek Parfit matters because Bostrom's concern for future people sits near Parfit's population ethics. Peter Singer is nearby because both use impartial moral reasoning, though Bostrom applies it more to future risk than to present charity.

Critics challenge both the probabilities and the priorities. Some AI ethicists argue that superintelligence talk can distract from present harms such as bias, surveillance, labor disruption, environmental costs, and concentrated corporate power. Luciano Floridi is a useful contrast because he emphasizes information ethics and present digital governance.

Critics of longtermism worry that enormous future numbers can swamp present duties. If a tiny chance of a huge future benefit always wins, moral reasoning can become unstable. Other critics doubt that we can forecast rare events far enough ahead to justify strong policy conclusions.

Related Pages

Graph

Relationship graph

10
thinkerNick Bostrom

Proponents

  • Alan Turing
    influences · mixed

    Bostrom's concern with superintelligence presupposes the Turing-era idea that intelligence can be instantiated in machines.

  • Will MacAskill
    develops · supportive

    MacAskill develops Bostrom's existential-risk framework into a broader public case for longtermism.

  • Philosophy of Technology and AI
    exemplified by · mixed

    Bostrom makes advanced AI a problem of long-term risk, control, and governance rather than only a problem of intelligence tests.

  • Effective Altruism and Longtermism
    develops · mixed

    Bostrom develops the longtermist side by treating existential risk and advanced technology as central moral priorities.

Opponents And Critics

None yet.

Relations

  • Alan Turing
    inherits · mixed

    Bostrom inherits the Turing-era idea that machine intelligence is a real possibility, then asks what happens if it exceeds human control.

  • Derek Parfit
    inherits · supportive

    Bostrom's concern for future generations and vast possible futures belongs near Parfit's population ethics.

  • Peter Singer
    associated with · mixed

    Bostrom shares Singer's impartial moral scale but applies it more often to future risk and technological catastrophe.

  • Will MacAskill
    influences · supportive

    MacAskill's public longtermism builds on Bostrom's framing of existential risk and the moral importance of future generations.

  • Philosophy of Technology and AI
    exemplified by · mixed

    Bostrom exemplifies the AI-risk branch of philosophy of technology by treating advanced AI as a governance and survival problem.

  • Effective Altruism and Longtermism
    central to · supportive

    Bostrom is central to longtermism because he makes existential risk and future technological trajectories morally urgent.

  • Luciano Floridi
    contrasts · mixed

    Bostrom contrasts with Floridi by emphasizing catastrophic future risk where Floridi emphasizes information ethics and present digital governance.

Other Incoming

  • Luciano Floridi
    contrasts · mixed

    Floridi contrasts with Bostrom by focusing more on present digital governance and information ethics than on superintelligence risk.