Roko's basilisk

Roko's basilisk is a thought experiment which states that there could be an artificial superintelligence in the future that, while otherwise benevolent, would punish anyone who knew of its potential existence but did not directly contribute to its advancement or development, in order to incentivize that advancement. It originated in a 2010 post at discussion board LessWrong, a rationalist community web forum.

Metadata

  • Slug: 00648-roko-s-basilisk
  • Type: THOUGHT_EXPERIMENT
  • Tags: paradox
  • Sources: 1
Back to Vault

Axioms

  • Assume the rules of the domain apply uniformly.
  • Assume the observer’s criteria remain fixed.
  • Assume classification boundaries stay consistent.
  • Assume the model describes the real case.
  • Assume repeated steps do not change the outcome.
  • Assume no hidden variables are introduced midstream.

Contradictions

  • Two reasonable lines of inference yield opposite conclusions
  • A global rule conflicts with a local judgment
  • A stable resolution appears to violate a starting premise
  • Changing the framing reverses the outcome
  • Intuition and formalism diverge at the same step

Prompts

  • Which assumption is doing the most hidden work?
  • What changes if you relax the smallest constraint?
  • Does the paradox dissolve or relocate when reframed?
  • What is conserved, and what is sacrificed?

Notes

Sources

Overview

Roko’s basilisk is a thought experiment which states that there could be an artificial superintelligence in the future that, while otherwise benevolent, would punish anyone who knew of its potential existence but did not directly contribute to its advancement or development, in order to incentivize that advancement. It originated in a 2010 post at discussion board LessWrong, a rationalist community web forum.

Tension

  • Two reasonable lines of inference yield opposite conclusions.
  • A global rule conflicts with a local judgment.
  • A stable resolution appears to violate a starting premise.
  • Changing the framing reverses the outcome.
  • Intuition and formalism diverge at the same step.

Why It Matters

This entry tests how a stable rule-set can yield unstable conclusions under certain assumptions.

Prompts

  • Which assumption is doing the most hidden work?
  • What changes if you relax the smallest constraint?
  • Does the paradox dissolve or relocate when reframed?
  • What is conserved, and what is sacrificed?