If you sat down to read the human genome aloud — one base pair per second, without stopping, day and night — it would take you nine and a half years to finish. In that time you would have read not random noise, not mere chemical complexity, but a message: a four-letter code organized into genes, regulatory sequences, and hierarchical instruction sets that direct the construction of a functional organism from a single cell. The question is not whether information of this kind exists in biology. The question is where information of this kind comes from.
I. The Nature of Biological Information
Francis Crick, co-discoverer of the double-helix structure of DNA, defined what he called the "sequence hypothesis" in 1958: the sequence of bases in DNA specifies the sequence of amino acids in proteins, and this specification is not determined by any chemical law. The bases in DNA do not attract their neighbors by chemical affinity. They are physically equivalent in terms of bonding. The information they carry is entirely in their arrangement — not in their chemistry.
This is the central point that is frequently missed in popular discussions of biology. DNA is not merely complex — it is specified. There is a difference between a long random polymer and a long polymer whose sequence encodes a functional protein. The difference is precisely what information theorists call specificity: the sequence matters, and it matters in a way that cannot be explained by the physical or chemical properties of the components.
Claude Shannon, in his 1948 paper establishing information theory, defined information mathematically in terms of the reduction of uncertainty across a set of possible outcomes. DNA satisfies Shannon's criteria precisely: it is a discrete, linear code, with four symbols (A, T, G, C), in sequences that are not determined by any rule derivable from the symbol set itself. The code is arbitrary in the same sense that English is arbitrary — the word "dog" has no physical resemblance to a dog. The relationship between sequence and function is assigned, not compelled by physics.
II. Where Does Specified Information Come From?
In every domain of human experience, we observe one consistent pattern: specified, functional information has an intelligent source. Books, software, engineering blueprints, musical notation, highway signs — every instance of functional, specified information we have ever observed was produced by a mind. We have never observed a physical or chemical process spontaneously generating it.
This is not a philosophical claim about what physical processes could do in principle. It is an empirical claim about what we have actually observed. The inference is inductive: from the universal experience that specified information has intelligent causes, we infer that the specified information in DNA has an intelligent cause. This is the same logic by which a geologist infers human tool-making from a shaped flint, or an archaeologist infers inscription from marks on stone. The inference is not mystical. It is pattern recognition.
III. The Origin of Life Problem
The origin of life — abiogenesis — is the point at which the information argument is most acute. Before the first self-replicating molecule existed, there was no natural selection. Before natural selection, there was no mechanism to preferentially retain functional sequences over non-functional ones. Yet functional sequences are required for the first self-replicating molecule to exist. This is not a gap in our knowledge of chemistry. It is a logical circularity at the foundation of the problem.
The RNA World hypothesis — currently the leading scientific framework for abiogenesis — proposes that RNA molecules capable of both information storage and catalytic function preceded DNA and proteins. This is a serious research program with genuine progress. It does not resolve the information problem. It relocates it: the first functional RNA molecule still required a specific sequence to perform its role. Where did that specification come from before selection existed to preserve it?
Fred Hoyle — an atheist and one of the most respected astrophysicists of the 20th century — calculated the probability of assembling a functional protein by random chemistry at approximately 1 in 10⁴⁰. He famously compared this to the probability of a tornado assembling a Boeing 747 from parts scattered across a junkyard. Hoyle drew no theological conclusions from this — he proposed panspermia instead. But the calculation stands independent of its author's preferences.
IV. The Genetic Code as an Arbitrary Assignment
The genetic code — the mapping of codons (three-base sequences) to amino acids — is nearly universal across all life on Earth. Every organism, from bacteria to humans, uses the same code with only minor variations. This universality is explained by common descent from a single origin. But the code itself is arbitrary in a philosophically precise sense: there is no chemical reason why the codon CAG codes for glutamine rather than any other amino acid. The assignment is functional but not chemically compelled.
Hubert Yockey, an information theorist who worked on the Manhattan Project and later turned to biology, argued in his 1992 book Information Theory and Molecular Biology that the genetic code satisfies all the formal requirements of a genuine communications system — including a sender, a channel, a receiver, and an encoding scheme that is independent of the physical medium carrying it. He was explicit: the code is not an analogy to a communications system. It is a communications system.
V. What the Evidence Asks of You
The argument from biological coherence does not require you to reject evolution. Evolution can explain the diversification of life from the first self-replicating molecule forward. What evolution cannot explain is the origin of the first functional, specified information that made the first self-replicating molecule possible. The argument is not against biology. It is about the preconditions of biology — the information structure that biology presupposes but cannot itself generate.
The inference from specified, functional information to an intelligent source is the same kind of inference used in every branch of forensic, archaeological, and intelligence science. It is not a fallacy. It is not a gap argument. It is an argument about the origin of information — about what kind of cause is sufficient to produce what we actually observe.
The genome is not a metaphor for a library. It is a library. The question is whether libraries write themselves — and whether any honest examination of the evidence supports that conclusion.
The biological information argument draws on information theory, molecular biology, and philosophy of science simultaneously. The scientific literature is genuine and the philosophical stakes are high. Read the best work on both sides.
- Crick, F.H.C. (1958). "On Protein Synthesis." Symposia of the Society for Experimental Biology, 12, 138–163. The original statement of the sequence hypothesis and the Central Dogma of molecular biology. The paper in which Crick first articulated that biological information is sequence-dependent rather than chemistry-dependent. Search this source ↗
- Shannon, C.E. (1948). "A Mathematical Theory of Communication." Bell System Technical Journal, 27, 379–423. The foundational paper of information theory. Understanding this paper is prerequisite to understanding the biological information argument. Shannon's framework applies directly to DNA whether or not he intended it to. Read free ↗
- Meyer, S.C. (2009). Signature in the Cell. HarperOne. The most thorough book-length treatment of the biological information argument from an ID perspective. Engage the evidence on its own terms before accepting or rejecting the conclusion. Search this source ↗
- Yockey, H.P. (1992). Information Theory and Molecular Biology. Cambridge University Press. The rigorous information-theoretic treatment by a non-ID physicist who applied Shannon's mathematics to biology. Yockey was not a theist — his conclusions arise from the mathematics, not from theology. Search this source ↗
- International Human Genome Sequencing Consortium (2001). "Initial sequencing and analysis of the human genome." Nature, 409, 860–921. The primary scientific publication of the human genome sequence. The raw data that establishes the scale of biological information in a single human cell. Read source ↗
- Szostak, J.W. (2012). "The eightfold path to non-enzymatic RNA replication." Journal of Systems Chemistry, 3(2). Current research on the RNA World hypothesis — the naturalistic framework that best addresses abiogenesis. Read this alongside Meyer to understand the actual state of the debate. Search this source ↗
Where Does This Argument Lead You?
Select the conclusion that most honestly fits your assessment.