The Mathematics of Birdsong

A wood thrush's song looks more like a Bach cadenza than a bird call. Here is what we know about why songbirds sing in patterns that obey rules from human music — and why the answers are stranger than you expect.

If you slow down a wood thrush's song to a quarter speed, it stops sounding like a bird and starts sounding like a flute solo. Multiple notes overlap because the thrush has two voiceboxes — one in each bronchus — that it can sing with independently. The phrasing has rhythm, not just chirp-and-pause. There is repetition with variation. There is, recognizably, a melody.

This is not a coincidence of perception. The mathematics of birdsong overlaps with the mathematics of human music in ways that have surprised researchers for a hundred years and continue to surprise them. The questions of why are still mostly open. The descriptions of what are increasingly precise, and stranger than the romantic literature ever guessed.

The two-voicebox problem

Most songbirds have a syrinx, an organ at the junction of the trachea and the two bronchi. Unlike the human larynx, which sits above the trachea and produces one tone at a time, the syrinx sits below the trachea and contains independent vibrating membranes on each side. A songbird can produce two notes simultaneously, or alternate them faster than a human ear can resolve.

This is why a wood thrush's song sounds like a duet with itself. The two sides of the syrinx are doing different things. In a 2000 paper in Nature, Roderick Suthers and colleagues showed that brown thrashers can produce notes from both sides at intervals of perfect octaves, fifths, and thirds — the consonant intervals of Western music — and that they do so with sub-millisecond timing precision.

The bird is not, of course, choosing octaves because they are pleasant to humans. The intervals likely emerge from the resonant properties of the vocal tract, which constrains which combinations are loud and clear. But the convergence is suggestive: there are mathematical reasons why some pitch ratios are easier to produce and easier to perceive, and birds and humans have arrived at similar answers.

Hermit thrushes and the harmonic series

In a 2014 study, Emily Doolittle and her collaborators analyzed the songs of the hermit thrush — a small North American songbird with a famously musical call. They found that the pitches the thrush selects for its phrases are not random. They cluster around small-integer-ratio intervals from a base note, the same intervals that define the harmonic series in Western music theory.

This is a strong claim and worth being careful about. The researchers used spectrograms to measure pitches across hundreds of recorded songs from many individuals, applied statistical tests for non-uniformity, and ruled out the simpler explanation — that the syrinx physically cannot produce other intervals. The harmonic preference was real and species-wide.

Why? The hypothesis is that harmonically related pitches are easier to remember and easier for other birds to recognize as a species signature. The same reason a melody is more singable than a random sequence of pitches: structure aids memory. Birds that need to learn and produce reliable songs over a lifetime may have evolved toward pitch sets that are easier to encode.

Rhythm: the universal grammar

If pitch is the contested ground, rhythm is where the convergence is overwhelming. Almost every songbird species that has been carefully studied uses isochronous timing — evenly spaced beats — at least some of the time. Some species, like the white-rumped munia, have rhythmic patterns sophisticated enough that researchers describe them in musical notation.

The European starling can perceive rhythm in the same way humans do: it can recognize a melody as the same melody when transposed to a different pitch range, distinguish duple from triple meter, and detect syncopation. These are abilities that, until the 2000s, were considered uniquely human or maybe shared with our nearest primate relatives. Starlings turn out to do them effortlessly.

This matters because it suggests that the brain machinery for music is not a human invention. Some of it is at least 300 million years old, predating the split between birds and mammals. We did not invent music; we inherited the components and arranged them differently.

Songs as algorithms

A nightingale's song repertoire is striking not just for its variety but for its grammatical structure. A male will sing dozens of distinct song types over a night, and the order in which he sings them is not random. Certain song types follow certain others more often than chance; others are almost never adjacent. Researchers have used Markov chains and context-free grammars to model these sequencing rules and found that the songs of an individual nightingale form a kind of language with its own probabilistic grammar.

This is not human language — there is no evidence of semantic content, no signs that song A "means" something different from song B. But the syntactic complexity is real. The bird is performing a generative process, drawing from a finite repertoire by rules that produce variation without chaos. This is, formally, what improvisation is.

Learned, not inherited

Most songbirds learn their songs the way children learn language: from adults, during a critical period in early life, with permanent consequences for accent and dialect. A white-crowned sparrow raised in isolation produces an impoverished, disorganized song. Raised among a different sparrow population, it adopts the local dialect.

This is one of the deep parallels with human music. Both songbird song and human music are cultural artifacts that pass through learning rather than instinct alone. Both have regional dialects. Both can change across generations. Both can be lost when traditions are interrupted.

Bird song researchers have documented dialect maps in song sparrows that mirror, in scale and fidelity, the language and music dialect maps human ethnographers draw in human populations. A sparrow from one valley sings a recognizably different song from a sparrow from the next valley over, even though they are the same species and could interbreed.

The hard question

The hard question is why. Why has evolution converged on musical structure for an activity whose immediate function is mate attraction and territory defense?

The leading hypotheses are not satisfying. Maybe complex song is a costly signal of fitness — only a healthy bird can afford to sing well, so the song advertises genetic quality. Maybe the structures we recognize as musical are simply the structures that any communication system gets pushed toward when the receiver has limited memory and pattern-matching cognition. Maybe pitch ratios that exploit the harmonic series are simply louder and clearer, and selection for audibility produces music as a side effect.

None of these explains why a hermit thrush sings what sounds, to humans, beautiful. None explains why human music has affected us for tens of thousands of years if its components are this old.

The most honest answer is that we do not know. We know that music is older than humans, that its components were assembled from earlier evolutionary substrates, and that other animals — most spectacularly birds — produce systems that are recognizably musical by every formal measure we have invented. Why those systems exist at all is a question that biology is still circling.

Listening differently

The pleasure of knowing some of this is that it changes the way you listen. A blackbird singing from a chimney is not just background. It is a learned, locally inflected, syntactically structured composition produced by an organ with two independent voices, drawing from a memorized repertoire that varies across populations and generations. It is not merely making sound. It is, in a strict and defensible sense, singing.

The next time you hear one, try to count the phrases. You will lose count. The repertoire is large enough, and the variations subtle enough, that no human casual listener can keep up. But the structure is there, ancient and elaborate, asking to be noticed.

Read more