But back to biology, where the neutralists’ most outspoken proponent was the Japanese scientist Motoo Kimura, who had developed a sophisticated and successful mathematical theory to explain the evolutionary fate of such neutral mutations. Kimura asserted that most genetic variation seen in nature is neutral. The genomic era has taught us that he was wrong on that count—neutral variants are no more frequent than those providing an advantage. However, his hunch of neutral change’s being important was dead on, though it took another few decades to understand why.
One reason is that neutral change is critical for navigating genotype networks. Neutral change provides the browsers of nature’s libraries with a safe path to innovations through treacherous territory of meaningless texts. Without genotype networks and the neutral changes they allow, the exploration of nature’s libraries would be just about impossible.
Another reason is that a change that is neutral when it first appears doesn’t have to stay that way. Once-neutral changes can turn into essential parts of innovations—like the theorems of number theory. And once they do, natural selection can preserve them. Which means that both selectionists and neutralists had a point, because both kinds of changes are essential in evolution. After neutral changes have paved the way to an innovation, selection preserves those neutral changes that contributed to the innovation.
A single example from a well-studied RNA enzyme known as the hammerhead ribozyme can illustrate to what extent neutrality and genotype networks can accelerate evolution’s search for innovations. The job of this particular RNA enzyme is to cleave RNA at a particular point along its nucleotide strand. The shape of the ribozyme, named for its apparent resemblance to the hammerhead shark, is what allows it to perform this job—adequately but not necessarily optimally. Somewhere out in the vast library of all RNA molecules might be new shapes, new phenotypes that would endow the ribozyme with sharper blades.
If no genotype networks existed, then a crowd of readers in the RNA library—an evolving RNA population—would have to congregate around the forty-three-letter-long text that encodes the RNA, and could only explore the shapes that are one letter change away. This particular RNA enzyme happens to have 129 neighbors, and because we can compute their shapes, we can determine that there are forty-six new shapes in this neighborhood.27 That’s the number of shapes evolution can explore without genotype networks.
And with them? If we only step to the text’s neutral neighbors—those with the same hammerhead shape—and determine the shape of all their neighbors, we already find 962 new shapes. And if we just walk one step further, to those neighbors’ neutral neighbors, we find 1,752 new shapes. Just two steps along this ribozyme’s genotype network, we can access almost forty times more shapes than in its immediate vicinity. The genotype network of the hammerhead shape of course extends much further than just two steps, and it has more than 1019 members, too many to count all the new shapes near them with current computers.28 But we can say with certainty that countless millions and billions of new shapes are near them, all explorable because evolution’s readers can spread out along the genotype network without suffering death.29
That’s how much genotype networks accelerate innovation. They are like the warp drives of Star Trek, science fiction’s solution for faster-than-light interstellar travel. Extrapolate from the hammerhead ribozyme and imagine that evolution had unfolded merely forty times more slowly without genotype networks. Instead of being four billion years along its path, life would have evolved only as far as it did during the planet’s first hundred million years. A few kinds of bacteria would be around, but certainly no multicellular organisms, let alone fish, land plants, dinosaurs, or nonfiction book authors. And genotype networks accelerate evolution much more than fortyfold—so much more we can’t even yet compute it. Without them, life would never have crawled out of the primordial soup.
Science fiction also has another solution to faster-than-light travel: changing the shape of space itself. Inventive science-fiction authors have postulated technologies like “wormhole drives” that allow instantaneous travels between places thousands of light-years apart. It turns out that genotype networks also do something similar. They shrink the distance between texts in the libraries of metabolisms, macromolecules, and regulatory circuits.
FIGURE 19. The square stands for the entire library, the circle for a neighborhood of some genotypic text (dot) in the library.
Imagine a crowd of evolution’s readers—organisms in a population—congregating near a text describing a circuit with a specific expression code that helps shape some body part, like a bird’s wing. Now imagine that somewhere in the library of regulatory circuits a new code exists that modifies the wing to make it slightly more aerodynamic or lighter. The further the readers must travel to find it, the more time they need to find this innovation.
Browsing through such an enormous library seems, at first glance, like hunting for a particular needle in a haystack. You might find the right needle immediately, but chances are you would have to examine most of the haystack—perhaps all of it—before achieving success. Common sense dictates that the same applies to the library, that a new expression code is like a single needle in a haystack many times the size of the universe.
But common sense fails in this library. We had already learned this from our discovery that there are innumerable circuits with the same expression code—the haystack has many needles—but the library is even more bizarre, as we found out by searching for circuits with specific new expression codes. In this search, we created arbitrary expression codes—thousands of them—and for each such code we used our computers to generate a pair of circuits, where the first circuit produced one of these codes, and the second produced the other. The two circuits differed also in most of their wiring pattern, in the who-regulates-whom among the circuit genes. We then changed the first circuit gradually, one wire at a time, requiring that each such gene regulation change preserve the circuit’s expression code. How close could we get to the second circuit? Very close, we found out, for example to within 85 percent of the second circuit’s wiring for circuits of twenty genes. In other words, starting from anywhere in the library—anywhere—you need not walk very far, only fifteen steps away from a genotype network, before finding the genotype network of any other circuit. It is as if your needle were always nearby, no matter where you started to search.30
If this doesn’t sound strange enough, get ready for something even stranger.
Imagine that the square in figure 19 is the library, and the dot is a single text in it. The circle around this dot has a radius that is 15 percent as long as the sides—this is how far a reader would have to travel from a genotype network, on average, before finding a specific new expression code, as we had found out from exploring the circuit library. A simple calculation shows that a circle with a radius of fifteen centimeters, inside a square that is one hundred centimeters on a side, has an area of 707 square centimeters, a little more than 7 percent of the square’s area.
Actual libraries aren’t two-dimensional, of course. They exist in three dimensions. For simplicity’s sake, let’s say that our library building is housed in a three-dimensional cube, and the area of the library containing circuits with a new expression code is now a sphere. If so, then the sphere inscribed inside the library will now have a radius 15 percent as long as the sides of the cube, just as with the square. The ratio of their volumes, however, becomes very different. The volume of the sphere covers not 7 percent of that of the cube, but only 1.4 percent.
The libraries of regulatory circuits, of course, aren’t three- or even four-dimensional. They occupy higher dimensions, where cubes become hypercubes and balls become hyperspheres. In four dimensions, our hypersphere—still with the same radius—contains 0.2 percent of a hypercube’s volume. In five dimensions, it covers 0.04 percent, and so on.
In the number of dimensions where our circuit library exists—get ready for this—the sphere contains neither 0.1
percent, 0.01 percent, nor 0.001 percent. It contains less than 0.00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000001 percent, or one 10-100th, of the library.31 This is the tiny fraction of the circuit library a reader needs to explore, starting from its own genotype network, to find a circuit with any one expression code. That it is so small emerges from a simple geometric principle in spaces of ever-higher dimensions: A ball with constant radius occupies ever-decreasing and ever-tinier fractions of a cube’s volume.32 (This volume decreases not just for my example of a 15 percent ratio of volumes, but for any ratio, even one as high as 75 percent, where the volume drops to 49 percent in three dimensions, to 28 percent in four, to 14.7 percent in five, and so on, to ever-smaller fractions.)33
The same counterintuitive phenomenon holds in other libraries of innovation: The more dimensions they have—the larger their collection of metabolisms or molecules—the smaller the distance to find specific innovations. A browser who starts with a metabolism that can survive on some foods and then blindly searches for one that can thrive on others needs to change only a few reactions and explore a tiny fraction—too small to imagine—of the metabolic library before stumbling upon the right text. The same holds true for RNA. Starting from an existing RNA molecule, a nearby molecule with a new shape—any new shape you choose—will be found after changing only a few of the molecule’s nucleotide building blocks and having explored a tiny fraction of the library.34
The astonishing fact that evolution needs to explore one 10-100th of a library to secure the arrival of the fittest goes a long way to explain how blind search produces life’s immense diversity. Evolution does not have to search the entire haystack, because the haystack contains more than one needle. In fact, thanks to robustness, and the genotypic disorder it permits, the haystack contains too many needles to count, and they are organized into sprawling but navigable networks.
And if we remember that the neighborhood of each text is extremely diverse, we have understood one more feature of the library’s organization: Genotype networks not only range far and wide but also are tightly interwoven. They form a dense tissue of networks, each genotype network surrounded by many others, interwoven with them on all sides, a tissue so complex that it looks nowhere the same, consisting of millions, billions, or more of different strands, each one corresponding to a different phenotype. If each strand had a different color, this tissue would be woven in such an intricate way that near any one strand, threads of billions of other colors would pass. Only a high-dimensional space can host such a fabric, whose texture is intricate beyond our grasp. This fabric is different from anything we know. It is hidden behind the visible splendor of each living thing, yet all this splendor emerges from it.
Because genotype networks and their fabric are a consequence of robustness, robustness is immensely valuable to innovation. But valuable things are usually not free, and robustness is no exception. Its price—a high one—is complexity.
It’s almost too easy to criticize complexity. In Lewis Carroll’s Through the Looking-Glass, Alice is navigating the tale’s fantastic chessboard when she is attacked by the Red Knight, who mistakes her for a white pawn. In the nick of time, Alice is rescued by his opposite number, the White Knight—tellingly, an inventor—who is eager to show his new friend his latest innovations, including boxes that open on the bottom to keep out the rain, a device for trapping mice when they appear on a horse’s back, and a dessert made of blotting paper, sealing wax, and gunpowder:
“You see,” he went on after a pause, “it’s as well to be provided for everything. That’s the reason the horse has all those anklets round his feet.”
“But what are they for?” Alice asked in a tone of great curiosity.
“To guard against the bites of sharks,” the Knight replied. “It’s an invention of my own.”
The White Knight is so handicapped by his complex and fantastic inventions that he is literally unable to ride and to accompany Alice on her journey, and so he quickly departs the story. He lives on, however, as an object lesson in the importance of simplicity.
Long before Carroll wrote his story, the fourteenth-century English friar William of Ockham had already expressed an enthusiasm for simplicity when he coined a now famous principle of parsimony: that phenomena should always be understood using the fewest possible facts, or “entities,” as Ockham called them. This ideal—often called Ockham’s (or Occam’s) razor—is usually held up to scientific explanations that are supposed to be both truthful and beautiful. But it could apply equally to the kinds of machines inventors and engineers build, even though engineers already have their own, more earthy motto: KISS, for Keep It Simple, Stupid.
The ideal of simplicity is not just an aesthetic ideal or a philosophical principle. In engineering it also has an economic angle. It costs money to manufacture the parts of a machine. More parts cost more money, a prospect that any sane manufacturer wants to avoid. In addition, assembling a complex machine is more error-prone. Simplicity is better for building machines that work.35
Anybody who has struggled to understand living beings, and has despaired at their complexity, will sympathize with this yearning for simplicity. Life seems unnecessarily complex in many ways. The regulation circuit that divides insects into fourteen segments contains dozens of molecules,36 but scientists have known for many years that just two molecules interacting in the right way could achieve the same goal.37 As if to spite us, thousands of insect species segment their bodies in a way that not only took decades to understand but that no self-respecting human engineer would ever devise. And remember the road networks of metabolisms, full of redundant lanes, alternative routes, and unused back alleys. They all raise the same question: Why? Why doesn’t ruthlessly efficient nature get rid of all this complexity?
The answer is “the environment”—or rather, “the environments.” What looks like a wastefully complex suite of genes is actually the secret to survival in more than one environment.
In the kind of nutrient-poor environment in which E. coli has only a single carbon source to manufacture those sixty essential biomass units that include amino acids and DNA nucleotides, nearly three-quarters of the bacterium’s metabolic reactions are completely dispensable.38 Knock them out and life continues—robustness.
But environments change. If the sole source of carbon changes from glucose to ethanol, some of those “dispensable” reactions can keep the biomass factories running. Each of the eighty carbon sources from which E. coli can synthesize biomass needs some dedicated reactions. And carbon is only one of the essential elements—metabolizing sources of other elements needs further reactions. A large collection of metabolic reactions makes an organism viable in multiple environments. In biology, increased complexity means increased robustness to environmental change.
For the same reason of changing environments, duplicate genes often persist in the genomes of organisms. Duplicate genes, like humans, are created equal, but they do not stay that way for long. They accumulate mutations that alter their DNA and its molecular meaning, and lead to increased specialization on one environment. Some human duplicate enzymes are best at cleaving molecules in the chemical environment of the liver, whereas others operate optimally in the brain. One duplicate yeast protein is best at importing glucose into the cell when this nutrient is abundant, whereas its duplicate partner specializes at scavenging glucose when it is scarce. The redundancy of many gene duplicates is more apparent than real, because they ensure robustness to changing environments.
The world of technology provides examples as well, for even though engineers prize simplicity, they must also design for changing environments. If you want to travel with a river’s current, a simple wooden raft will do the trick. But to cross the river or steer the raft, you already need more complexity: a rudder. If you want to avoid getting soaked by waves, you need a hull. To navigate upriver, you need oars or a sail. The simplest sail—the kind of square rig that Ph
oenicians and Egyptians had already built five thousand years ago—works well to sail downwind, but it is less effective when wind directions change, and useless for sailing upwind. To do that well you need a more complex fore-and-aft rig with two sails, a jib in front of the mast and a mainsail behind. Navigating a changing environment—current, waves, and wind—needs complex technology.
The converse is equally true, at least in biology. Over time, unchanging environments result in less complexity, because robustness becomes less important. To find examples, you don’t have to look much farther than a houseplant—or, more accurately, to the insects living on one.
Aphids, also called plant lice, blackflies, and greenflies, have been blood enemies of farmers and gardeners for millennia, though only a few hundred of the more than four thousand aphid species suck the sap of agriculturally valuable plants—not only of the houseplants in suburban homes, but of cotton, fruit trees, and grain crops. They were complicit in both the Irish potato famine of the 1840s and the Great French Wine Blight of the 1850s. They are among the most destructive of all insect parasites. Yet aphids are also valuable, at least to science.39 For deep inside them live other, even smaller organisms that can teach us a lot about both robustness and complexity.
Many people know that aphids suck sap, but fewer know that sap is a very poor food. It lacks essential molecules, including several amino acids. To get these, aphids have teamed up with relatives of the bacterium Escherichia coli, a species called Buchnera aphidicola that inhabits the bodies of aphids.
The alliance between Buchnera and aphids benefits both species and is also known as an endosymbiotic mutualism. It is a remarkably close relationship. Buchnera does not just live on or near aphids. Its cells live inside the aphid’s cells, where they provide their host with a vital service: They manufacture essential food molecules, especially amino acids that aphids cannot manufacture themselves, and that plant sap does not contain. Buchnera is a tiny food factory that keeps aphids alive.
Arrival of the Fittest: Solving Evolution's Greatest Puzzle Page 19