Skip to main content

Command Palette

Search for a command to run...

The Protein Whisperer

What Evolution Remembered

Published
7 min read

Dr. Maya Chen had spent fifteen years staring at the same problem, and it stared back at her the way all great problems do — with patience, and without mercy. She worked in a mid-sized university laboratory that smelled permanently of ethanol and cold coffee, surrounded by centrifuges that hummed like anxious monks and whiteboards covered in the accumulated frustration of many failed hypotheses. Her subject of study was proteins, and proteins, as she had come to understand them, were among the most maddeningly beautiful things in existence.

Every living creature ran on proteins. The enzymes that broke down food in your stomach, the hemoglobin carrying oxygen through your arteries, the antibodies patrolling your bloodstream for invaders — all of them proteins, all of them doing their work with a specificity and elegance that no human engineer had ever matched. But proteins kept their secrets jealously. They performed their functions not merely because of what they were made of, but because of the shape they assumed, and that shape was extraordinarily difficult to know.

Ravi, her graduate student, had arrived from Hyderabad with a sharp mind and an endearing habit of asking the questions that everyone else considered too basic to voice. On a grey Tuesday afternoon, while rain tapped steadily at the laboratory windows, he sat across from her and asked one of those questions. She had been explaining the problem of protein folding, and he had listened carefully, and then he said that it sounded impossible. She told him that it had been, for about fifty years.

To understand why, one had to begin with the alphabet. Proteins were built from amino acids, a set of twenty molecular building blocks, strung together in sequences that could run to tens of thousands of units in length. A gene was essentially an instruction: assemble these amino acids, in this order. But the resulting chain did not remain a chain. It folded. Guided by forces that operated at the atomic scale — hydrogen bonds drawing certain atoms toward each other, hydrophobic regions fleeing from water, electrical charges attracting and repelling across infinitesimal distances — the protein collapsed into a precise three-dimensional shape. That shape was not decorative. It was functional. The contours and cavities of a folded protein determined whether it would act as a catalyst, a carrier, a signal, a structural support, or a lock awaiting the right molecular key. When the folding went wrong, as it did in diseases like Alzheimer's and Parkinson's, the consequences spread through the body with slow catastrophe.

The only reliable way to observe a protein's shape, for most of the twentieth century, was a technique called X-ray crystallography, in which researchers coaxed proteins into forming crystals and then bombarded those crystals with X-ray beams, interpreting the resulting diffraction patterns to reconstruct the molecular architecture. It was painstaking work. A single structure could demand years of effort and sometimes could not be resolved at all. The known universe of proteins numbered in the hundreds of millions. The library of mapped structures, after decades of global scientific effort, contained perhaps two hundred thousand. The distance between what existed and what was known was, in practical terms, almost incomprehensible.

Maya walked to the whiteboard when she reached this part of the explanation, as she always did when ideas required space. She drew two parallel lines representing protein chains and spoke about evolution, which was, she said, simply the longest experiment the planet had ever conducted. Over billions of years of mutation and selection, proteins had been tested relentlessly. When an amino acid in a functional protein changed, the protein frequently broke, and the organism that depended on it died. But sometimes, when one amino acid changed, another changed alongside it, somewhere else in the chain, compensating, preserving the fold, maintaining the shape that kept the creature alive. These correlated mutations, tracked across thousands of species and millions of years, carried a hidden message. Amino acids that changed together across evolutionary time were, almost invariably, close neighbors in three-dimensional space. They were the protein's autobiography, written in the language of survival, and AlphaFold had learned to read it.

The first versions of AlphaFold, developed by a team at DeepMind, demonstrated that a machine learning system trained on the evolutionary record of protein sequences could predict folded structures with an accuracy that stunned the scientific community. The third iteration extended the ambition considerably. Proteins, it turned out, rarely acted in isolation. They bound to strands of DNA, latched onto RNA molecules, docked with small chemical compounds, and clasped the surfaces of other proteins in interactions whose geometry was as important as any individual structure. AlphaFold 3 was trained to model these interactions, to predict not merely the shape of a protein in solitude but the shape it assumed in relationship, the configuration it adopted when it was, as Maya put it to Ravi, moving through a room and deciding who to embrace.

The training data was the accumulated structural knowledge of biology — every crystal structure, every cryo-electron microscopy image, every nuclear magnetic resonance map that the scientific community had produced across decades of effort — digested by a system capable of finding patterns at a depth no human analyst could reach. The network did not memorize. It generalized. It extracted from the data something resembling an understanding of the underlying physical grammar of molecular shape, and it applied that understanding to sequences it had never encountered.

The practical consequences arrived quickly. Structures that had resisted experimental determination for years were predicted in minutes. Researchers working on neglected tropical diseases, on antibiotic-resistant bacteria, on cancers that had proven stubbornly unresponsive to existing treatments, found themselves suddenly able to examine the shapes of proteins that had previously been inaccessible. Drug designers could identify binding pockets — the molecular hollows and clefts where a carefully shaped compound might lodge itself and alter a protein's behavior. A laboratory in Seoul reported progress on a rare childhood disease. A researcher working in Kenya mapped a parasite protein that had never been structurally characterized. The tool was made publicly available, and people used it everywhere, from graduate seminars to high school science competitions.

Ravi asked, at the end of one of these conversations, whether AlphaFold 3 was simply correct, whether the problem had been solved and the remaining work was merely application. Maya gave him the kind of look that good scientists reserve for questions that contain comfortable assumptions. The system struggled with disordered proteins, she explained — molecules that did not hold a fixed shape but instead existed in a perpetual fluid motion, adopting different conformations in different contexts. It occasionally missed rare structural states. Its predictions, however accurate, remained predictions, models of what a protein probably looked like rather than direct observations of what it demonstrably was. Experimental confirmation still mattered. The picture was clearer, not complete.

But the nature of the problem had shifted, and that was perhaps the most significant thing. For half a century the central question had been whether the shape of a protein could be found at all, whether the gap between sequence and structure could be crossed. That question had changed. The new question was what to do with structures once they were known, how to interpret the shapes, how to use them to design molecules that might intervene in disease, how to understand the dynamic interactions between proteins and the environments they inhabited. It was, as Maya had observed, a considerably better problem to be working on. She turned away from the whiteboard and looked at the rain still falling outside, and felt, as she sometimes did in moments of genuine scientific progress, the specific satisfaction of a door that had stood closed for a very long time finally swinging open onto something worth seeing.

20 views