CLEMSON, South Carolina — A team of scientists from Clemson University and Cornell University is developing the first set of computational techniques that can predict how DNA mutations affect proteins and protein-to-protein interactions, which are vital in determining how the body’s tissues and organs function. Their study also holds the potential to accelerate the synthesis of new drug treatments for a variety of genetic disorders.
“Humans have about 20,000 different proteins in each cell, and each protein is involved in about five different interactions,” said Emil Alexov of Clemson’s department of physics and astronomy. “Imagine how that breaks down – each cell in our body then directs anywhere from 100,000 up to a million different interactions. Because DNA is different from person to person, these interactions are slightly different in each individual. Some of these differences are OK – that is the reason why we aren’t identical to each other. But some of these differences can cause diseases.”
Alexov is working with collaborators from Cornell with a $2.3 million grant from the National Institutes of Health.
Disease-causing mutations in DNA can code for the wrong amino acid – or sometimes delete an amino acid entirely – resulting in a misfolded protein that wreaks havoc on normal functioning.
The team’s goal, after a four-year study, is to be able to computationally predict how a mutation – like that implicated in cystic fibrosis – will affect corresponding protein interactions without the need for costly, time-consuming experiments.
“Human DNA contains about three billion base pairs. You cannot conduct three billion experiments, let alone billions and billions, once you consider all of the potential mutations,” Alexov said. “It’s simply not feasible; it’s got to be computationally done.”
To start, Alexov’s colleagues – professors Haiyuan Yu and Andrew G. Clark at Cornell – will home in on a total of 6,000 mutations: 4,000 that are common in the general population and 2,000 that will be nominated by researchers in the human genetics community.
After preparing the mutated samples, Yu and Clark will use high-throughput sequencing to generate millions upon millions of reads (short fragments of DNA sequence) that can be pieced together like a puzzle to render the original DNA sequence of the mutation. Through a handful of laboratory procedures, the pair will then test the mutant proteins to uncover interactions of interest and to discover which mutations result in enhanced or weakened interactions.
Alexov will direct the second half of the study by developing computational tools – using data points sent to him from Yu and Clark – to estimate the behavior of protein interactions.
“If you are lucky, there will be an experimental structure of the particular protein in question, and if you are luckier, the experimentally determined structure of this protein will interact with some other protein,” Alexov said. “That is the best-case scenario. But in the vast majority of cases, we aren’t that lucky, so we have to build our own structures.”
Alexov’s ultimate goal is to find a drug that can bind to a perturbed protein to restore its original shape and binding affinity and, therefore, its proper function – unlocking the potential to treat a multitude of genetic disorders.
“Often the best treatment can be highly facilitated if you know what the primary origin of the disease is,” Alexov said. “If we can understand how these DNA differences affect interactions in the proteins of our body, this will pave the way to develop personalized treatments for better patient care.”