|
December 9, 2002
Mouse genome sequence published with first comparative
analysis of mouse and human genomes
By Tim Stephens
Researchers in UCSC's Center for Biomolecular Science and Engineering
(CBSE) made significant contributions to the analysis of the mouse genome
sequence announced last week by the international Mouse Genome Sequencing
Consortium.
|

|
| The mouse carries virtually the
same set of genes as the human, making it invaluable in research.
Image: National Institutes of Health |
The consortium published a high-quality draft sequence of the mouse
genome--the genetic blueprint of a mouse--together with a comparative
analysis of the mouse and human genomes describing insights gleaned
from the two sequences.
The paper appeared in the December 5 issue of the journal Nature.
The achievement represents a landmark advance for the Human Genome
Project. It is the first time that scientists have compared the contents
of the human genome with that of another mammal.
This milestone is all the more significant given that the laboratory
mouse is the most important animal model in biomedical research.
David Haussler, professor of computer science and director of the CBSE,
and CBSE research scientist Jim Kent both worked on the analysis of
the mouse and human genomes and are coauthors on the Nature paper.
Their previous contributions to the assembly and analysis of the human
genome sequence, which was published last year, earned them widespread
acclaim. Other members of the UCSC genome bioinformatics group who are
involved in the mouse project and listed as coauthors on the paper include
graduate students Ryan Weber, Krishna Roskin, Mark Diekhans, and Robert
Baertsch; postdoctoral researcher Terrence Furey; software project manager
Donna Karolchik; and software developers Angie Hinrichs and Matt Schwartz.
Kent has created a web-based mouse genome browser similar to the browser
he created for the human genome. The browser allows scientists around
the world to download data and detailed, customized analyses of the
genome. The mouse and human genome browsers and other useful tools for
researchers are available on the UCSC genome
bioinformatics site.
 |
| Jim Kent, left, and David Haussler Photo:
Paul Schraub |
The mouse sequence provides scientists a powerful research tool to
extract meaning from the human genome sequence. It allows them to recognize
functionally important regions in the human genome by virtue of the
fact that they are conserved through the 75 million years of evolution
separating humans and mice.
"Regions of DNA in genomes evolve under Darwinian selection when
they perform a useful function, or under random neutral drift when they
do not," Haussler said. "Comparing the human genome to the
mouse genome, we can see the statistical evidence for selection in some
regions and study the properties of the neutral drift in others."
About 95 percent of the human genome looks like it is evolving under
neutral drift, Haussler said. Surprisingly, the rate of neutral drift
varies regionally along the chromosomes, with some regions drifting
faster than others. The researchers were not able to identify any features
of the DNA that would account for these variations in mutation rates.
About 5 percent of the genome contains sections of DNA that are conserved
between human and mouse. Because these DNA sequences have been preserved
by evolution over tens of millions of years, scientists infer that they
are functionally important and under some evolutionary selection.
Because the mouse carries virtually the same set of genes as the human
but can be used in laboratory research, this information will allow
scientists to experimentally test and learn more about the function
of human genes, leading to better understanding of human disease and
improved treatments and cures.
The mouse genome sequence shows the order of the DNA chemical bases
A, T, C, and G along the 20 chromosomes of a female mouse of the "Black
6" strain--the most commonly used mouse in biomedical research.
It includes more than 96 percent of the mouse genome with long, continuous
stretches of DNA sequence and represents a sevenfold coverage of the
genome. This means that the location of every base, or DNA letter, in
the mouse genome was determined an average of seven times, a frequency
that ensures a high degree of accuracy.
Return to Front Page
|