64 Human Genomes Sequenced Will Serve as New Reference for Genetic Variation and Predisposition to Human Diseases
Researchers at the University of Maryland School of Medicine (UMSOM) co-authored a study, published today in the journal Science, that details the sequencing of 64 full human genomes. This reference data includes individuals from around the world and better captures the genetic diversity of the human species. Among other applications, the work will enable population-specific studies on genetic predispositions to human diseases as well as the discovery of more complex forms of genetic variation.
Twenty years ago this month, the International Human Genome Sequencing Consortium announced the first draft of the human genome reference sequence. The Human Genome Project, as it was called, required 11 years of work and involved more than 1000 scientists from 40 countries. This reference, however, did not represent a single individual, but instead was a composite of humans that could not accurately capture the complexity of human genetic variation.
Building on this, scientists have conducted several sequencing projects over the last 20 years to identify and catalog genetic differences between an individual and the reference genome. Those differences usually focused on small single base changes and missed larger genetic alterations. Current technologies now are beginning to detect and characterize larger differences – called structural variants – such as insertions of new genetic material. Structural variants are more likely than smaller genetic differences to interfere with gene function.
The new finding in Science announced a new and significantly more comprehensive reference dataset that was obtained using a combination of advanced sequencing and mapping technologies. The new reference dataset reflects 64 assembled human genomes, representing 25 different human populations from across the globe. Importantly, each of the genomes was assembled without guidance from the first human genome composite. As a result, the new dataset better captures genetic differences from different human populations.
“We’ve entered a new era in genomics where whole human genomes can be sequenced with exciting new technologies that provide more substantial and accurate reads of the DNA bases,” said study co-author Scott Devine, PhD, Associate Professor of Medicine at UMSOM and faculty member of IGS. “This is allowing researchers to study areas of the genome that previously were not accessible but are relevant to human traits and diseases.”
Institute of Genome Science (IGS)’s Genome Resource Center (GRC) was one of three sequencing centers, along with Jackson Labs and the University of Washington, that generated the data using a new sequencing technology that was developed recently by Pacific Biosciences. The GRC was one of only five early access centers that was asked to test the new platform.
Dr. Devine helped to lead the sequencing efforts for this study and also led the sub-group of authors who discovered the presence of “mobile elements” (i.e., pieces of DNA that can move around and get inserted into other areas of the genome). Other members of the Institute for Genome Sciences (IGS) at the University of Maryland School of Medicine are among the 65 co-authors. Luke Tallon, PhD, Scientific Director of the Genomic Resource Center, worked with Dr. Devine to generate one of the first human genome sequences on the Pacific Bioscences platform that was contributed to this study. Nelson Chuang, a graduate student in Dr. Devine’s lab also contributed to the project.
“The landmark new research demonstrates a giant step forward in our understanding of the underpinnings of genetically-driven health conditions,” said E. Albert Reece, MD, PhD, MBA, Executive Vice President for Medical Affairs, UM Baltimore, and the John Z. and Akiko K. Bowers Distinguished Professor and Dean, University of Maryland School of Medicine. “This advance will hopefully fuel future studies aimed at understanding the impact of human genome variation on human diseases.”
Reference: “Haplotype-resolved diverse human genomes and integrated analysis of structural variation” by Peter Ebert, Peter A. Audano, Qihui Zhu, Bernardo Rodriguez-Martin, David Porubsky, Marc Jan Bonder, Arvis Sulovari, Jana Ebler, Weichen Zhou, Rebecca Serra Mari, Feyza Yilmaz, Xuefang Zhao, PingHsun Hsieh, Joyce Lee, Sushant Kumar, Jiadong Lin, Tobias Rausch, Yu Chen, Jingwen Ren, Martin Santamarina, Wolfram Höps, Hufsah Ashraf, Nelson T. Chuang, Xiaofei Yang, Katherine M. Munson, Alexandra P. Lewis, Susan Fairley, Luke J. Tallon, Wayne E. Clarke, Anna O. Basile, Marta Byrska-Bishop, André Corvelo, Uday S. Evani, Tsung-Yu Lu, Mark J.P. Chaisson, Junjie Chen, Chong Li, Harrison Brand, Aaron M. Wenger, Maryam Ghareghani, William T. Harvey, Benjamin Raeder, Patrick Hasenfeld, Allison A. Regier, Haley J. Abel, Ira M. Hall, Paul Flicek, Oliver Stegle, Mark B. Gerstein, Jose M.C. Tubio, Zepeng Mu, Yang I. Li, Xinghua Shi, Alex R. Hastie, Kai Ye, Zechen Chong, Ashley D. Sanders, Michael C. Zody, Michael E. Talkowski, Ryan E. Mills, Scott E. Devine, Charles Lee, Jan O. Korbel, Tobias Marschall and Evan E. Eichler, 25 February 2021, Science.