Last updated:
Author(s):
Ya Cui, Wenbin Ye, Jason Sheng Li, Jingyi Jessica Li, Eric Vilain, Tamer Sallam, Wei Li
Publish date:
5 April 2024
Journal:
Cell
PubMed ID:
38582080

Abstract

The Genome Aggregation Database (gnomAD), widely recognized as the gold-standard reference map of human genetic variation, has largely overlooked tandem repeat (TR) expansions, despite the fact that TRs constitute ∼6% of our genome and are linked to over 50 human diseases. Here, we introduce the TR-gnomAD (https://wlcb.oit.uci.edu/TRgnomAD), a biobank-scale reference of 0.86 million TRs derived from 338,963 whole-genome sequencing (WGS) samples of diverse ancestries (39.5% non-European samples). TR-gnomAD offers critical insights into ancestry-specific disease prevalence using disparities in TR unit number frequencies among ancestries. Moreover, TR-gnomAD is able to differentiate between common, presumably benign TR expansions, which are prevalent in TR-gnomAD, from those potentially pathogenic TR expansions, which are found more frequently in disease groups than within TR-gnomAD. Together, TR-gnomAD is an invaluable resource for researchers and physicians to interpret TR expansions in individuals with genetic diseases.

Related projects

In this study, we will develop novel methods to identify who (which subgroup of people) have a higher risk of diseases (e.g., breast cancer) than…

Institution:
University of California, Irvine, United States of America

All projects