Last updated:
Author(s):
Léo Henches, Jihye Kim, Zhiyu Yang, Simone Rubinacci, Gabriel Pires, Clara Albiñana, Christophe Boetto, Hanna Julienne, Arthur Frouin, Antoine Auvergne, Yuka Suzuki, Sarah Djebali, Olivier Delaneau, Andrea Ganna, Bjarni Vilhjálmsson, Florian Privé, Hugues Aschard
Publish date:
14 May 2025
Journal:
Human Genetics and Genomics Advances
PubMed ID:
40375557

Abstract

Polygenic risk scores (PRSs) models trained from genome-wide association study (GWAS) results are set to play a pivotal role in biomedical research addressing multifactorial human diseases. The prospect of using these risk scores in clinical care and public health is generating both enthusiasm and controversy, with varying opinions among experts about their strengths and limitations. The performance of existing polygenic scores is still limited but is expected to improve with increasing GWAS sample sizes and the development of new, more powerful methods. Theoretically, the variance explained by PRS can be as high as the total additive genetic variance, but it is unclear how much of that variance has already been captured by PRS. Here, we conducted a retrospective analysis to assess progress in PRS prediction accuracy since the publication of the first large-scale GWASs, using data from six common human diseases with sufficient GWAS information. We show that although PRS accuracy has grown rapidly over the years, the pace of improvement from recent GWAS has decreased substantially, suggesting that merely increasing GWAS sample sizes may lead to only modest improvements in risk discrimination. We next investigated the factors influencing the maximum achievable prediction using whole-genome sequencing data from 125,000 UK Biobank participants and state-of-the-art modeling of polygenic outcomes. Our analyses suggest that increasing the variant coverage of PRS, using either more imputed variants or sequencing data, is a key component for future improvements in prediction accuracy.

Related projects

Genetic epidemiology has entered the big data era and researchers have now access to both DNA data and a large number of disease-related traits including…

Institution:
Institut Pasteur, France

With the decreasing cost of DNA sequencing, large databases of human genomes are being collected in order to boost health related research, leading to the…

Institution:
University of Lausanne, Switzerland

All projects