Disease areas:
  • cancer and other tissue growths
Last updated:
Author(s):
Sun bin Kim, Joon Ho Kang, MyeongJae Cheon, Dong Jun Kim, Byung-Chul Lee
Publish date:
21 May 2024
Journal:
Scientific Reports
PubMed ID:
38773257

Abstract

In recent years, the utility of polygenic risk scores (PRS) in forecasting disease susceptibility from genome-wide association studies (GWAS) results has been widely recognised. Yet, these models face limitations due to overfitting and the potential overestimation of effect sizes in correlated variants. To surmount these obstacles, we devised the Stacked Neural Network Polygenic Risk Score (SNPRS). This novel approach synthesises outputs from multiple neural network models, each calibrated using genetic variants chosen based on diverse p-value thresholds. By doing so, SNPRS captures a broader array of genetic variants, enabling a more nuanced interpretation of the combined effects of these variants. We assessed the efficacy of SNPRS using the UK Biobank data, focusing on the genetic risks associated with breast and prostate cancers, as well as quantitative traits like height and BMI. We also extended our analysis to the Korea Genome and Epidemiology Study (KoGES) dataset. Impressively, our results indicate that SNPRS surpasses traditional PRS models and an isolated deep neural network in terms of accuracy, highlighting its promise in refining the efficacy and relevance of PRS in genetic studies.

Related projects

Our research aim is to develop a disease predicting model to enable screening or preventive therapies for an individual at high risk complex disease. We…

Institution:
Genoplan Korea Inc., Korea (South)

All projects