Last updated:
Author(s):
Wei Cheng, Sohini Ramachandran, Lorin Crawford
Publish date:
7 June 2022
Journal:
iScience
PubMed ID:
35769876

Abstract

In this paper, we propose a new approach for variable selection using a collection of Bayesian neural networks with a focus on quantifying uncertainty over which variables are selected. Motivated by fine-mapping applications in statistical genetics, we refer to our framework as an “ensemble of single-effect neural networks” (ESNN) which generalizes the “sum of single effects” regression framework by both accounting for nonlinear structure in genotypic data (e.g., dominance effects) and having the capability to model discrete phenotypes (e.g., case-control studies). Through extensive simulations, we demonstrate our method’s ability to produce calibrated posterior summaries such as credible sets and posterior inclusion probabilities, particularly for traits with genetic architectures that have significant proportions of non-additive variation driven by correlated variants. Lastly, we use real data to demonstrate that the ESNN framework improves upon the state of the art for identifying true effect variables underlying various complex traits.

Related projects

Some case-control phenotypes, labelled only with 0/1, may actually contain distinct underlying sub-phenotypes; for example, depression is thought to be such a phenotype. Analysis of…

Institution:
Microsoft Corporation, United States of America

Patients with complex diseases can have different mutations within a single gene, or set of interacting genes, which predispose them to the same disease. To…

Institution:
Brown University, United States of America

All projects