Last updated:
Author(s):
Yaowu Liu, Tianying Wang
Publish date:
15 October 2025
Journal:
Journal of the American Statistical Association

Abstract

In linear regression models with non-Gaussian errors, transformations of the response variable are widely used in a broad range of applications. Motivated by various genetic association studies, transformation methods for hypothesis testing have received substantial interest. In recent years, the rise of biobank-scale genetic studies, which feature a vast number of participants that could be around half a million, spurred the need for new transformation methods that are both powerful for detecting weak genetic signals and computationally efficient for large-scale data. In this work, we propose a novel transformation method that leverages the information of the error density. This transformation leads to locally most powerful tests and therefore has strong power for detecting weak signals. To make the computation scalable to biobank-scale studies, we harnessed the nature of weak genetic signals and proposed a consistent and computationally efficient estimator of the transformation function. Through extensive simulations and a gene-based analysis of spirometry traits from the UK Biobank, we validate that our approach maintains stringent control over Type I error rates and significantly enhances statistical power over existing methods.

Related projects

We aim to develop a set of statistical tools that can increase the power of identifying genetic variants associated with diseases and health-related outcomes, uncover…

Institution:
Southwestern University of Finance and Economics, China

Scientific rationale: Individual-level genetic profile variations have different impacts on diseases. Numerous genes have been identified to be associated with complex diseases, including breast cancer,…

Institution:
Colorado State University, United States of America

All projects