Last updated:
Author(s):
Paula Gablenz, Matteo Sesia, Tianshu Sun, Chiara Sabatti
Publish date:
2 April 2026
Journal:
Journal of the American Statistical Association

Abstract

We introduce local conditional hypotheses that express how the relation between explanatory variables and outcomes changes across different contexts, described by covariates. By expanding upon the model-X knockoff filter, we show how to adaptively discover these local associations, all while controlling the false discovery rate. Our enhanced inferences can help explain sample heterogeneity and uncover interactions, making better use of the capabilities offered by modern machine learning models. Specifically, our method is able to leverage any model for the identification of data-driven hypotheses pertaining to different contexts. Then, it rigorously test these hypotheses without succumbing to selection bias. Importantly, our approach is efficient and does not require sample splitting. We demonstrate the effectiveness of our method through numerical experiments and by studying the genetic architecture of Waist-Hip-Ratio across different sexes in the UKBiobank. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.

Related projects

Our goal is to develop new data analysis methods that are well suited to discover the many genetic signals that influence traits of medical relevance.

Institution:
Stanford University, United States of America

he goal of this project is to develop new data analysis tools that can help scientists discover how human genes influence genetic diseases or other…

Institution:
University of Southern California, United States of America

All projects