Machine Learning-Based Multivariate Health Assessment Given Health Risk Factor Data
Principal Investigator: Dr Stuart Reynolds
Approved Research ID: 25687
Approval date: February 20th 2017
We will use advanced machine learning techniques to build models that predict the likelihoods of a variety of chronic diseases and their co-morbidities (diabetes, heart disease, hypertension, stroke). The predictive models we are building can be used for early and accurate detection of the elevated risk of a variety of chronic diseases. Early detection of the elevated risk of such conditions occurring within a year or two of detection can trigger preventative measures in a given patient that may delay or even completely avoid the onset of said condition. The public health implications of this, if successful, should be significant. The UK Biobank Resource is nearly unique in it's ability to support our research. We intend to publish our results. Physiosigns will use machine learning software to build models and perform tests to assess their reliability. Our team has been developing and applying machine learning technology to large data application areas for twenty years. Our core learning framework allows a variety of modeling techniques, such as logistic regression and neural network-based deep learning to be applied to thousands of unique partial data contexts (data subsets chosen automatically with online learning). The resulting individual models are combined with techniques drawn from mixtures of experts, typically resulting in predictions that are more accurate and robust than single predictor alternatives. We would like to use the full cohort - the maximum number available. Our methods can consider more risk factors and diseases with lower prevalence as the amount of data increases.