Health-related outcomes data
Primary care data
An interim release of primary care data for ~230,000 UK Biobank participants (up to 2016 or 2017 depending on the data supplier) was made available in 2019. This dataset contains data from the GP system suppliers and contains coded clinical events (including consultations, diagnoses, procedures and laboratory tests), prescribed medications (including prescription date, drug code and, where available, drug name and quantity) and a range of administrative codes (e.g. referrals to specialist hospital clinics). The data are coded using READ2, CTV-3, BNF and DM+D.
We are making up-to-date primary care data available on a regular basis solely for research related to COVID-19, subject to the Control of Patient Information (COPI) regulations.
Hospital inpatient data
Hospital inpatient data are available for the full cohort. This provides information on hospital admissions for each participant and includes data on date of admission, diagnosis (and underlying conditions) during admission, procedures and discharge information. These are coded using ICD-9, ICD-10, OPCS-3 and OPCS-4. Please refer to resource 138483 for more information on the inpatient data.
For more details on how the data was collected, mapped and validated, recent changes to the data structure as well as further information on how to access the hospital inpatient data, please refer to our Essential Information page.
First occurrences of medical conditions
A set of ‘first occurrence’ data-fields have been generated that map the clinical codes from primary care, hospital inpatient admissions, death records and self-reported medical conditions to 3-character ICD-10 codes and provide, for each participant, the date that code first occurred in any source. For more information please see:
Linkage to national death registries provides notifications of participant deaths (if in the UK), containing data on date and cause(s) of death. Further information can be found in resource 115559. These are coded using ICD-10.
Information on the most common causes of death in the cohort by age, time period and sex can be found below.
Linkage to national cancer registries provides notifications of cancer registrations and includes data on cancer diagnosis (ICD-9 and ICD-10) and cancer histology code. Further information can be found in resource 115558.
Information on the most common cancers by age, time period and sex can be found in Showcase. The number of prevalent (i.e. occurring before recruitment) and incident (after recruitment) cancer diagnoses by type of cancer can be found in category 100092 of Showcase and on the Essential information page.
Current censor dates for hospital inpatient data, death registry and cancer registry data can also be found in Showcase. Information on the most common types of cancers by age, time period and sex can be found below.
Algorithmically-defined health outcomes
To aid researchers, UK Biobank have generated algorithmically-defined health outcomes using the self-reported health information, hospital inpatient data and death data, providing information on first diagnosis, for each participant, of a small number of health conditions. For more information please see:
Future health outcomes
In addition to incorporating potential updates of death, cancer and hospital inpatient data, further linkage is underway to provide more detailed data on cancer outcomes, including the stage and grade of the tumour, in addition to treatment information, for the full cohort. Additional potential linkages are currently being considered, such as disease specific registries and clinical audit information.
The Relationship Between Ambient Atmospheric Fine Particulate Matter (PM2.5) and Glaucoma in a Large Community CohortSharon Y. L. Chua and et al
Approved Research ID : 2112
Shared mechanisms between coronary heart disease and depression: findings from a large UK general population-based cohortGolam M. Khandaker et al
Approved Research ID : 26999
Explore our data