Skip to navigation Skip to main content Skip to footer

Past data releases

Retrospective timeline of the data currently available

The UK Biobank resource was launched with the data collected at baseline made available in March 2012. The following timeline indicates when additional data was first made available for researchers to download.

Date

Data made available

Late 2023

Main Whole Genome Sequencing Release (500k) (Public)

  • The final tranche of individual-level and joint-called WGS data for all 500,000 participants.

Whole Genome Sequencing DRAGEN Release

  • Whole genome sequencing data for 500,000 participants, assembled using DRAGEN pipelines.

October 2023

Proteomics

  • Expanded proteomic data covering 3,000 proteins (up from the initial 1,500 proteins measured for the 56,000 participants included in phase 1), available in Category 1838.

Brain imaging

  • Functional and structural brain connectivity networks for more than 40,000 UK Biobank participants. This covers more than one million high-quality structural and functional connectomes for multiple parcellation granularities, several alternative measures of interregional connectivity, and a variety of common data pre-processing techniques. Atlas parcellations in native volumetric MRI space are available in Categories 200, 201, and 202, functional time series are available in Category 203, and structural connectomes are available in Category 204.

Questionnaires

  • Mental health and well-being: Data for over 175,000 participants from the Mental Well-being questionnaire, comprising a repeat of some of the questions asked in 2016, and an exploration of further issues related to mental health and distress (such as living conditions and social engagement) as well as data on COVID-19, including first and most recent dates of occurrence, methods of diagnosis and extent of recovery. Data includes 210 new data fields in Category 1500.
  • Cognitive function: Data in Category 116 for over 170,000 participants from four cognitive function tests, which aim to assess cognitive domains such as visual attention, processing speed and non-verbal reasoning.

COVID-19 case-control

  • A final update to the COVID-19 case-control data (Data-Fields 41000 and 41001), which accounts for recently added PCR test data covering to the end of the study period, and adds five new case-control study participants who were the last to attend imaging as part of the study.

Record linkage data

  • COVID-19 vaccination data (England): Record linkage data on COVID-19 vaccinations for England on the Data Portal. Data is restricted and is available for research relating to COVID-19 only; principal investigators should contact the Access Management Team for authorisation to access the data. Data-Field 32040 provides access to the table and Data-Field 32041 provides a summary field containing dates of vaccination.
  • Inpatient data (Scotland): Addition of small amount of Scottish inpatient data from Aug/Sep 2021 to the Data Portal and the summary fields on Showcase in Category 2000.
  • Updated algorithmically-defined outcomes and first occurrences data: Updated algorithmically-derived outcomes (Category 42) and first occurrences (Category 1712) fields incorporating the latest linkage data from hospital inpatient and death records.

AMRA 2023 IDP update

  • Update to the fields in Category 149 to include further longitudinal assessments of abdominal composition.

WGS QC fields

  • Quality control information from whole genome sequencing for 200,000 participants (Vanguard phase plus 150,000 publicly released main phase participants). Additional fields provided to identify sequencing provider and assembly pipelines used for each participant.

July 2023

OMOP

  • OMOP release on RAP: Transformed version of UK Biobank dataset to the OMOP Common Data Model available on RAP (Field 20142)

Metabolomics

  • Nightingale Second Phase Data (300k): An update to NMR-metabolomic data from Nightingale Health for 170,000 additional samples. Two new derived fields have been added for all samples - Glucose-lactate (Field 20280) and Spectrometer-corrected Alanine (Field 20281). See Category 220 for further details. Additionally, Phase 1 data has been updated with recalibration of a spectrometer for increased accuracy, and corrections to errors in a small number of samples. Please see Resource 130 for more information.

Questionnaires and Participant Contacts

  • Health and Wellbeing Questionnaire: New response data (2022-23) from ~195k participants on 154 questions relating to health and well-being in light of the COVID-19 pandemic, including long COVID symptoms. ​See Category 160 for more information.
  • Amalgamated list of contact points: ​​Data on personal contact made with UK Biobank by participants, either by attendance at an assessment centre, completing an online questionnaire or by using/returning a device, available in Category 2.

Genomics

  • 200k WGS phased data – BEAGLE: Phased VCF files based on 200k whole genome sequencing generated using BEAGLE software; see Field 20278.
  • 200k WGS Phased data – SHAPEIT: Phased VCF files based on 200k whole genome sequencing generated using SHAPEIT software; see Field 20279.

Health Linkage Data

  • Hospital Inpatient Data: Update of the Welsh and Scottish hospital inpatient data, including filling in missing Welsh diagnosis data from 2016 onwards and incorporating Scottish psychiatric inpatient records dating back to the 1990s. Inpatient data are available via the Data Portal, with summary fields in Category 2000.
  • Scottish cancer data (2021): Addition of missing Scottish cancer records for the period Jan 2021 - July 2021, available in Category 100092.
  • COVID-19 Test Data (Scotland): COVID-19 test results data have been updated for Scotland, including changes to the structure of the covid19_result_scotland table on the Data Portal. See the COVID-19 test result data dictionary for more information.
  • Updated algorithmically-defined outcomes and first occurrences data: Updated algorithmically-derived outcomes (Category 42) and first occurrences (Category 1712) fields incorporating the latest linkage data from hospital inpatient and death records.

Image-Derived Phenotypes

  • Calico spleen data: ​Image derived phenotypes of iron in the spleen from IDEAL and gradient echo protocols for ~45k participants, available in Category 158.
  • Derived cardiac measures (2023 release): Update to cardiac derived measures in Category 157: additional structural and functional phenotypes generated by Imperial College London for a further 38,000 participants, including a QC field indicating machine learning pipeline version.
  • DXA - hip shape and osteoarthritis characterisation: ​Characterisation of hip bone and joint metrics derived from 40,000 DXA images which have been linked to clinical outcomes of hip pain and osteoarthritis. The data describe acetabular, superior, and inferior femoral head osteophytes grades, and a morphological characterisation of hip shape (including joint space narrowing and acetabular dysplasia).

Other Derived Data

  • Residential location data - derived location measures: New fields for census area and local authority districts derived from participant home locations at baseline in Category 703. Note that these fields are restricted and approval for access will preclude access to the home location grid coordinate fields in Category 702.
  • Update to MET score data: Update to the MET score fields in Category 54 to address differences between the original derivation and the published IPAQ guidelines and to add data for repeat assessments.

March 2023

Proteomics Phase One release: Olink Explore assay data for 55,000 participant samples providing normalised expression measures for 1,500 proteins (Category 1838).

Whole genome sequencing

  • Quality control data for the whole genome sequencing 200k release: Quality control information from whole genome sequencing for first 200,000 participants (Category 187).
  • Plink and bgen format data for whole genome sequencing 200k release: Conversion of the whole genome sequencing 200k GraphTyper Joint Variant Call to Plink and bgen formats, available in Data-Field 24305 and Data-Field 24306.
  • Microsatellite data from whole genome sequencing: Microsatellite data from whole genome sequencing for 150,000 participants (Data-Field 23365).

Eye Data

  • Retinal optical coherence tomography (OCT): Retinal OCT data in Category 100016, which was re-introduced into repeat imaging at the end of 2022.
  • Refractometry data: Refractometry data in Category 1419, which was re-introduced into repeat imaging at the end of 2022.
  • Macular thickness from OCT: Macular thickness measures for left and right eye, from OCT images at baseline and first repeat visit (Category 100079).

Image-derived phenotypes (IDPs)

  • Derived Kidney Measures: Kidney distance, fusion, and parenchyma generated by Uppsala from MR images for 40,000 participants in Category 159.
  • Update to derived brain measures: Update to fields in Category 100 with addition of new fields such as quantitative statistical mapping and arterial spin labelling.
  • Update to DXA machine-derived fields (50k): Update to DXA machine-derived fields in Category 103 with addition of approximately 11,000 participants.
  • QC data for liver MR derived fields: A QC field (Field 40063) for Perspectum liver IDPs (Field 40060, Field 40061 and Field 40062) to document the change in the acquisition protocol for the MRI images upon which they were based.

Health record linkage data

  • Hospital Inpatient Data (England): Inpatient data have been updated for England, including filling in missing critical care data from early 2021. Inpatient data are available via the Data Portal, with summary fields in Category 2000.
  • Death Registry Data (England, Scotland, and Wales): Death data have been updated for England, Scotland and Wales, available via the Data Portal and the Showcase fields in Category 100093.
  • Cancer Data (England, Scotland, and Wales): Cancer data have been updated for England, Scotland and Wales, available in Category 100092.
  • COVID-19 PCR test result data (England, Wales): COVID-19 test result data have been updated for England and Wales on the Data Portal. Some issues with the linkage in the English data have been discovered and corrected since the last release (affecting approximately 1.4% of the records), and so we recommend projects re-download the table if they are using this data.
  • First Occurrence Data: First occurrence fields in Category 1712 have been updated to account for the latest death, inpatient and self-report data.

Cognitive function data

  • Broken Letters test: Data from the "broken letters test" are being released in Category 1358. Participants are shown a series of progressively degrading letters of the alphabet and asked to identify them.
  • Picture Vocabulary: Data are being released for a picture vocabulary test introduced in 2016 at the Imaging Clinics; please see Category 504 for more details.

Geographic and environmental data

  • Additional location data: Additional location data relating to imaging visits has been added to the location at assessment fields (Fields 22688 and 22689).
  • Water Hardness Levels: Water mineral levels generated from participant home locations (Category 603).

Other data

  • OMOP transformation of UK Biobank dataset: Transformed version of UK Biobank dataset to the OMOP Common Data Model (Data-Field 20142)
  • Dynamometer calibration data: Calibration information for dynamometers used to measure participant grip strength (Category 100019)

Showcase structural update (Data Portal tables): Some changes have been made to Showcase and the Data Portal to enable easier navigation. The Data Portal now presents users with a drop-down list of the tables they are authorised to access. In Showcase, gateway-fields now list the Record Tables they give access to (see e.g. Field 40100), and record tables also now have their own Showcase 'pages' (see e.g. Record Table 924).

November 2022

Available through Showcase fields:

  • SARS-CoV-2 serology: SARS-CoV-2 antibody status data for 20,000 individuals (10,000 UK Biobank participants and 10,000 of their adult children/grandchildren), taken on seven occasions between June 2020 and Feb 2022. For details, please see categories 995 (waves 1-6) and 994 (wave 7).
  • Image-derived phenotypes (IDPs): Updated data on image-derived phenotypes (IDPs) related to liver iron and fat and abdominal composition in Category 126, Category 149 and Category 158 for approximately 40,000 participants, derived from Perspectum, AMRA and Calico.
  • Accelerometer data: Accelerometer data for approximately 100,000 participants is now available, providing 4 different activity levels: sleeping, sedentary, light exercise, moderate-vigorous exercise across the monitoring period. See Category 1020 for details.
  • Home location data:  The previous data-fields for home location at assessment have been replaced by new fields, following corrections to a small number of baseline addresses and improved data cleaning. The “rounded” (1-kilometre) fields 20074 and 20075 have been replaced by data-fields 22688 and 22689. Both these fields and the location history fields have now been made restricted. See Category 100024 and Category 150 for details.

Available on the Research Analysis Platform (RAP) only:

  • Imputation of genotyping: Imputed genotyping data are now available that use the Genomics England reference panel and TOPMed reference panel. See Category 100319 for details.
July 2022

Available through Showcase fields:

  • Cancer Data (Scotland):An update to the Scottish cancer registry data in Category 100092. See the Data Providers and Dates page for details of the data coverage dates.
  • Nutrition Data:Updated and extended derived information on nutritional intake from the online 24-hour questionnaire for more than 200,000 participants. See Categories 100117 and 100118 for details.
  • Polygenic Risk Scores (PRS):Data for a range of diseases and quantitative traits for all 500,000 participants, provided by Genomics PLC. See Categories 300, 301, and 302 for details.
  • Joint Variant Call (JVC) [temporarily restricted]: New data produced using the GATK pipeline on 200,000 participant samples. This provides a complete JVC for all currently available participant genomes, combining the 50,000 Vanguard samples with the 150,000 genomes included in the previous JVCs. See Category 24304 for details.

Available on the Research Analysis Platform (RAP) only:

  • Final Release of the Whole Exome Sequencing (WES) data: Completion of the WES project with the release of data for approximately 470,000 participants.

May 2022
  • Cancer Data (Scotland): An update to the Scottish cancer registry data in Category 100092. See the Data Providers and Dates page for details of the data coverage dates.
  • Joint Variant Call (JVC): New data produced using the GATK pipeline on 200,000 participant samples. This provides a complete JVC for all currently available participant genomes, combining the 50,000 Vanguard samples with the 150,000 genomes included in the previous JVCs. See Category 24304 for details.
  • Nutrition Data: Updated and extended derived information on nutritional intake from the online 24-hour questionnaire for more than 200,000 participants. See Categories 100117 and 100118 for details.
  • Polygenic Risk Scores (PRS): Data for a range of diseases and quantitative traits for all 500,000 participants, provided by Genomics PLC. See Categories 300, 301, and 302 for details.
February 2022

Available through Showcase fields:

  • Antibody Study Data: Data on SARS-CoV-2 antibody status (indicative of previous SARS-CoV-2 infection) obtained from lateral flow tests (LFTs) from 200,000 participants carried out March-June 2021 are now available. Positivity is based on the reading from two positive LFT results. See Category 998 for details.
  • Antibody Study Thriva Data: More than 60,000 participants from the SARS-CoV-2 antibody study who had a positive LFT result and had received a vaccine were sent a confirmatory blood sampling kit (Thriva) to test for antibodies to the SARS-CoV-2 virus. See Category 997 for details.
  • Updated algorithmically defined outcomes: The algorithmically-derived outcome fields in Category 42 have been updated to account for the latest death and inpatient data. The Data Providers and Dates page gives details of the data coverage dates. Some simplification of the way in which the algorithms operate has taken place, and the source encodings now specify whether the earliest hospital or death diagnosis is in the primary/underlying or secondary position. See Resource 460 for details of the changes made.
  • Trabecular Bone Score (TBS) Data: TBS Data generated by Professor Harvey from University of Southampton https://pubmed.ncbi.nlm.nih.gov/31206530 following the analysis of the DXA body scans undertaken on approximately 30,000 UK Biobank participants. See Category 125 for details.
  • Death Data (England, Scotland, & Wales): The Showcase death fields in Category 100093 have been updated to include the data released on the Data Portal in December. See the Data Providers and Dates page for details of the data coverage dates.
  • Hospital Inpatient Data (England & Scotland): The Showcase summary inpatient fields in Category 2000 have been updated to include the data released on the Data Portal in December. See the Data Providers and Dates page for details of the data coverage dates.
  • Cancer Data (England & Wales): An update to our existing English & Welsh cancer registry data in Category 100092. See the Data Providers and Dates page for details of the data coverage dates.
  • First Occurrence fields: Additional mappings have been added to the First Occurrence fields in Category 1712, in particular many related to diabetes. See here for details of the updates made, and Resource 11419 for full details of the mappings used. These fields have also been updated to take account of the most recent death, inpatient & self-report data. See the Data Providers and Dates page for details of the data coverage dates.
  • Additional derived measures from OCT scans: retinal measures for 80,000 participants derived from our OCT scans, including thicknesses of several retinal layers at multiple subfields. In addition, we are adding many new additional participants to the derived retinal measures that are currently on Showcase. New quality control fields will also be provided which are associated to these new derived measures to allow researchers to perform bespoke quality control on the data before analysis. Please see Category 100079 for all these new data fields.
  • Derived abdominal IDPs (Calico): Derived abdominal organ composition data for 1,000 participants from the abdominal MRI data, supplied by Calico Life Sciences. See Category 158 for details.
  • Cardiac and aortic function IDPs: Derived data on cardiac and aortic structure & function for 2,000 participants. See Category 157 for details.
December 2021

Available on the Data Portal:

  • Additional data from the EMIS GP supplier. See the Data providers and dates page for coverage dates and numbers of participants. These data are available for Covid-19 research only.
  • Additional data from the TPP GP supplier. See the Data providers and dates page for coverage dates and numbers of participants. Note that some new prescription records appear to be near-duplicates of earlier records, using new dm+d codes for the same meanings. These data are available for Covid-19 research only.
  • Additional death registry records for England, Scotland & Wales. See the Data providers and dates page for coverage dates. Note that the corresponding Showcase fields have not been updated.
  • Additional COVID-19 test data for England, Scotland & Wales. See the Data providers and dates page for coverage dates and numbers of participants.
  • Additional hospital inpatient admissions for England and Scotland. See the Data providers and dates page for coverage dates. Note that the summary Showcase fields have not been updated.
November 2021

Available on the Research Analysis Platform (RAP) only:

  • First tranche of Whole Genome Sequence data fields in Category 180, for 200,000 participants.
  • Whole Exome Sequence data in Category 170, for 450,000 participants.
August 2021

Available on the Data Portal:

  • Additional data from the EMIS GP supplier covering up to mid July 2021. See the Data providers and dates page for coverage dates and numbers of participants. These data are available for Covid-19 research only.
July 2021

Available through Showcase fields:

  • Additional cancer registry data (Category 100092) for England and Wales, covering up to an estimated censoring date of the end of July 2019. See the cancer registry information page for further details on the data, and the Data providers and dates page for more information on how censoring dates are estimated.
  • An update to fields in Category 149 (Abdominal composition from MRI scans), provided by Advanced MR Analytics AB, AMRA, Sweden. The latest data will be available for ~25,000 participants and includes new fields and updates to existing fields.
  • An update to the existing data for Predicted Forced Expiratory Volume (FEV1) and FEV1 predicted percentage for an additional 123,000 smokers and previous smokers who have a reproducible spirometry measure. Please see Field 20153 for further details.
  • An update to the existing Nightingale data so the full range of biomarkers are available for participants in the first tranche. This includes QC measures for any derived fields, plus additional sample-level QC data. Please see Category 220 for further details.
  • Data related to macular thickness and retinal pigment epithelium thickness derived from retinal images for ~35,000 participants, generated by a group led by the NIHR Biomedical Research Centre at Moorfields Eye Hospital NHS Foundation Trust and UCL Institute of Ophthalmology. Please see Category 100079 for further details.
  • Data on the diagnostic source of COVID-19 cases identified for the COVID-19 re-imaging study. The source of the positive Covid test result is contained in Field 41001.

Available on the Data Portal:

  • Additional data from the EMIS GP supplier covering up to mid May 2021. See the Data providers and dates page for coverage dates and numbers of participants. These data are available for Covid-19 research only.
May 2021

Available on the Data Portal:

  • Additional data from the TPP GP supplier covering up to March 2021. See the Data providers and dates page for coverage dates and numbers of participants. These data are available for Covid-19 research only.
April 2021

Available on the Data Portal:

  • Additional data from the TPP GP supplier covering up to November 2020. See the Data providers and dates page for coverage dates and numbers of participants. An additional reference table of unit and precision information for numeric values associated with clinical codes was also made available as Resource 951. These data are available for Covid-19 research only.
  • Covid-19 test result data for Scotland and Wales has now been added to that previously available for England. These can be found on the new tables covid19_result_scotland and covid19_result_wales on the Data Portal. For clarity, the previous covid19_result table on the Data Portal has been renamed covid19_result_england. Researchers who already had access to the previous covid19_result table will have been given access to these new tables. New researchers wanting access to these tables should see here for how to do this. Details of the tables is given in the COVID-19 data dictionary.
March 2021

Available on the Data Portal:

Additional data from the EMIS GP supplier covering up to November 2020. See the Data providers and dates page for coverage dates and numbers of participants. These data are available for Covid-19 research only.

Available through Showcase fields:

  • Data on telomere length for almost all of the cohort (474,000 participants) generated by Prof. Nilesh Samani’s research group at the University of Leicester. See Category 265 for details.
  • The first tranche of data on NMR-metabolomics for 120,000 participants, generated by Nightingale Health. This comprises data on a wide range of circulating metabolic biomarkers, including detailed measures of cholesterol metabolism, fatty acid compositions, and various low-molecular weight metabolites, such as amino acids, ketones and glycolysis metabolites. See Category 220 for details.
  • The first tranche of imaging data from the repeat imaging study to enable research into the effect of SARS-CoV-2 infection on internal organs. This includes data from new modalities introduced to the brain (Category 100) and abdominal MRI scans (Category 105). Updates will be made approximately monthly.
  • An update to the hospital inpatient summary fields in Category 2000 and the First Occurrence fields in Category 1712 to take account of the additional death and inpatient data on the Data Portal (and through self-report at assessment centres). Note that the Showcase death fields (Category 100093) and the Algorithmically-defined Outcome fields (Category 42) have not been updated. See the Data providers and dates page for coverage dates.
January 2021

Available on the Data Portal:

  • Additional death registry records for England, Scotland & Wales covering up to early December.
  • Additional hospital inpatient admissions covering up to November for England and October for Scotland.
  • Available through Showcase fields:
  • Data from a web-based questionnaire on self-reported pain for about 170,000 participants. (Category 154)
  • Updated DXA data related to bone mineral density and body composition for about 49,000 participants and additional brain NIFTI scans for an extra 5,000 participants.
  • An update to the Showcase death fields in Category 100093, the hospital inpatient summary fields in Category 2000, and the First Occurrence fields in Category 1712, to take account of the additional death and inpatient data on the Data Portal (and through self-report at assessment centres).
  • Note that the Algorithmically-defined Outcome fields in Category 42 were not updated to take account of the more recent death & hospital inpatient data.
  • Returned datasets:
  • A number of new datasets returned to UK Biobank from research projects are now available for download using the ukblink utility.
December 2020

Updates to the Data Portal:

  • English hospital inpatient records, including critical care admissions, covering up to the end of September 2020.
  • Scottish hospital inpatient records covering November 2016 to the end of August 2020.
  • Welsh hospital inpatient records covering March 2016 to the end of February 2018.
  • Death register records up to October 2020.
  • No Showcase fields have been updated (including any derived fields from the death or inpatient data), with the data above only accessible via the Data Portal.
October and November 2020

Updates to Showcase fields:

  • Late November: Further Exome data for 200,000 participants: sample level variant (VCF, ~8TB) and sequence data (CRAM, ~175TB).
  • Early November: Joint-call Exome data (Field 23156) for 200,000 participants in pVCF format (~7TB).
  • Late October: Joint-call Exome data in PLINK format (Field 23155) for 200,000 participants.
  • Further information on the Exome data is available in the Exome Sequencing FAQs which can be found here.
    Updates to the Data Portal:
  • Mid October: Death register records up to mid September 2020.
  • Early October: English primary care data from EMIS covering approximately 260,000 participants. This data is available to be used for COVID-19 research only. See the COVID-19 page for access details, and Resource 3151 for further details of the data itself.

 

September 2020

Updates to the Data Portal:

  • English hospital inpatient record data covering up to the end of June 2020.
  • Critical care data for English inpatients covering April 2011 to the end of June 2020. See the Critical Care data page for further information.
  • Death register records up to mid August 2020.
  • No Showcase fields have been updated (including any derived fields from the death or inpatient data), with the data above only accessible via the Data Portal.
August 2020

Updates to the Data Portal:

  • English primary care data from TPP data provider, covering approximately 190,000 participants - for COVID-19 research only;
  • English inpatient hospital record data covering April 2020 to May 2020;
  • Death register records up to June 2020.
  • No Showcase fields have been updated (including any derived fields from the death or inpatient data), with the data above only accessible via the Data Portal.
July 2020

Updates to the Data Portal:

  • English inpatient hospital record data from April 2017 to March 2020;
  • Death register records through to May 2020;
  • Blood-type haplotype (Field 23165) via a new table on the Data Portal ('covid19_misc') to any researcher who has access to the COVID-19 test results data (Field 40100).
  • No Showcase fields have been updated (including any derived fields from the death or inpatient data), with the data above only accessible via the Data Portal.
June 2020

Death register data:

  • Additional death register records via Field 40023 and the Data Portal.
    Updated death data for Field 40007 and Field 40010. Please note these were not made accessible via the Data Portal.
  • UK Biobank started making death records available on the Data Portal. Researchers who already had a released basket giving access to any Data-field in Category 100093 were given access to the death tables without needing to submit a new basket containing Field 40023.
  • The algorithmically defined outcome fields in Category 42 and First Occurrences fields (Category 1712) were not updated to take account of the new death data at this Showcase update.
April 2020 COVID-19 test data from Public Health England (PHE). Please note these were only made accessible via the Data Portal. Please see here for details.
February 2020

Replacement exome sequencing data from the SPB pipeline (~50,000 participants). These will replace Data-fields 23171 - 23174.

Imaging data:

  • Additional NIFTI brain images.
    Additional derived brain imaging data, including data for 10 new Data-fields (Fields 25921 - 25930).
    Raw carotid ultrasound data (Field 20241).
    Derived imaging variables:
  • Liver iron corrected T1 (ct1) (Field 22417).
    Abdominal composition variables (Category 149).
    Other data collected at the assessment centre visits:
  • Impedance data from the imaging and repeat imaging visits (Category 100009).
    Paired associate learning cognitive function test data (Category 506).
    Data from the online questionnaire on food preferences (Category 1039).
  • Various Returned Results via the Returns Catalogue, including returns with derived individual-level data related to: cardiac measures, eye measures, actigraphy, spirometry, and weather.
  • Additional Data-fields for the online cognitive function questionnaire (Category 116) relating to device (Field 23077 and Field 23078) and mood (Category 155).
September 2019

Primary care (GP) data for around 45% of the cohort, containing coded clinical data and prescriptions (category 3000).

  • "First occurrence" fields (category 1712) showing the first occurrence of any code mapped to 3-character ICD-10. The data-fields have been generated by mapping:
  • Read code information in the Primary Care data (Category 3000),
    ICD-9 and ICD-10 codes in the Hospital inpatient data (Category 2000),
    ICD-10 codes in Death Register records (Field 40001, Field 40002), and
    Self-reported medical condition codes (Field 20002) reported at the baseline or subsequent UK Biobank assessment centre visit to 3-character ICD-10 codes.
    Freesurfer segmentation data (categories 190-197).
  • The hospital inpatient data has been restructured, and a very small amount of additional data has been added. See Update of HES data - September 2019 for more details.
March 2019
  • Exome sequencing data for 50,000 participants.
  • Biochemistry assay data - serum and red blood cells assay data for all participants. See here for the list of biomarkers that have been measured.
  • Updated hospital inpatient and death and cancer registry data.
  • Additional summary data fields for hospital inpatient data, including corresponding first recorded diagnosis/procedure date.
  • Algorithmically-derived health outcomes for: Asthma, COPD, Dementia, End stage renal disease, Motor neurone disease, Parkinson's disease.
  • Infectious disease pilot study (10,000 participants).
  • Retinal OCT image slices in PNG format (fields 21017 & 21018 in category 100016).
October 2018
  • Online digestive health questionnaire data
  • Indices of Multiple deprivation (IMD) scores
  • 12-lead ECG metrics (imaging assessment)
  • Liver phenotypes from MRI scan (for some participants)
  • Updated imaging data
  • MET score data
  • Returned datasets
March 2018 Version 3 of the imputed genetics data
January 2018
  • Greenspace and coastal proximity data
  • Updated imaging data
  • Updated body composition data from the MRI (for some participants)
July 2017
  • Genotyping data for the full cohort
  • Mental health questionnaire data
  • Additional home location coordinates
February 2017
  • Algorithmically-defined cases of myocardial infarction and stroke
  • Address history data
  • Derived data on body composition from abdominal MRI for some participants
  • Updated imaging data
October 2016
  • Updated death, cancer and hospital inpatient data
  • Updated imaging data
May 2016 Updated death and cancer data
March 2016
  • Updated physical activity (accelerometer) data-fields
  • Updated imaging data
  • Updated death and cancer register data
October 2015
  • Imaging data for 5,000 participants
  • Online questionnaire data on cognitive function
July 2015 Online occupational health questionnaire
July 2014
  • Physical activity monitor data for 30,000 participants
  • Objective measures of built environment for participants resident in Wales and Greater London Area
June 2014
  • Nutrient data from the 24-hour diet recall
  • Inpatient hospital data for participants in Scotland
December 2013
  • Death and cancer registry data for participants in Scotland
  • Repeat assessment data for 20,000 participants
  • Inpatient hospital data for participants in England
March 2013 Death and cancer registry data for participants in England and Wales available
September 2012 Four online 24-hour recall diet questionnaires. The data was collected over the period February 2011 - June 2012

Last updated