The sequences of 150,119 genomes in the UK Biobank

dc.contributor.authorDBDS Genetic Consortium
dc.contributor.departmentFaculty of Electrical and Computer Engineering
dc.contributor.departmentFaculty of Industrial Engineering, Mechanical Engineering and Computer Science
dc.contributor.departmentFaculty of Medicine
dc.contributor.departmentFaculty of Sociology, Anthropology and Folkloristics
dc.contributor.schoolHealth Sciences
dc.date.accessioned2025-11-20T08:49:57Z
dc.date.available2025-11-20T08:49:57Z
dc.date.issued2022-07-20
dc.descriptionFunding Information: We thank the participants of the UKB. The sequencing of 450,000 WGS individuals from the UKB, including the 150,119 described here has been funded by the UKB WGS consortium consisting of the UK Government’s research and innovation agency, UK Research and Innovation (UKRI), through the Industrial Strategy Challenge Fund, The Wellcome Trust and the pharmaceutical companies Amgen, AstraZeneca, GlaxoSmithKline and Johnson & Johnson. DNA sequencing was performed at the Welcome Trust Sanger Institute and deCODE genetics. Funding Information: We thank the participants of the UKB. The sequencing of 450,000 WGS individuals from the UKB, including the 150,119 described here has been funded by the UKB WGS consortium consisting of the UK Government’s research and innovation agency, UK Research and Innovation (UKRI), through the Industrial Strategy Challenge Fund, The Wellcome Trust and the pharmaceutical companies Amgen, AstraZeneca, GlaxoSmithKline and Johnson & Johnson. DNA sequencing was performed at the Welcome Trust Sanger Institute and deCODE genetics. Publisher Copyright: © 2022, The Author(s).en
dc.description.abstractDetailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data1,2. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank3. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.en
dc.description.versionPeer revieweden
dc.format.extent9
dc.format.extent10098405
dc.format.extent732-740
dc.identifier.citationDBDS Genetic Consortium 2022, 'The sequences of 150,119 genomes in the UK Biobank', Nature, vol. 607, no. 7920, pp. 732-740. https://doi.org/10.1038/s41586-022-04965-xen
dc.identifier.doi10.1038/s41586-022-04965-x
dc.identifier.issn0028-0836
dc.identifier.other60462588
dc.identifier.other5a43da47-ed59-409a-a312-c63b09741328
dc.identifier.other85134541905
dc.identifier.other35859178
dc.identifier.otherunpaywall: 10.1038/s41586-022-04965-x
dc.identifier.urihttps://hdl.handle.net/20.500.11815/6787
dc.language.isoen
dc.relation.ispartofseriesNature; 607(7920)en
dc.relation.urlhttps://www.scopus.com/pages/publications/85134541905en
dc.rightsinfo:eu-repo/semantics/openAccessen
dc.subjectAfrica/ethnologyen
dc.subjectAsia/ethnologyen
dc.subjectBiological Specimen Banksen
dc.subjectCohort Studiesen
dc.subjectConserved Sequenceen
dc.subjectDatabases, Geneticen
dc.subjectExons/geneticsen
dc.subjectGenetic Variationen
dc.subjectGenome, Human/geneticsen
dc.subjectGenomicsen
dc.subjectHaplotypes/geneticsen
dc.subjectHumansen
dc.subjectINDEL Mutationen
dc.subjectIreland/ethnologyen
dc.subjectMicrosatellite Repeatsen
dc.subjectPolymorphism, Single Nucleotide/geneticsen
dc.subjectUnited Kingdomen
dc.subjectWhole Genome Sequencingen
dc.subjectMultidisciplinaryen
dc.titleThe sequences of 150,119 genomes in the UK Biobanken
dc.type/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/articleen

Skrár

Original bundle

Niðurstöður 1 - 1 af 1
Nafn:
The_Sequences_of_150119_Genomes_in_the_UK.pdf
Stærð:
9.63 MB
Snið:
Adobe Portable Document Format

Undirflokkur