Polygenic risk prediction : why and when out-of-sample prediction R2 can exceed SNP-based heritability

dc.contributor.authorMajor Depressive Disorder Working Group of the Psychiatric Genomics Consortium
dc.contributor.departmentFaculty of Medicine
dc.date.accessioned2025-11-20T09:22:10Z
dc.date.available2025-11-20T09:22:10Z
dc.date.issued2023-07-06
dc.descriptionCopyright © 2023 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.en
dc.description.abstractIn polygenic score (PGS) analysis, the coefficient of determination (R2) is a key statistic to evaluate efficacy. R2 is the proportion of phenotypic variance explained by the PGS, calculated in a cohort that is independent of the genome-wide association study (GWAS) that provided estimates of allelic effect sizes. The SNP-based heritability (hSNP2, the proportion of total phenotypic variances attributable to all common SNPs) is the theoretical upper limit of the out-of-sample prediction R2. However, in real data analyses R2 has been reported to exceed hSNP2, which occurs in parallel with the observation that hSNP2 estimates tend to decline as the number of cohorts being meta-analyzed increases. Here, we quantify why and when these observations are expected. Using theory and simulation, we show that if heterogeneities in cohort-specific hSNP2 exist, or if genetic correlations between cohorts are less than one, hSNP2 estimates can decrease as the number of cohorts being meta-analyzed increases. We derive conditions when the out-of-sample prediction R2 will be greater than hSNP2 and show the validity of our derivations with real data from a binary trait (major depression) and a continuous trait (educational attainment). Our research calls for a better approach to integrating information from multiple cohorts to address issues of between-cohort heterogeneity.en
dc.description.versionPeer revieweden
dc.format.extent9
dc.format.extent3218327
dc.format.extent1207-1215
dc.identifier.citationMajor Depressive Disorder Working Group of the Psychiatric Genomics Consortium 2023, 'Polygenic risk prediction : why and when out-of-sample prediction R 2 can exceed SNP-based heritability', American Journal of Human Genetics, vol. 110, no. 7, pp. 1207-1215. https://doi.org/10.1016/j.ajhg.2023.06.006en
dc.identifier.doi10.1016/j.ajhg.2023.06.006
dc.identifier.issn0002-9297
dc.identifier.other182433905
dc.identifier.other1372cbbf-fb4b-4e7f-ab4c-5b477e23818d
dc.identifier.other85164270154
dc.identifier.other37379836
dc.identifier.otherunpaywall: 10.1016/j.ajhg.2023.06.006
dc.identifier.urihttps://hdl.handle.net/20.500.11815/7317
dc.language.isoen
dc.relation.ispartofseriesAmerican Journal of Human Genetics; 110(7)en
dc.relation.urlhttps://www.scopus.com/pages/publications/85164270154en
dc.rightsinfo:eu-repo/semantics/openAccessen
dc.subjectmeta-analysisen
dc.subjectout-of-sample prediction Ren
dc.subjectpolygenic risk predictionen
dc.subjectSNP-based heritabilityen
dc.subjectGenome-Wide Association Studyen
dc.subjectPolymorphism, Single Nucleotide/geneticsen
dc.subjectPhenotypeen
dc.subjectHumansen
dc.subjectComputer Simulationen
dc.subjectMultifactorial Inheritance/geneticsen
dc.subjectGenetics (clinical)en
dc.subjectGeneticsen
dc.titlePolygenic risk prediction : why and when out-of-sample prediction R2 can exceed SNP-based heritabilityen
dc.type/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/articleen

Skrár

Original bundle

Niðurstöður 1 - 1 af 1
Nafn:
1-s2.0-S0002929723002070-main_2_.pdf
Stærð:
3.07 MB
Snið:
Adobe Portable Document Format

Undirflokkur