Universal prediction of cell-cycle position using transfer learning

dc.contributor.authorZheng, Shijie C.
dc.contributor.authorStein-O’Brien, Genevieve
dc.contributor.authorAugustin, Jonathan J.
dc.contributor.authorSlosberg, Jared
dc.contributor.authorCarosso, Giovanni A.
dc.contributor.authorWiner, Briana
dc.contributor.authorShin, Gloria
dc.contributor.authorBjörnsson, Hans Tómas
dc.contributor.authorGoff, Loyal A.
dc.contributor.authorHansen, Kasper D.
dc.contributor.departmentFaculty of Medicine
dc.date.accessioned2025-11-20T08:42:12Z
dc.date.available2025-11-20T08:42:12Z
dc.date.issued2022-01-31
dc.descriptionFunding Information: This project has been made possible in part by grant number CZF2019-002443 from the Chan Zuckerberg Initiative DAF, an advised fund of Silicon Valley Community Foundation. Research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under award R01GM121459. This work was additionally supported by awards from the National Science Foundation (IOS-1665692), the National Institute of Aging (R01AG066768), and the Maryland Stem Cell Research Foundation (2016-MSCRFI-2805). GSO is supported by postdoctoral fellowship awards from the Kavli Neurodiscovery Institute, the Johns Hopkins Provost Award Program, and the BRAIN Initiative in partnership with the National Institute of Neurological Disorders (K99NS122085). Publisher Copyright: © 2021, The Author(s).en
dc.description.abstractBackground: The cell cycle is a highly conserved, continuous process which controls faithful replication and division of cells. Single-cell technologies have enabled increasingly precise measurements of the cell cycle both as a biological process of interest and as a possible confounding factor. Despite its importance and conservation, there is no universally applicable approach to infer position in the cell cycle with high-resolution from single-cell RNA-seq data. Results: Here, we present tricycle, an R/Bioconductor package, to address this challenge by leveraging key features of the biology of the cell cycle, the mathematical properties of principal component analysis of periodic functions, and the use of transfer learning. We estimate a cell-cycle embedding using a fixed reference dataset and project new data into this reference embedding, an approach that overcomes key limitations of learning a dataset-dependent embedding. Tricycle then predicts a cell-specific position in the cell cycle based on the data projection. The accuracy of tricycle compares favorably to gold-standard experimental assays, which generally require specialized measurements in specifically constructed in vitro systems. Using internal controls which are available for any dataset, we show that tricycle predictions generalize to datasets with multiple cell types, across tissues, species, and even sequencing assays. Conclusions: Tricycle generalizes across datasets and is highly scalable and applicable to atlas-level single-cell RNA-seq data.en
dc.description.versionPeer revieweden
dc.format.extent4853067
dc.format.extent41
dc.identifier.citationZheng, S C, Stein-O’Brien, G, Augustin, J J, Slosberg, J, Carosso, G A, Winer, B, Shin, G, Björnsson, H T, Goff, L A & Hansen, K D 2022, 'Universal prediction of cell-cycle position using transfer learning', Genome Biology, vol. 23, no. 1, 41, pp. 41. https://doi.org/10.1186/s13059-021-02581-yen
dc.identifier.doi10.1186/s13059-021-02581-y
dc.identifier.issn1474-7596
dc.identifier.other46884438
dc.identifier.other24f8156a-6670-40d2-b105-03fc9345a089
dc.identifier.other85123974426
dc.identifier.other35101061
dc.identifier.otherunpaywall: 10.1186/s13059-021-02581-y
dc.identifier.urihttps://hdl.handle.net/20.500.11815/6653
dc.language.isoen
dc.relation.ispartofseriesGenome Biology; 23(1)en
dc.relation.urlhttps://www.scopus.com/pages/publications/85123974426en
dc.rightsinfo:eu-repo/semantics/openAccessen
dc.subjectCell cycleen
dc.subjectSingle-cell RNA-sequencingen
dc.subjectTransfer learningen
dc.subjectCell Cycle/geneticsen
dc.subjectSequence Analysis, RNAen
dc.subjectSingle-Cell Analysisen
dc.subjectMachine Learningen
dc.subjectPrincipal Component Analysisen
dc.subjectWhole Exome Sequencingen
dc.subjectGeneticsen
dc.subjectEcology, Evolution, Behavior and Systematicsen
dc.subjectCell Biologyen
dc.titleUniversal prediction of cell-cycle position using transfer learningen
dc.type/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/articleen

Skrár

Original bundle

Niðurstöður 1 - 1 af 1
Nafn:
Universal_....pdf
Stærð:
4.63 MB
Snið:
Adobe Portable Document Format

Undirflokkur