Recent technical developments have enabled the transcriptomes of hundreds of cells to be assayed in an unbiased manner, opening up the possibility that new subpopulations of cells can be found. However, the effects of potential confounding factors, such as the cell cycle, on the heterogeneity of gene expression and therefore on the ability to robustly identify subpopulations remain unclear. We present and validate a computational approach that uses latent variable models to account for such hidden factors. We show that our single-cell latent variable model (scLVM) allows the identification of otherwise undetectable subpopulations of cells that correspond to different stages during the differentiation of naive T cells into T helper 2 cells. Our approach can be used not only to identify cellular subpopulations but also to tease apart different sources of gene expression heterogeneity in single-cell transcriptomes.
Pubmed ID: 25599176 RIS Download
Mesh terms: Animals | Cell Differentiation | Cell Lineage | Computational Biology | Gene Expression Regulation, Developmental | Genetic Heterogeneity | Mice | Models, Theoretical | Mouse Embryonic Stem Cells | RNA | Sequence Analysis, RNA | Single-Cell Analysis | Th2 Cells | Transcriptome
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.