The conserved Musashi (Msi) family of RNA binding proteins are expressed in stem/progenitor and cancer cells, but generally absent from differentiated cells, consistent with a role in cell state regulation. We found that Msi genes are rarely mutated but frequently overexpressed in human cancers and are associated with an epithelial-luminal cell state. Using ribosome profiling and RNA-seq analysis, we found that Msi proteins regulate translation of genes implicated in epithelial cell biology and epithelial-to-mesenchymal transition (EMT), and promote an epithelial splicing pattern. Overexpression of Msi proteins inhibited the translation of Jagged1, a factor required for EMT, and repressed EMT in cell culture and in mammary gland in vivo. Knockdown of Msis in epithelial cancer cells promoted loss of epithelial identity. Our results show that mammalian Msi proteins contribute to an epithelial gene expression program in neural and mammary cell types.
Pubmed ID: 25380226 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Commercial antibody vendor which supplies antibodies and other products to life science researchers.
View all literature mentionsA statistical framework for genomic data fusion is a computational framework for integrating and drawing inferences from a collection of genome-wide measurements. Each dataset is represented via a kernel function, which defines generalized similarity relationships between pairs of entities, such as genes or proteins. The kernel representation is both flexible and efficient, and can be applied to many different types of data. Furthermore, kernel functions derived from different types of data can be combined in a straightforward fashion. Recent advances in the theory of kernel methods have provided efficient algorithms to perform such combinations in a way that minimizes a statistical loss function. These methods exploit semidefinite programming techniques to reduce the problem of finding optimizing kernel combinations to a convex optimization problem. Computational experiments performed using yeast genome-wide datasets, including amino acid sequences, hydropathy profiles, gene expression data and known protein-protein interactions, demonstrate the utility of this approach. A statistical learning algorithm trained from all of these data to recognize particular classes of proteins--membrane proteins and ribosomal proteins--performs significantly better than the same algorithm trained on any single type of data. Matlab code to center a kernel matrix and Matlab code for normalization are available.
View all literature mentionsProbabilistic framework that quantitates the expression level of alternatively spliced genes from RNA-Seq and identifies differentially regulated isoforms or exons across samples.
View all literature mentionsCell line BT-474 is a Cancer cell line with a species of origin Homo sapiens (Human)
View all literature mentionsCell line HEK293T is a Transformed cell line with a species of origin Homo sapiens (Human)
View all literature mentions