Deep learning (DL) is currently revolutionizing peptide drug development due to both computational advances and the substantial recent expansion of digitized biological data. However, progress in oligopeptide drug development has been limited, likely due to the lack of suitable datasets and difficulty in identifying informative features to use as inputs for DL models. Here, we utilized an unsupervised deep learning model to learn a semantic pattern based on the intrinsically disordered regions of ~171 known osteogenic proteins. Subsequently, oligopeptides were generated from this semantic pattern based on Monte Carlo simulation, followed by in vivo functional characterization. A five amino acid oligopeptide (AIB5P) had strong bone-formation-promoting effects, as determined in multiple mouse models (e.g., osteoporosis, fracture, and osseointegration of implants). Mechanistically, we showed that AIB5P promotes osteogenesis by binding to the integrin α5 subunit and thereby activating FAK signaling. In summary, we successfully established an oligopeptide discovery strategy based on a DL model and demonstrated its utility from cytological screening to animal experimental verification.
Pubmed ID: 35228528 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Global nonprofit biological resource center (BRC) and research organization that provides biological products, technical services and educational programs to private industry, government and academic organizations. Its mission is to acquire, authenticate, preserve, develop and distribute biological materials, information, technology, intellectual property and standards for the advancement and application of scientific knowledge. The primary purpose of ATCC is to use its resources and experience as a BRC to become the world leader in standard biological reference materials management, intellectual property resource management and translational research as applied to biomaterial development, standardization and certification. ATCC characterizes cell lines, bacteria, viruses, fungi and protozoa, as well as develops and evaluates assays and techniques for validating research resources and preserving and distributing biological materials to the public and private sector research communities.
View all literature mentionsSoftware package used for interactive, or batched, statistical analysis in social science, health sciences and marketing. Software platform offers advanced statistical analysis, a library of machine-learning algorithms, text analysis, open-source extensibility, integration with big data and deployment into applications.Versions that were produced by SPSS Inc. before the IBM acquisition (Versions 18 and earlier) would be given origin or publisher of SPSS Inc. in Chicago.
View all literature mentionsCell line MC3T3-E1 is a Spontaneously immortalized cell line with a species of origin Mus musculus
View all literature mentionsMus musculus with name C57BL/6J from IMSR.
View all literature mentions