{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,11]],"date-time":"2025-11-11T22:36:55Z","timestamp":1762900615176,"version":"3.41.2"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"8","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["82270143"],"award-info":[{"award-number":["82270143"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004775","name":"Natural Science Foundation of Gansu Province, China","doi-asserted-by":"crossref","award":["22JR5RA508"],"award-info":[{"award-number":["22JR5RA508"]}],"id":[{"id":"10.13039\/501100004775","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100021171","name":"Guangdong Basic and Applied Basic Research Foundation","doi-asserted-by":"publisher","award":["2022A1515220122"],"award-info":[{"award-number":["2022A1515220122"]}],"id":[{"id":"10.13039\/501100021171","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Supercomputing Centre of Lanzhou University"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,8,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>There has been a burgeoning interest in cyclic peptide therapeutics due to their various outstanding advantages and strong potential for drug formation. However, it is undoubtedly costly and inefficient to use traditional wet lab methods to clarify their biological activities. Using artificial intelligence instead is a more energy-efficient and faster approach. MuCoCP aims to build a complete pre-trained model for extracting potential features of cyclic peptides, which can be fine-tuned to accurately predict cyclic peptide bioactivity on various downstream tasks. To maximize its effectiveness, we use a novel data augmentation method based on a priori chemical knowledge and multiple unsupervised training objective functions to greatly improve the information-grabbing ability of the model.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To assay the efficacy of the model, we conducted validation on the membrane-permeability of cyclic peptides which achieved an accuracy of 0.87 and R-squared of 0.503 on CycPeptMPDB using semi-supervised training and obtained an accuracy of 0.84 and R-squared of 0.384 using a model with frozen parameters on an external dataset. This result has achieved state-of-the-art, which substantiates the stability and generalization capability of MuCoCP. It means that MuCoCP can fully explore the high-dimensional information of cyclic peptides and make accurate predictions on downstream bioactivity tasks, which will serve as a guide for the future de novo design of cyclic peptide drugs and promote the development of cyclic peptide drugs.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>All code used in our proposed method can be found at https:\/\/linproxy.fan.workers.dev:443\/https\/github.com\/lennonyu11234\/MuCoCP.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae473","type":"journal-article","created":{"date-parts":[[2024,7,25]],"date-time":"2024-07-25T16:12:55Z","timestamp":1721923975000},"source":"Crossref","is-referenced-by-count":7,"title":["MuCoCP: a priori chemical knowledge-based multimodal contrastive learning pre-trained neural network for the prediction of cyclic peptide membrane penetration ability"],"prefix":"10.1093","volume":"40","author":[{"given":"Yunxiang","family":"Yu","sequence":"first","affiliation":[{"name":"School of Basic Medical Sciences, Lanzhou University , Lanzhou, 730000, China"}]},{"given":"Mengyun","family":"Gu","sequence":"additional","affiliation":[{"name":"School of Basic Medical Sciences, Lanzhou University , Lanzhou, 730000, China"}]},{"given":"Hai","family":"Guo","sequence":"additional","affiliation":[{"name":"The Second Hospital Clinical Medical School, Lanzhou University , Lanzhou, 730000, China"}]},{"given":"Yabo","family":"Deng","sequence":"additional","affiliation":[{"name":"School of Basic Medical Sciences, Lanzhou University , Lanzhou, 730000, China"}]},{"given":"Danna","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Basic Medical Sciences, Lanzhou University , Lanzhou, 730000, China"},{"name":"The Affiliated Hospital of Guangdong Medical University , Zhanjiang, 524000, China"},{"name":"Guangzhou First People\u2019s Hospital, South China University of Technology , Guangzhou, 510180, China"}]},{"given":"Jianwei","family":"Wang","sequence":"additional","affiliation":[{"name":"Guangzhou First People\u2019s Hospital, South China University of Technology , Guangzhou, 510180, China"}]},{"given":"Caixia","family":"Wang","sequence":"additional","affiliation":[{"name":"Guangzhou First People\u2019s Hospital, South China University of Technology , Guangzhou, 510180, China"}]},{"given":"Xia","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Basic Medical Sciences, Lanzhou University , Lanzhou, 730000, China"}]},{"ORCID":"https:\/\/linproxy.fan.workers.dev:443\/https\/orcid.org\/0000-0003-2566-7322","authenticated-orcid":false,"given":"Wenjin","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Basic Medical Sciences, Lanzhou University , Lanzhou, 730000, China"}]},{"given":"Jinqi","family":"Huang","sequence":"additional","affiliation":[{"name":"The Affiliated Hospital of Guangdong Medical University , Zhanjiang, 524000, China"},{"name":"Guangzhou First People\u2019s Hospital, South China University of Technology , Guangzhou, 510180, China"}]}],"member":"286","published-online":{"date-parts":[[2024,7,27]]},"reference":[{"key":"2024081002505895900_btae473-B1","doi-asserted-by":"crossref","first-page":"2749","DOI":"10.1109\/TCBB.2021.3102133","article-title":"DeepCPPred: a deep learning framework for the discrimination of cell-penetrating peptides and their uptake efficiencies","volume":"19","author":"Arif","year":"2022","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2024081002505895900_btae473-B2","doi-asserted-by":"crossref","first-page":"3520","DOI":"10.1016\/j.cell.2022.07.019","article-title":"Accurate de novo design of membrane-traversing macrocycles","volume":"185","author":"Bhardwaj","year":"2022","journal-title":"Cell"},{"key":"2024081002505895900_btae473-B3","doi-asserted-by":"crossref","first-page":"1487","DOI":"10.1002\/chem.201905385","article-title":"Cyclic peptides as drugs for intracellular targets: the next frontier in peptide therapeutic development","volume":"27","author":"Buckton","year":"2021","journal-title":"Chemistry"},{"key":"2024081002505895900_btae473-B4","doi-asserted-by":"crossref","first-page":"1888","DOI":"10.1021\/acs.jmedchem.3c01611","article-title":"Multi_cycgt: a deep learning-based multimodal model for predicting the membrane permeability of cyclic peptides","volume":"67","author":"Cao","year":"2024","journal-title":"J Med Chem"},{"key":"2024081002505895900_btae473-B5","doi-asserted-by":"crossref","first-page":"1599","DOI":"10.1007\/s40265-019-01187-w","article-title":"Bremelanotide: first approval","volume":"79","author":"Dhillon","year":"2019","journal-title":"Drugs"},{"key":"2024081002505895900_btae473-B6","doi-asserted-by":"crossref","first-page":"10241","DOI":"10.1021\/acs.chemrev.9b00008","article-title":"Understanding cell penetration of cyclic peptides","volume":"119","author":"Dougherty","year":"2019","journal-title":"Chem Rev"},{"key":"2024081002505895900_btae473-B7","doi-asserted-by":"crossref","first-page":"10098","DOI":"10.1021\/acs.jmedchem.9b00456","article-title":"Enhancing the cell permeability of stapled peptides with a cyclic cell-penetrating peptide","volume":"62","author":"Dougherty","year":"2019","journal-title":"J Med Chem"},{"key":"2024081002505895900_btae473-B8","doi-asserted-by":"crossref","first-page":"3028","DOI":"10.1093\/bioinformatics\/btaa131","article-title":"StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency","volume":"36","author":"Fu","year":"2020","journal-title":"Bioinformatics"},{"year":"2019","author":"Gasteiger","key":"2024081002505895900_btae473-B9"},{"year":"2017","author":"Hamilton","key":"2024081002505895900_btae473-B10"},{"key":"2024081002505895900_btae473-B11","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1016\/j.aiopen.2021.08.002","article-title":"Pre-trained models: past, present and future","volume":"2","author":"Han","year":"2021","journal-title":"AI Open"},{"year":"2020","author":"He","key":"2024081002505895900_btae473-B12"},{"year":"2016","author":"Kipf","key":"2024081002505895900_btae473-B13"},{"year":"2020","author":"Landrum","key":"2024081002505895900_btae473-B14"},{"year":"2021","author":"Li","key":"2024081002505895900_btae473-B15"},{"key":"2024081002505895900_btae473-B16","doi-asserted-by":"crossref","first-page":"311","DOI":"10.3390\/md19060311","article-title":"Improvement on permeability of cyclic peptide\/peptidomimetic: backbone N-methylation as a useful tool","volume":"19","author":"Li","year":"2021","journal-title":"Mar Drugs"},{"key":"2024081002505895900_btae473-B17","doi-asserted-by":"crossref","first-page":"7568","DOI":"10.1038\/s41467-023-43214-1","article-title":"A knowledge-guided pre-training framework for improving molecular representation learning","volume":"14","author":"Li","year":"2023","journal-title":"Nat Commun"},{"key":"2024081002505895900_btae473-B18","doi-asserted-by":"crossref","first-page":"2240","DOI":"10.1021\/acs.jcim.2c01573","article-title":"CycPeptMPDB: a comprehensive database of membrane permeability of cyclic peptides","volume":"63","author":"Li","year":"2023","journal-title":"J Chem Inf Model"},{"key":"2024081002505895900_btae473-B19","doi-asserted-by":"crossref","first-page":"921","DOI":"10.1038\/s41587-022-01226-0","article-title":"Identification of antimicrobial peptides from the human gut microbiome using deep learning","volume":"40","author":"Ma","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2024081002505895900_btae473-B20","doi-asserted-by":"crossref","first-page":"114278","DOI":"10.1016\/j.ejmech.2022.114278","article-title":"Amphiphilic cyclic peptide [W4KR5]-antibiotics combinations as broad-spectrum antimicrobial agents","volume":"235","author":"Mohammed","year":"2022","journal-title":"Eur J Med Chem"},{"key":"2024081002505895900_btae473-B21","doi-asserted-by":"crossref","first-page":"19846","DOI":"10.1021\/acsomega.1c02569","article-title":"CpACpP: in silico cell-penetrating anticancer peptide prediction using a novel bioinformatics framework","volume":"6","author":"Nasiri","year":"2021","journal-title":"ACS Omega"},{"key":"2024081002505895900_btae473-B22","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1016\/j.cbpa.2017.04.012","article-title":"Cyclic peptide natural products chart the frontier of oral bioavailability in the pursuit of undruggable targets","volume":"38","author":"Naylor","year":"2017","journal-title":"Curr Opin Chem Biol"},{"key":"2024081002505895900_btae473-B23","doi-asserted-by":"crossref","first-page":"3214","DOI":"10.1021\/acs.jproteome.8b00322","article-title":"KELM-CPPpred: kernel extreme learning machine based prediction model for cell-penetrating peptides","volume":"17","author":"Pandey","year":"2018","journal-title":"J Proteome Res"},{"key":"2024081002505895900_btae473-B24","doi-asserted-by":"crossref","first-page":"1872","DOI":"10.1007\/s11431-020-1647-3","article-title":"Pre-trained models for natural language processing: a survey","volume":"63","author":"Qiu","year":"2020","journal-title":"Sci China Technol Sci"},{"key":"2024081002505895900_btae473-B25","doi-asserted-by":"crossref","first-page":"4428","DOI":"10.3390\/molecules27144428","article-title":"Cyclic peptides for the treatment of cancers: a review","volume":"27","author":"Ramadhani","year":"2022","journal-title":"Molecules"},{"key":"2024081002505895900_btae473-B26","doi-asserted-by":"crossref","first-page":"397","DOI":"10.3390\/md20060397","article-title":"Marine cyclic peptides: antimicrobial activity and synthetic strategies","volume":"20","author":"Ribeiro","year":"2022","journal-title":"Mar Drugs"},{"year":"2023","author":"Vaswani","key":"2024081002505895900_btae473-B27"},{"key":"2024081002505895900_btae473-B28","doi-asserted-by":"crossref","first-page":"739","DOI":"10.1109\/TCBB.2019.2930993","article-title":"G-DipC: an improved feature representation method for short sequences to predict the type of cargo in Cell-Penetrating peptides","volume":"17","author":"Wang","year":"2020","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2024081002505895900_btae473-B29","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1007\/s11633-022-1410-8","article-title":"Large-scale multi-modal pre-trained models: a comprehensive survey","volume":"20","author":"Wang","year":"2023","journal-title":"Mach Intell Res"},{"year":"2021","author":"Wang","key":"2024081002505895900_btae473-B30"},{"key":"2024081002505895900_btae473-B31","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1016\/j.drudis.2016.02.005","article-title":"Quantifying the chameleonic properties of macrocycles and other high-molecular-weight drugs","volume":"21","author":"Whitty","year":"2016","journal-title":"Drug Discov Today"},{"key":"2024081002505895900_btae473-B32","doi-asserted-by":"crossref","first-page":"512","DOI":"10.1021\/acscentsci.8b00098","article-title":"Machine learning to predict cell-penetrating peptides for antisense delivery","volume":"4","author":"Wolfe","year":"2018","journal-title":"ACS Cent Sci"},{"year":"2019","author":"Xu","key":"2024081002505895900_btae473-B33"},{"year":"2021","author":"You","key":"2024081002505895900_btae473-B34"},{"key":"2024081002505895900_btae473-B35","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1038\/s42004-023-00825-5","article-title":"Hierarchical molecular graph self-supervised learning for property prediction","volume":"6","author":"Zang","year":"2023","journal-title":"Commun Chem"},{"key":"2024081002505895900_btae473-B36","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1039\/D1CB00154J","article-title":"Cyclic peptide drugs approved in the last two decades (2001\u20132021)","volume":"3","author":"Zhang","year":"2022","journal-title":"RSC Chem Biol"},{"key":"2024081002505895900_btae473-B37","doi-asserted-by":"crossref","first-page":"bbac545","DOI":"10.1093\/bib\/bbac545","article-title":"SiameseCPP: a sequence-based siamese network to predict cell-penetrating peptides by contrastive learning","volume":"24","author":"Zhang","year":"2023","journal-title":"Brief Bioinform"},{"year":"2023","author":"Zhang","key":"2024081002505895900_btae473-B38"},{"year":"2020","author":"Zoph","key":"2024081002505895900_btae473-B39"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae473\/58667438\/btae473.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/academic.oup.com\/bioinformatics\/article-pdf\/40\/8\/btae473\/58790528\/btae473.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/academic.oup.com\/bioinformatics\/article-pdf\/40\/8\/btae473\/58790528\/btae473.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,10]],"date-time":"2024-08-10T02:51:28Z","timestamp":1723258288000},"score":1,"resource":{"primary":{"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae473\/7721931"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,7,27]]},"references-count":39,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,8,2]]}},"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/doi.org\/10.1093\/bioinformatics\/btae473","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2024,8]]},"published":{"date-parts":[[2024,7,27]]},"article-number":"btae473"}}