

FOLLOWUS
Lab of Low-Dimensional Materials Chemistry, Key Laboratory for Ultrafine Materials of Ministry of Education, Frontier Science Center of the Materials Biology and Dynamic Chemistry, School of Materials Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
dapengzhang@ecust.edu.cn (D.P.Z.)
ysli@ecust.edu.cn (Y.S.L.)
Received:03 December 2025,
Accepted:06 January 2026,
Online First:05 March 2026,
Published:05 April 2026
Scan QR Code
Cheng, W. T.; Zheng, H. L.; Zhu, P. Y.; Hao, J. N.; Zhang, D. P.; Li, Y. S. Machine learning-guided prediction of ionizable amphiphilic Janus dendrimers for mRNA nanomedicine. Chinese J. Polym. Sci. 2026, 44, 1154–1164
Wan-Ting Cheng, Heng-Li Zheng, Peng-Yu Zhu, et al. Machine Learning-guided Prediction of Ionizable Amphiphilic Janus Dendrimers for mRNA Nanomedicine[J]. Chinese Journal of Polymer Science, 2026, 44(4): 1154-1164.
Cheng, W. T.; Zheng, H. L.; Zhu, P. Y.; Hao, J. N.; Zhang, D. P.; Li, Y. S. Machine learning-guided prediction of ionizable amphiphilic Janus dendrimers for mRNA nanomedicine. Chinese J. Polym. Sci. 2026, 44, 1154–1164 DOI: 10.1007/s10118-026-3563-3.
Wan-Ting Cheng, Heng-Li Zheng, Peng-Yu Zhu, et al. Machine Learning-guided Prediction of Ionizable Amphiphilic Janus Dendrimers for mRNA Nanomedicine[J]. Chinese Journal of Polymer Science, 2026, 44(4): 1154-1164. DOI: 10.1007/s10118-026-3563-3.
A tailored machine-learning (ML) framework was developed to investigate the relationship of molecular structures of ionizable amphiphilic Janus dendrimers (IAJDs) and their mRNA delivery effects. Count-based fingerprints and ChemBERTa embeddings improve accuracy and interpretability
revealing key motifs and enabling ML-guided design and optimization of IAJDs for mRNA delivery.
The efficient and safe delivery of messenger RNA (mRNA) therapeutics remains a critical challenge for clinical translation
driving the need for advanced carrier design. Ionizable amphiphilic Janus dendrimers (IAJDs) represent a promising class of carriers; however
their structural complexity and limited available datasets hinder systematic exploration and optimization. In this study
we established a tailored machine-learning framework to investigate the structure-function relationships of IAJDs under a constrained data regime (
n
=231). Convention
al molecular fingerprints were found to be suboptimal for representing these macromolecules
motivating the adoption of count-based descriptors and systematic ablation analyses to disentangle the contributions of the substructural features. These experiments identified key functional motifs underlying transfection performance and provided interpretable insights into the IAJD design principles. Complementing these handcrafted descriptors
we further applied deep learning-based molecular embeddings
which captured higher-order chemical semantics and significantly improved predictive accuracy. Collectively
these advances demonstrate that both refined fingerprinting and representation learning approaches can overcome data limitations
enabling the reliable prediction of IAJD activity while offering mechanistic interpretability. This study illustrates the potential of data-driven strategies as hypothesis-generation and prioritization tools for the design of next-generation mRNA delivery systems.
[Sahin, U.; Karikó, K.; Türeci, Ö. mRNA-based therapeutics — developing a new class of drugs. Nat. Rev. Drug. Discov . 2014 , 13 , 759−780..
[Pardi, N.; Hogan, M. J.; Porter, F. W.; Weissman, D. mRNA vaccines—a new era in vaccinology. Nat. Rev. Drug. Discov . 2018 , 17 , 261−279..
Polack, F. P.; Thomas, S. J.; Kitchin, N.; Absalon, J.; Gurtman, A.; Lockhart, S.; Perez, J. L.; Pérez; Marc, G.; Moreira, E. D.; Zerbini, C.; Bailey, R.; Swanson, K. A.; Roychoudhury, S.; Koury, K.;Li, P.; Kalina, W. V.; Cooper, D.; Frenck, R. W., Jr.; Hammitt, L. L.; Türeci, Ö; Nell, H.; Schaefer, A.; Ünal, S.; Tresnan, D. B.; Mather, S.; Dormitzer, P. R.; Şahin, U.; Jansen, K. U.; Gruber, W. C. Safety and efficacy of the BNT162b2 mRNA COVID-19 vaccine. N. Engl. J. Med. 2020 , 383 , 2603−2615..
Baden, L. R.; El Sahly, H. M.; Essink, B.; Kotloff, K.; Frey, S.; Novak, R.; Diemert, D.; Spector, S. A.; Rouphael, N.; Creech, C. B.; McGettigan, J.; Khetan, S.; Segall, N.; Solis, J.; Brosz, A.; Fierro, C.; Schwartz, H.; Neuzil, K.; Corey, L.; Gilbert, P.; Janes, H.; Follmann, D.; Marovich, M.; Mascola, J.; Polakowski, L.; Ledgerwood, J.; Graham, B. S.; Bennett, H.; Pajon, R.; Knightly, C.; Leav, B.; Deng, W.; Zhou, H.; Han, S.; Ivarsson, M.; Miller, J.; Zaks, T. Efficacy and safety of the mRNA-1273 SARS-CoV-2 vaccine. N. Engl. J. Med. 2021 , 384 , 403−416..
Hou, Xucheng; Zaks, Tal; Langer, Robert; Dong, Yizhou. Lipid nanoparticles for mRNA delivery. Nat. Rev. Mater. 2021 , 6 , 1078−1094..
Zhang, D.; Atochina-Vasserman, E. N.; Maurya, D. S.; Huang, N.; Xiao, Q.; Ona, N.; Liu, M.; Shahnawaz, H.; Ni, H. P.; Kim, K.; Billingsley, M. M.; Pochan, D. J.; Mitchell, M. J.; Weissman, D.; Percec, V. One-component multifunctional sequence-defined ionizable amphiphilic Janus dendrimer delivery systems for mRNA. J. Am. Chem. Soc. 2021 , 143 , 12315−12327..
Zhu, P.; Li, Y.; Zhang, D. One-component ionizable amphiphilic Janus dendrimers for targeted mRNA delivery. Angew. Chem. Int. Ed. 2025 , 64 , e202505304..
Semple, S. C.; Akinc, A.;Chen, J.; Sandhu, A. P.; Mui, B. L.; Cho, C. K.; Sah, D. W. Y.; Stebbing, D.; Crosley, E. J.; Yaworski, Ed; Hafez, I. M.; Dorkin, J. R.; Qin, J.; Lam, K.; Rajeev, K. G.; Wong, K. F.; Jeffs, L. B.; Nechev, L.; Eisenhardt, M. L.; Jayaraman, M.; Kazem, M.; Maier, M. A.; Srinivasulu, M.; Weinstein, M. J.; Chen, Q.; Alvarez, R.; Barros, S. A.; De, S.; Kli muk, S. K.; Borland, T.; Kosovrasti, V.; Cantley, W. L.; Tam, Y. K.; Manoharan, M.; Ciufolini, M. A.; Tracy, M. A.; de Fougerolles, A.; MacLachlan, I.; Cullis, P. R.; Madden, T. D.; Hope, M. J. Rational design of cationic lipids for siRNA delivery. Nat. Biotechnol. 2010 , 28 , 172−176..
Kulkarni, J. A.; Witzigmann, D.; Chen, S.; Cullis, P. R.; van der Meel, R. Lipid nanoparticle technology for clinical translation of siRNA therapeutics. Acc. Chem. Res. 2019 , 52 , 2435−2444..
Arshad, M.; Atochina-Vasserman, E. N.; Maurya, D. S.; Lu, J.; Ona, N.; Vasserman, J. A.; McWilliams, B. C.; Ni, H.; Berkihiser, S.; Park, W; Weissman, D.; Percec, V. Harnessing the electron-withdrawing inductive effect of one-component ionizable amphiphilic Janus dendrimers unveils cation–π interactions and their important roles to targeted mRNA delivery. J. Am. Chem. Soc. 2025 , 147 , 21347−21356..
Feng, W. X.; Zhang, S. Q.; Xu, Y. Y.; Ye, X. F.; Xu, X. Y.; Wang L. Q.; Lin, J. P.; Cai C. H.; Du, L. Machine-learning-assisted materials genome approach for designing high-performance thermosetting polyimides. Chinese J. Polym. Sci. 2025 , 43 , 1718−1729..
Butler, K. T.; Davies, D. W.; Cartwright, H.; Isayev, O.; Walsh, A. Machine learning for molecular and materials science. Nature 2018 , 559 , 547−555..
Cheng, Q.; Wei, T.; Farbiak, L.; Johnson, L. T.; Dilliard, S. A.; Siegwart, D. J. Selective organ targeting (SORT) nanoparticles for tissue-specific mRNA delivery and CRISPR–Cas gene editing. Nat. Nanotechnol. 2020 , 15 , 313−320..
Witten, J; Raji, I; Manan, R. S.; Beyer, E.; Bartlett, S.; Tang, Y.; Ebadi, M.; Lei, J.; Nguyen, D.; Oladimeji, F.; Jiang, A. Y.; MacDonald, E.; Hu, Y.; Mughal, H.; Self, A.; Collins, E.; Yan, Z.; Engelhardt, J. F.; Langer, R.; Anderson, D. G. Artificial intelligence-guided design of lipid nanoparticles for pulmonary gene therapy. Nat. Biotechnol. 2025 , 43 , 1790−1799..
Li, B.; Manan, R. S.; Liang, S.; Gordon, A.; Jiang, A.; Varley, A.; Gao, G.; Langer, R.; Xue, W.; Anderson, D. Combinatorial design of nanoparticles for pulmonary mRNA delivery and genome editing. Nat. Biotechnol. 2023 , 41 , 1410−1415..
[Kim J. Optimization of IAJD design for DNP mRNA delivery. [accessed 2025, August 1]https://isef.net/project/bchm013-optimization-of-iajd-design-for-dnp-mrna-delivery.
Zhang, D.; Atochina-Vasserman, E. N.; Maurya, D. S.; Liu, M.; Xiao, Q.; Lu, J.; Lauri, G.; Ona, N.; Reagan, E.; Ni, H.; Weissman, D.; Percec, V. Targeted delivery of mRNA with one-component ionizable amphiphilic Janus dendrimers. J. Am. Chem. Soc. 2021 , 143 , 17975−17982..
Zhang, D.; Atochina-Vasserman, E. N.; Lu, J.; Maurya, D.; Xiao, Q.; Liu, M.; Adamson, J.; Ona, N.; Reagan, E. K.; Ni, H. P.; Weissman, D.; Percec, V. The unexpected importance of the primary structure of the hydrophobic part of one-component ionizable amphiphilic Janus dendrimers in targeted mRNA delivery activity. J. Am. Chem. Soc. 2022 , 144 , 4746−4753..
Lu, J.; Atochina-Vasserman, E. N.; Maurya, D. S.; Sahoo, D.; Ona, N.; Reagan, E. K.; Ni, H. P.; Weissman, D.; Percec, V. Targeted and equally distributed delivery of mRNA to organs with pentaerythritol-based one-component ionizable amphiphilic Janus dendrimers. J. Am. Chem. Soc. 2023 , 145 , 18760−18766..
Sahoo, D.; Atochina-Vasserman, E. N.; Maurya, D. S.; Arshad, M.; Chenna, S. S.; Ona, N.; Vasserman, J. A.; Ni, H. P.; Weissman, D.; Percec, V. The constitutional isomerism of one-component ionizable amphiphilic Janus dendrimers orchestrates the total and targeted activities of mRNA delivery. J. Am. Chem. Soc. 2024 , 146 , 3627−3634..
Weininger D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 1988 , 28 , 31−36..
Rogers, D.; Hahn, M. Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010 , 50 , 742−754..
Cover, T. M.; Hart, P. E. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 1967 , 13 , 21−27..
Morgan, H. L. The generation of a unique machine description for chemical structures—A technique developed at chemical abstracts service. J. Chem. Doc. 1965 , 5 , 107−113..
[Bajusz, D.; Rácz, A.; Héberger, K. Why is tanimoto index an appropriate choicefor fingerprint-based similarity calculations J. Cheminform . 2015 , 7 , 20..
[Chithra nanda, S.; Grand, G.; Ramsundar, B. ChemBERTa: large-scale self-supervised pretraining for molecular property prediction. arXiv 2020 , arXiv:2010.09885..
Breiman, L. Random forests. Mach. Learn. 2001 , 45 , 5−32..
[Hosmer, D. W.; Lemeshow, S.; Sturdivant, R. X. Applied logistic regression. In Applied Logistic Regression, 3rd ed .; Wiley: Hoboken, NJ, 2013 ; pp 89−151..
[Chen, T.; Guestrin, C. XGBoost: a scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ’16); ACM: New York , 2016 , pp. 785−794..
Krenn, M.; Häse, F.; Nigam, A.; Friederich, P.; Aspuru-Guzik, A. Self-referencing embedded strings (SELFIES): a 100% robust molecular string representation. Mach. Learn.: Sci. Technol. 2020 , 1 , 045024..
Degen, J.; Wegscheid-Gerlach, C.; Zaliani, A.; Rarey, M. On the art of compiling and using “drug-like” chemical fragment spaces. ChemMedChem 2008 , 3 , 1503−1507..
Ertl, P.; Schuffenhauer, A. Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J. Cheminform. 2009 , 1 , 8..
Bickerton, G. R.; Paolini, G. V.; Besnard, J.; Muresan, S.; Hopkins, A. L. Quantifying the chemical beauty of drugs. Nat. Chem. 2012 , 4 , 90−98..
Lipinski, C. A.; Lombardo, F.; Dominy, B. W.; Feeney, P. J. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv. Drug Deliv. Rev. 2001 , 46 , 3−26..
0
Views
77
Downloads
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802046900号