Molecular Dynamics and Machine Learning Unlock Possibilities in Beauty Design -- A Perspective
Yuzhi Xu, Haowei Ni, Qinhui Gao, Chia-Hua Chang, Yanran Huo, Fanyu Zhao, Shiyu Hu, Wei Xia, Yike Zhang, Radu Grovu, Min He, John. Z. H. Zhang, Yuanqing Wang
TL;DR
The paper argues that beauty design can be accelerated by integrating molecular dynamics (MD) simulations with machine learning (ML) to move beyond empirical trial-and-error approaches. It frames MD sampling from the Boltzmann distribution $p(\mathbf{x}|\mathcal{G}) \propto \exp(-u(\mathbf{x};\mathcal{G}))$ for conformational ensembles and data-driven structure–property mappings $p(\hat{y}_{\mathcal{G}}|\mathcal{G},\Theta,\mathcal{M})$, incorporating Bayesian uncertainty and active learning. It surveys MD-based free-energy methods (MM-GBSA/MM-PBSA, ASGBIE, alchemical TI/FEP with BAR/MBAR) and ML toolkits (graph neural networks, generative and diffusion models, Bayesian optimization, and LLMs) for predicting skin-related endpoints such as color, hydration, melanogenesis, and permeation. It closes with a concrete roadmap of projects—arbutin modification, HA ADMET optimization, cosmetic peptides, and precision mixtures—aimed at delivering safe, affordable, and effective cosmeceuticals.
Abstract
Computational molecular design -- the endeavor to design molecules, with various missions, aided by machine learning and molecular dynamics approaches, has been widely applied to create valuable new molecular entities, from small molecule therapeutics to protein biologics. In the small data regime, physics-based approaches model the interaction between the molecule being designed and proteins of key physiological functions, providing structural insights into the mechanism. When abundant data has been collected, a quantitative structure-activity relationship (QSAR) can be more directly constructed from experimental data, from which machine learning can distill key insights to guide the design of the next round of experiment design. Machine learning methodologies can also facilitate physical modeling, from improving the accuracy of force fields and extending them to unseen chemical spaces, to more directly enhancing the sampling on the conformational spaces. We argue that these techniques are mature enough to be applied to not just extend the longevity of life, but the beauty it manifests. In this perspective, we review the current frontiers in the research \& development of skin care products, as well as the statistical and physical toolbox applicable to addressing the challenges in this industry. Feasible interdisciplinary research projects are proposed to harness the power of machine learning tools to design innovative, effective, and inexpensive skin care products.
