Generative Modeling of Entangled Polymers with a Distance-Based Variational Autoencoder
Pietro Chiarantoni, Oscar Serra, Mohammad Erfan Mowlaei, Venkata Surya Kumar Choutipalli, Mark DelloStritto, Xinghua Shi, Micheal L. Klein, Vincenzo Carnevale
TL;DR
Generating dense polymer globule configurations via molecular dynamics is computationally intensive, especially in melts. The authors train a distance-matrix variational autoencoder with a Conv-Transformer encoder and a Gaussian mixture latent prior, then generate new configurations by decoding latent samples and embedding with multidimensional scaling followed by short SDK MD relaxation. Reconstructions reproduce key observables such as the radial distribution function and topological measures, while generated samples can be physically viable and novel, albeit with a broader energy distribution that requires filtering. Overall, the framework offers a scalable approach to embed, sample, and generate dense polymer configurations, with potential extensions to coordinate-based and atomistic simulations for broader applicability.
Abstract
We present a variational autoencoder framework for learning and generating configurations of structured polymer globules from distance matrices. We used coarse-grained molecular dynamics to sample polyethylene structures, which we used as the training set for our deep learning model. By combining convolution and attention layers, the model encodes the structural patterns of distance matrices into an organized and roto-translationally invariant latent space of lower dimensionality. The generative capability of the variational autoencoder, coupled with a post-processing pipeline based on multidimensional scaling and short molecular dynamics, enables the recovery of physically meaningful configurations. The reconstructed and generated samples reproduce key observables, including energy, size, and entanglement, despite minor discrepancies in the raw decoder output.
