DNA Origami Nanostructures Observed in Transmission Electron Microscopy Images can be Characterized through Convolutional Neural Networks

Xingfei Wei; Qiankun Mo; Chi Chen; Mark Bathe; Rigoberto Hernandez

DNA Origami Nanostructures Observed in Transmission Electron Microscopy Images can be Characterized through Convolutional Neural Networks

Xingfei Wei, Qiankun Mo, Chi Chen, Mark Bathe, Rigoberto Hernandez

TL;DR

This paper addresses the challenge of determining the ligation number of DNA origami nanostructures bound to quantum dots in TEM images. It introduces a transfer learning workflow that pre-trains CNNs on a large set of CG MD-generated images and then fine-tunes them on a small TEM dataset to classify ligation numbers. Among nine CNN architectures, pre-trained ResNet50 and VGG16 perform best on MD test images, while fine-tuned VGG16 yields the highest agreement on TEM images, demonstrating a practical path to rapid, bias-reduced TEM analysis. The approach enables scalable, automated characterization of complex nanostructures in large TEM images and provides publicly available code and data for reuse.

Abstract

Artificial intelligence (AI) models remain an emerging strategy to accelerate materials design and development. We demonstrate that convolutional neural network (CNN) models can characterize DNA origami nanostructures employed in programmable self-assembling, which is important in many applications such as in biomedicine. Specifically, we benchmark the performance of 9 CNN models -- viz. AlexNet, GoogLeNet, VGG16, VGG19, ResNet18, ResNet34, ResNet50, ResNet101, and ResNet152 -- to characterize the ligation number of DNA origami nanostructures in transmission electron microscopy (TEM) images. We first pre-train CNN models using a large image dataset of 720 images from our coarse-grained (CG) molecular dynamics (MD) simulations. Then, we fine-tune the pre-trained CNN models, using a small experimental TEM dataset with 146 TEM images. All CNN models were found to have similar computational time requirements, while their model sizes and performances are different. We use 20 test MD images to demonstrate that among all of the pre-trained CNN models ResNet50 and VGG16 have the highest and second highest accuracies. Among the fine-tuned models, VGG16 was found to have the highest agreement on the test TEM images. Thus, we conclude that fine-tuned VGG16 models can quickly characterize the ligation number of nanostructures in large TEM images.

DNA Origami Nanostructures Observed in Transmission Electron Microscopy Images can be Characterized through Convolutional Neural Networks

TL;DR

Abstract

DNA Origami Nanostructures Observed in Transmission Electron Microscopy Images can be Characterized through Convolutional Neural Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)