Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research

Krishna Murthy Jatavallabhula; Edward Smith; Jean-Francois Lafleche; Clement Fuji Tsang; Artem Rozantsev; Wenzheng Chen; Tommy Xiang; Rev Lebaredian; Sanja Fidler

Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research

Krishna Murthy Jatavallabhula, Edward Smith, Jean-Francois Lafleche, Clement Fuji Tsang, Artem Rozantsev, Wenzheng Chen, Tommy Xiang, Rev Lebaredian, Sanja Fidler

TL;DR

<3-5 sentence high-level summary> Kaolin addresses the fragmentation in 3D deep learning by providing an integrated PyTorch-based library that covers data loading, multi-representation support, differentiable rendering, metrics, visualization, and a pretrained-model ecosystem. Its modular architecture includes a flexible DifferentiableRenderer, differentiable geometry operations, and broad dataset compatibility, enabling rapid prototyping across meshes, point clouds, voxels, SDFs, and RGB-D data. The paper highlights performance-oriented implementations and a rich model zoo to standardize benchmarks and accelerate research in 3D tasks such as reconstruction, segmentation, and reasoning under 2D supervision. The work aims to lower entry barriers, promote reproducibility, and foster community contributions through open-source development and extensible tooling.

Abstract

We present Kaolin, a PyTorch library aiming to accelerate 3D deep learning research. Kaolin provides efficient implementations of differentiable 3D modules for use in deep learning systems. With functionality to load and preprocess several popular 3D datasets, and native functions to manipulate meshes, pointclouds, signed distance functions, and voxel grids, Kaolin mitigates the need to write wasteful boilerplate code. Kaolin packages together several differentiable graphics modules including rendering, lighting, shading, and view warping. Kaolin also supports an array of loss functions and evaluation metrics for seamless evaluation and provides visualization functionality to render the 3D results. Importantly, we curate a comprehensive model zoo comprising many state-of-the-art 3D deep learning architectures, to serve as a starting point for future research endeavours. Kaolin is available as open-source software at https://github.com/NVIDIAGameWorks/kaolin/.

Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research

TL;DR

Abstract

Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)