Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

Tam Thuc Do; Philip A. Chou; Gene Cheung

Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

Tam Thuc Do, Philip A. Chou, Gene Cheung

TL;DR

To improve coding performance over 2023volumetric, a nested sequences of function subspaces is considered and a new nonlinear predictor using a polynomial of bilateral filter is proposed using a polynomial of bilateral filter.

Abstract

We study 3D point cloud attribute compression via a volumetric approach: assuming point cloud geometry is known at both encoder and decoder, parameters $θ$ of a continuous attribute function $f: \mathbb{R}^3 \mapsto \mathbb{R}$ are quantized to $\hatθ$ and encoded, so that discrete samples $f_{\hatθ}(\mathbf{x}_i)$ can be recovered at known 3D points $\mathbf{x}_i \in \mathbb{R}^3$ at the decoder. Specifically, we consider a nested sequences of function subspaces $\mathcal{F}^{(p)}_{l_0} \subseteq \cdots \subseteq \mathcal{F}^{(p)}_L$, where $\mathcal{F}_l^{(p)}$ is a family of functions spanned by B-spline basis functions of order $p$, $f_l^*$ is the projection of $f$ on $\mathcal{F}_l^{(p)}$ represented as low-pass coefficients $F_l^*$, and $g_l^*$ is the residual function in an orthogonal subspace $\mathcal{G}_l^{(p)}$ (where $\mathcal{G}_l^{(p)} \oplus \mathcal{F}_l^{(p)} = \mathcal{F}_{l+1}^{(p)}$) represented as high-pass coefficients $G_l^*$. In this paper, to improve coding performance over \cite{do2023volumetric}, we study predicting $f_{l+1}^*$ at level $l+1$ given $f_l^*$ at level $l$ and encoding of $G_l^*$ for the $p=1$ case (RAHT($1$)). For the prediction, we formalize RAHT(1) linear prediction in MPEG-PCC in a theoretical framework, and propose a new nonlinear predictor using a polynomial of bilateral filter. We derive equations to efficiently compute the critically sampled high-pass coefficients $G_l^*$ amenable to encoding. We optimize parameters in our resulting feed-forward network on a large training set of point clouds by minimizing a rate-distortion Lagrangian. Experimental results show that our improved framework outperforms the MPEG G-PCC predictor by $11\%$--$12\%$ in bit rate.

Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

TL;DR

Abstract

We study 3D point cloud attribute compression via a volumetric approach: assuming point cloud geometry is known at both encoder and decoder, parameters

of a continuous attribute function

are quantized to

and encoded, so that discrete samples

can be recovered at known 3D points

at the decoder. Specifically, we consider a nested sequences of function subspaces

, where

is a family of functions spanned by B-spline basis functions of order

is the projection of

represented as low-pass coefficients

, and

is the residual function in an orthogonal subspace

(where

) represented as high-pass coefficients

. In this paper, to improve coding performance over \cite{do2023volumetric}, we study predicting

at level

given

at level

and encoding of

for the

case (RAHT(

)). For the prediction, we formalize RAHT(1) linear prediction in MPEG-PCC in a theoretical framework, and propose a new nonlinear predictor using a polynomial of bilateral filter. We derive equations to efficiently compute the critically sampled high-pass coefficients

amenable to encoding. We optimize parameters in our resulting feed-forward network on a large training set of point clouds by minimizing a rate-distortion Lagrangian. Experimental results show that our improved framework outperforms the MPEG G-PCC predictor by

in bit rate.

Paper Structure (13 sections, 34 equations, 2 figures)

This paper contains 13 sections, 34 equations, 2 figures.

Introduction
Coding Framework
Low-pass Coefficients
High-pass Coefficients
Constrained Prediction & Residual Coding
Constrained Up-sampling Prediction
Locally Linear Prediction
Polynomial Bilateral Filter Prediction
Experiments
Training
Baselines
Results
Conclusion

Figures (2)

Figure 1: (a) Visualization of constrained Up-sampling Prediction, (b) Multilevel feedforward network implementing point cloud attribute encoder (blue and green) and decoder (green).
Figure 2: Rate-Distortion curves: (a) Longdress, (b) Redandblack, (c) Loot, (d) Soldier

Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

TL;DR

Abstract

Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

Authors

TL;DR

Abstract

Table of Contents

Figures (2)