Lightweight Predictive 3D Gaussian Splats

LPGS models the scenes with the parent(yellow) and child(green) splats where important child splats can be promoted to parent via Adaptive Tree Manipulation (ATM).

Abstract

Recent approaches representing 3D objects and scenes using Gaussian splats show increased rendering speed across a variety of platforms and devices. While rendering such representations is indeed extremely efficient, storing and transmitting them is often prohibitively expensive. To represent large-scale scenes, one often needs to store millions of 3D Gaussians, occupying gigabytes of disk space. This poses a very practical limitation, prohibiting widespread adoption. Several solutions have been proposed to strike a balance between disk size and rendering quality, noticeably reducing the visual quality. In this work, we propose a new representation that dramatically reduces the hard drive footprint while featuring similar or improved quality when compared to the standard 3D Gaussian splats. When compared to other compact solutions, ours offers higher quality renderings with significantly reduced storage, being able to efficiently run on a mobile device in real-time. Our key observation is that nearby points in the scene can share similar representations. Hence, only a small ratio of 3D points needs to be stored. We introduce an approach to identify such points—called parent points. The discarded points—children points—along with attributes can be efficiently predicted by tiny MLPs.

TL;DR: We propose a novel Gaussian Splat representation requiring much less storage, featuring superior rendering quality, and being able to run on mobile devices in real-time.

360 Orbit Camera Videos

Real-time Demo on iPhone 14

If you browser is not displaying the videos, try to scroll down and place the videos on the top of your screen.

Visual Comparisons: Ourdoor Scenes from Mip-NeRF 360 Dataset

Ours: PSNR 27.63 / 39.4 MB
3DGS: PSNR 27.41 / 1331 MB
Ours: PSNR 27.63 / 39.4 MB
CompactGS: PSNR 26.81 / 62.78 MB
Ours: PSNR 27.63 / 39.4 MB
LightGS: PSNR 26.73 / -

Visual Comparisons: Indoor Scenes from Mip-NeRF 360 Dataset

Ours: PSNR 31.84 / 28.95 MB
3DGS: PSNR 30.63 / 350 MB
Ours: PSNR 31.84 / 28.95 MB
CompactGS: PSNR 30.88 / 34.21 MB
Ours: PSNR 31.84 / 28.95 MB
LightGS: PSNR 31.27 / -
Ours: PSNR 29.10 / 30.02 MB
3DGS: PSNR 28.70 / 276.52 MB
Ours: PSNR 29.10 / 30.02 MB
CompactGS: PSNR 28.71 / 34.34 MB
Ours: PSNR 29.10 / 30.02 MB
LightGS: PSNR 28.11 / -

Visual Comparisons: Deep Blending Dataset

Ours: PSNR 29.34 / 35.00 MB
3DGS: PSNR 28.77 / 774 MB
Ours: PSNR 29.34 / 35.00 MB
CompactGS: PSNR 29.26 / 48 MB
Ours: PSNR 29.34 / 35.00 MB
LightGS
Ours: PSNR 30.44 / 35.80 MB
3DGS: PSNR 30.04 / 553 MB
Ours: PSNR 30.44 / 35.80 MB
CompactGS: PSNR 30.32 / 39 MB
Ours: PSNR 30.44 / 35.80 MB
LightGS

Visual Comparisons: Tank&Temples Dataset

Ours: PSNR 25.45 / 35.78 MB
3DGS: PSNR 25.19 / 608.7 MB
Ours: PSNR 25.45 / 35.78 MB
CompactGS: PSNR 25.07 / 41.57 MB
Ours: PSNR 25.45 / 35.78 MB
LightGS: PSNR 24.56 / -
Ours: PSNR 21.97 / 37.02 MB
3DGS: PSNR 21.10 / PSNR 255.82 MB
Ours: PSNR 21.97 / 37.02 MB
CompactGS: PSNR 21.56 / 37.29 MB
Ours: PSNR 21.97 / 37.02 MB
LightGS: PSNR 21.09 / -

We thank the authors of Nerfies that kindly open sourced the template of this website.