TinySplat: Feedforward Approach for Generating Compact 3D Scene Representation

📅 2025-06-11
📈 Citations: 0
Influential: 0
🤖 AI Summary
Existing feedforward 3D Gaussian Splatting (3DGS) methods reconstruct scenes quickly but incur substantial storage overhead for Gaussian parameters, and existing 3DGS compression techniques that rely on scene-wise optimization cannot be applied because of architectural incompatibilities. This paper introduces TinySplat, a complete feedforward approach that generates compact 3D scene representations directly from sparse input views, without per-scene iterative optimization. Built on standard feedforward 3DGS methods, it adds a training-free compression framework that removes three sources of redundancy: View-Projection Transformation (VPT) reduces geometric redundancy, Visibility-Aware Basis Reduction (VABR) mitigates perceptual redundancy, and an off-the-shelf video codec addresses spatial redundancy. Experiments show that TinySplat achieves over 100× compression of Gaussian data produced by feedforward methods and, compared with the state-of-the-art compression approach, reaches comparable quality with only 6% of the storage size, 25% of the encoding time, and 1% of the decoding time.

📝 Abstract
The recent development of feedforward 3D Gaussian Splatting (3DGS) presents a new paradigm to reconstruct 3D scenes. Using neural networks trained on large-scale multi-view datasets, it can directly infer 3DGS representations from sparse input views. Although the feedforward approach achieves high reconstruction speed, it still suffers from the substantial storage cost of 3D Gaussians. Existing 3DGS compression methods relying on scene-wise optimization are not applicable due to architectural incompatibilities. To overcome this limitation, we propose TinySplat, a complete feedforward approach for generating compact 3D scene representations. Built upon standard feedforward 3DGS methods, TinySplat integrates a training-free compression framework that systematically eliminates key sources of redundancy. Specifically, we introduce View-Projection Transformation (VPT) to reduce geometric redundancy by projecting geometric parameters into a more compact space. We further present Visibility-Aware Basis Reduction (VABR), which mitigates perceptual redundancy by aligning feature energy along dominant viewing directions via basis transformation. Lastly, spatial redundancy is addressed through an off-the-shelf video codec. Comprehensive experimental results on multiple benchmark datasets demonstrate that TinySplat achieves over 100x compression for 3D Gaussian data generated by feedforward methods. Compared to the state-of-the-art compression approach, we achieve comparable quality with only 6% of the storage size. Meanwhile, our compression framework requires only 25% of the encoding time and 1% of the decoding time.
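The ratios reported in the abstract can be turned into concrete numbers with a little arithmetic. The sketch below is illustrative only: the 200 MB raw size and the function names are assumptions chosen for the example, not values or code from the paper.

```python
# Illustrative arithmetic only. The 200 MB raw scene size is a made-up
# example value; the ratios (100x, 6%, 25%, 1%) come from the abstract.

def size_after_compression(raw_mb: float, ratio: float) -> float:
    """Size left after compressing raw_mb by `ratio` (100 means 100x)."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return raw_mb / ratio


def baseline_from_fraction(ours: float, fraction: float) -> float:
    """Baseline cost implied when `ours` equals `fraction` of the baseline."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    return ours / fraction


raw_mb = 200.0                                        # hypothetical raw Gaussian data
ours_mb = size_after_compression(raw_mb, 100.0)       # 100x compression
baseline_mb = baseline_from_fraction(ours_mb, 0.06)   # ours is 6% of the baseline

encode_speedup = 1 / 0.25   # 25% of the encoding time
decode_speedup = 1 / 0.01   # 1% of the decoding time

print(f"compressed size: {ours_mb:.2f} MB")
print(f"implied baseline size: {baseline_mb:.2f} MB")
print(f"encode speedup: {encode_speedup:.0f}x, decode speedup: {decode_speedup:.0f}x")
```

Because the abstract says "over 100x", the compressed size computed here is an upper bound for the hypothetical scene. The 6% figure is relative to the compared compression approach, not to the raw data.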
Problem

Research questions and friction points this paper is trying to address.

Reduces storage cost of 3D Gaussian representations
Eliminates redundancy in 3D scene compression
Improves speed and efficiency of 3DGS compression
Innovation

Methods, ideas, or system contributions that make the work stand out.

View-Projection Transformation reduces geometric redundancy
Visibility-Aware Basis Reduction mitigates perceptual redundancy
Off-the-shelf video codec addresses spatial redundancy
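Standard video codecs operate on integer sample values, so floating-point Gaussian attributes generally have to be quantized before they can be encoded. The page does not describe how TinySplat does this; the sketch below assumes plain uniform min-max quantization to 8 bits, and the function names are hypothetical.

```python
# A minimal sketch of uniform min-max quantization, assuming 8-bit samples.
# This is NOT taken from the paper; it only illustrates a common
# preprocessing step before handing data to a video codec.

def quantize(values, bits=8):
    """Map floats to integer codes in [0, 2**bits - 1]."""
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    # Guard against a constant input, which would give a zero step size.
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [min(levels, max(0, round((v - lo) / scale))) for v in values]
    return codes, lo, scale


def dequantize(codes, lo, scale):
    """Invert quantize() up to the rounding error."""
    return [lo + c * scale for c in codes]


attribute = [-1.0, -0.25, 0.0, 0.5, 1.0]    # hypothetical attribute values
codes, lo, scale = quantize(attribute)
restored = dequantize(codes, lo, scale)
max_error = max(abs(a - b) for a, b in zip(attribute, restored))
print(codes, f"max error {max_error:.5f}")
```

The reconstruction error of this scheme is bounded by half a quantization step, which is why narrowing the value range before quantization reduces the loss at a fixed bit depth.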
Zetian Song
State Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University
Jiaye Fu
School of Electronic and Computer Engineering, Peking University, Shenzhen, and State Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University
Jiaqi Zhang
State Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University
Xiaohan Lu
State Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University
Chuanmin Jia
Peking University
Video Coding, Multimedia, Data Compression
Siwei Ma
State Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University
Wen Gao
State Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University