Texture-Guided Graph Transform Optimization for Point Cloud Attribute Compression
Abstract
:1. Introduction
- We design a novel point cloud graph transform coding framework, which incorporates region-adaptive graph construction and R-D optimized transform to improve coding performance. By formulating attribute transform as a graph optimization problem, our scheme achieves optimal transform coding performance by effectively leveraging attribute correlation, thus outperforming state-of-the-art methods.
- We propose a texture-guided graph optimization scheme to fully capture underlying attribute correlation in point clouds. Texture analysis is performed in point cloud local regions to guide graph optimization. By optimizing graph construction with a tailored optimization function, the scheme improves the fidelity of graph representations and consequently enhances transform performance.
- We introduce a R-D optimized graph transform scheme with graph sparsity constraints. Our optimization framework considers both the decorrelation capability of the transform and the sparsity of the constructed graph. This enhancement not only bolsters coding performance but also augments the efficiency of graph-based point cloud transform processing.
2. Related Work on Point Cloud Attribute Compression
3. The Proposed Texture-Guided Graph Transform Optimization Scheme for Point Cloud Attribute Compression
3.1. Problem Formulation
3.2. Overview of Our Proposed Framework
3.3. Point Cloud Reorganization and Clustering
3.4. Attribute Inter-Cluster Prediction and Intra-Cluster Analysis
3.5. Point Cloud Graph Transform Optimization
4. Experimental Results
4.1. Simulation Setup
4.2. Compression Performance Evaluation
4.3. Reconstruction Quality Evaluation
4.4. Ablation Studies
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- He, Y.; Liang, B.; Yang, J.; Li, S.; He, J. An iterative closest points algorithm for registration of 3D laser scanner point clouds with geometric features. Sensors 2017, 17, 1862. [Google Scholar] [CrossRef] [PubMed]
- Yue, Y.; Li, X.; Peng, Y. A 3D Point Cloud Classification Method Based on Adaptive Graph Convolution and Global Attention. Sensors 2024, 24, 617. [Google Scholar] [CrossRef] [PubMed]
- Feng, Y.; Zeng, S.; Liang, T. Part2Point: A Part-Oriented Point Cloud Reconstruction Framework. Sensors 2024, 24, 34. [Google Scholar] [CrossRef] [PubMed]
- Zhuang, L.; Tian, J.; Zhang, Y.; Fang, Z. Variable Rate Point Cloud Geometry Compression Method. Sensors 2023, 23, 5474. [Google Scholar] [CrossRef] [PubMed]
- Graziosi, D.; Nakagami, O.; Kuma, S.; Zaghetto, A.; Suzuki, T.; Tabatabai, A. An overview of ongoing point cloud compression standardization activities: Video-based (V-PCC) and geometry-based (G-PCC). APSIPA Trans. Signal Inf. Process. 2020, 9, e13. [Google Scholar] [CrossRef]
- Schwarz, S.; Preda, M.; Baroncini, V.; Budagavi, M.; Cesar, P.; Chou, P.A.; Cohen, R.A.; Krivokuća, M.; Lasserre, S.; Li, Z.; et al. Emerging MPEG standards for point cloud compression. IEEE J. Emerg. Sel. Top. Circuits Syst. 2018, 9, 133–148. [Google Scholar] [CrossRef]
- Cao, C.; Preda, M.; Zaharia, T. 3D point cloud compression: A survey. In Proceedings of the 24th International Conference on 3D Web Technology, Los Angeles, CA, USA, 26–28 July 2019; pp. 1–9. [Google Scholar]
- Wang, J.; Ding, D.; Li, Z.; Feng, X.; Cao, C.; Ma, Z. Sparse Tensor-Based Multiscale Representation for Point Cloud Geometry Compression. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 9055–9071. [Google Scholar] [CrossRef] [PubMed]
- Guo, T.; Yuan, H.; Wang, L.; Wang, T. Rate-distortion optimized quantization for geometry-based point cloud compression. J. Electron. Imaging 2023, 32, 013047. [Google Scholar] [CrossRef]
- Zhang, J.; Chen, T.; Ding, D.; Ma, Z. YOGA: Yet Another Geometry-based Point Cloud Compressor. In Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada, 29 October–3 November 2023; ACM: New York, NY, USA, 2023; pp. 9070–9081. [Google Scholar]
- Do, T.T.; Chou, P.A.; Cheung, G. Volumetric Attribute Compression for 3D Point Clouds Using Feedforward Network with Geometric Attention. In Proceedings of the ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 4–10 June 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–5. [Google Scholar]
- Liu, H.; Yuan, H.; Liu, Q.; Hou, J.; Zeng, H.; Kwong, S. A hybrid compression framework for color attributes of static 3D point clouds. IEEE Trans. Circuits Syst. Video Technol. 2021, 32, 1564–1577. [Google Scholar] [CrossRef]
- Wang, J.; Ding, D.; Ma, Z. Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction. In Proceedings of the 2023 Data Compression Conference (DCC), Snowbird, UT, USA, 21–24 March 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 228–237. [Google Scholar]
- Zhao, B.; Lin, W.; Lv, C. Fine-grained patch segmentation and rasterization for 3-d point cloud attribute compression. IEEE Trans. Circuits Syst. Video Technol. 2021, 31, 4590–4602. [Google Scholar] [CrossRef]
- Yang, X.; Shao, Y.; Liu, S.; Li, T.H.; Li, G. PDE-based Progressive Prediction Framework for Attribute Compression of 3D Point Clouds. In Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada, 29 October–3 November 2023; ACM: New York, NY, USA, 2023; pp. 9271–9281. [Google Scholar]
- Shao, Y.; Zhang, Q.; Li, G.; Li, Z.; Li, L. Hybrid point cloud attribute compression using slice-based layered structure and intra prediction. In Proceedings of the 26th ACM international conference on Multimedia, Seoul, Republic of Korea, 22–26 October 2018; pp. 1199–1207. [Google Scholar]
- ISO/IEC JTC 1/SC 29/WG 7; G-PCC Test Model v16. MPEG: Geneva, Switzerland, 2021.
- Song, F.; Li, G.; Yang, X.; Gao, W.; Liu, S. Block-Adaptive Point Cloud Attribute Coding with Region-Aware Optimized Transform. IEEE Trans. Circuits Syst. Video Technol. 2023, 33, 4294–4308. [Google Scholar]
- Ortega, A.; Frossard, P.; Kovačević, J.; Moura, J.M.; Vandergheynst, P. Graph signal processing: Overview, challenges, and applications. Proc. IEEE 2018, 106, 808–828. [Google Scholar] [CrossRef]
- Dong, X.; Thanou, D.; Toni, L.; Bronstein, M.; Frossard, P. Graph signal processing for machine learning: A review and new perspectives. IEEE Signal Process. Mag. 2020, 37, 117–127. [Google Scholar] [CrossRef]
- Hu, W.; Pang, J.; Liu, X.; Tian, D.; Lin, C.W.; Vetro, A. Graph signal processing for geometric data and beyond: Theory and applications. IEEE Trans. Multimed. 2021, 24, 3961–3977. [Google Scholar] [CrossRef]
- Zhang, C.; Florencio, D.; Loop, C. Point cloud attribute compression with graph transform. In Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Chongqing, China, 4–6 March 2022; IEEE: Piscataway, NJ, USA, 2014; pp. 2066–2070. [Google Scholar]
- Schnabel, R.; Klein, R. Octree-based Point-Cloud Compression. PBG@ SIGGRAPH 2006, 3, 111–121. [Google Scholar]
- Huang, Y.; Peng, J.; Kuo, C.C.J.; Gopi, M. A generic scheme for progressive point cloud coding. IEEE Trans. Vis. Comput. Graph. 2008, 14, 440–453. [Google Scholar] [CrossRef] [PubMed]
- Chen, J.; Yu, L.; Wang, W. Hilbert space filling curve based scan-order for point cloud attribute compression. IEEE Trans. Image Process. 2022, 31, 4609–4621. [Google Scholar] [CrossRef] [PubMed]
- Cohen, R.A.; Tian, D.; Vetro, A. Attribute compression for sparse point clouds using graph transforms. In Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 25–28 September 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1374–1378. [Google Scholar]
- Shao, Y.; Zhang, Z.; Li, Z.; Fan, K.; Li, G. Attribute compression of 3D point clouds using Laplacian sparsity optimized graph transform. In Proceedings of the 2017 IEEE Visual Communications and Image Processing (VCIP), St. Petersburg, FL, USA, 10–13 December 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–4. [Google Scholar]
- Xu, Y.; Hu, W.; Wang, S.; Zhang, X.; Wang, S.; Ma, S.; Guo, Z.; Gao, W. Predictive generalized graph Fourier transform for attribute compression of dynamic point clouds. IEEE Trans. Circuits Syst. Video Technol. 2020, 31, 1968–1982. [Google Scholar] [CrossRef]
- De Queiroz, R.L.; Chou, P.A. Compression of 3D point clouds using a region-adaptive hierarchical transform. IEEE Trans. Image Process. 2016, 25, 3947–3956. [Google Scholar] [CrossRef]
- Zhang, C.; Florêncio, D. Analyzing the optimality of predictive transform coding using graph-based models. IEEE Signal Process. Lett. 2012, 20, 106–109. [Google Scholar] [CrossRef]
- Dong, X.; Thanou, D.; Frossard, P.; Vandergheynst, P. Learning Laplacian matrix in smooth graph signal representations. IEEE Trans. Signal Process. 2016, 64, 6160–6173. [Google Scholar] [CrossRef]
- Kalofolias, V. How to learn a graph from smooth signals. In Proceedings of the Artificial intelligence and statistics, PMLR, Cadiz, Spain, 9–11 May 2016; pp. 920–929. [Google Scholar]
- Hu, W.; Gao, X.; Cheung, G.; Guo, Z. Feature graph learning for 3D point cloud denoising. IEEE Trans. Signal Process. 2020, 68, 2841–2856. [Google Scholar] [CrossRef]
- ISO/IEC JTC 1/SC 29/WG 7; G-PCC Performance Evaluation and Anchor Results. MPEG: Geneva, Switzerland, 2023.
- Sharma, G.; Wu, W.; Dalal, E.N. The CIEDE2000 color-difference formula: Implementation notes, supplementary test data, and mathematical observations. Color Res. Appl. 2005, 30, 21–30. [Google Scholar] [CrossRef]
- Hu, W.; Cheung, G.; Ortega, A.; Au, O.C. Multiresolution graph fourier transform for compression of piecewise smooth images. IEEE Trans. Image Process. 2014, 24, 419–433. [Google Scholar] [CrossRef] [PubMed]
- Kalofolias, V.; Perraudin, N. Large Scale Graph Learning From Smooth Signals. In Proceedings of the 7th International Conference on Learning Representations, ICLR, New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
- Komodakis, N.; Pesquet, J.C. Playing with duality: An overview of recent primal? dual approaches for solving large-scale optimization problems. IEEE Signal Process. Mag. 2015, 32, 31–54. [Google Scholar] [CrossRef]
- ISO/IEC JTC2/SC29/WG7 MPEG Output Document N00650; Common Test Conditions for G-PCC. MPEG: Geneva, Switzerland, 2023.
- Bjontegaard, G. Calculation of Average PSNR Differences between RD-Curves. ITU SG16 Doc. VCEG-M33. 2001. Available online: https://www.itu.int/wf-tp3/av-arch/video-site/0104_Aus/VCEG-M33.doc (accessed on 5 May 2024).
3D Geometry Location in X Y Z Axis | Hilbert Code | ||
---|---|---|---|
0 | 0 | 0 | ( 0 0 0 ) |
0 | 0 | 1 | ( 0 0 1 ) |
0 | 1 | 0 | ( 0 1 1 ) |
0 | 1 | 1 | ( 0 1 0 ) |
1 | 0 | 0 | ( 1 1 1 ) |
1 | 0 | 1 | ( 1 1 0 ) |
1 | 1 | 0 | ( 1 0 0 ) |
1 | 1 | 1 | ( 1 0 1 ) |
Category | Sequence | Abbreviation | Total Point Number | Geometry Precision | Attribute Type |
---|---|---|---|---|---|
Solid | Basketball_player_vox11_00000200 | Basketball | 2,925,514 | 11 | R, G, B |
Dancer_vox11_00000001 | Dancer | 2,592,758 | 11 | R, G, B | |
Longdress_vox10_1300 | Longdress | 857,966 | 10 | R, G, B | |
Loot_vox10_1200 | Loot | 805,285 | 10 | R, G, B | |
Redandblack_vox10_1550 | Redandblack | 757,691 | 10 | R, G, B | |
Soldier_vox10_0690 | Soldier | 1,089,091 | 10 | R, G, B | |
Thaidancer_viewdep_vox12 | Thaidancer_v | 3,130,215 | 12 | R, G, B | |
Queen_0200 | Queen | 1,000,993 | 10 | R, G, B | |
Facade_00064_vox11 | Façade | 4,061,755 | 11 | R, G, B | |
Dense | Longdress_viewdep_vox12 | Longdress_v | 3,096,122 | 12 | R, G, B |
Loot_viewdep_vox12 | Loot_v | 3,017,285 | 12 | R, G, B | |
Redandblack_viewdep_vox12 | Redandblack_v | 2,770,567 | 12 | R, G, B | |
Soldier_viewdep_vox12 | Soldier_v | 4,001,754 | 12 | R, G, B | |
Boxer_viewdep_vox12 | Boxer_v | 3,493,085 | 12 | R, G, B |
Category | Sequence | G-PCC PLT [17] | G-PCC RAHT [17] | BAAC [18] | ||||||
---|---|---|---|---|---|---|---|---|---|---|
Y | U | V | Y | U | V | Y | U | V | ||
Solid | Basketball | −11.95% | −13.73% | −32.23% | −25.25% | −10.36% | −24.22% | −3.08% | −3.11% | −3.91% |
Dancer | −14.68% | −12.37% | −29.03% | −22.80% | −15.10% | −26.73% | −2.70% | −2.74% | −3.24% | |
Longdress | −12.37% | −15.28% | −13.87% | −17.59% | −12.86% | −12.62% | −6.69% | −7.97% | −7.80% | |
Loot | −24.58% | −68.73% | −53.06% | −36.50% | −58.13% | −52.70% | −8.98% | −12.79% | −11.95% | |
Redandblack | −16.40% | −21.87% | −19.32% | −24.89% | −19.02% | −18.30% | −6.53% | −7.25% | −6.95% | |
Soldier | −17.45% | −45.58% | −38.75% | −36.86% | −55.31% | −47.56% | −8.43% | −11.97% | −11.98% | |
Thaidancer_v | −18.13% | −31.55% | −31.42% | −34.21% | −33.40% | −34.17% | −24.23% | −27.91% | −28.47% | |
Queen | −14.24% | −24.56% | −28.79% | −28.95% | −19.91% | −20.80% | −3.36% | −4.01% | −4.21% | |
Facade | −13.10% | −17.09% | −18.85% | −23.31% | −14.72% | −16.73% | −2.52% | −2.98% | −3.16% | |
Dense | Longdress_v | −11.64% | −15.73% | −13.60% | −16.62% | −14.26% | −13.83% | −28.97% | −28.68% | −28.16% |
Loot_v | −23.58% | −42.83% | −38.52% | −55.02% | −49.73% | −52.81% | −25.76% | −46.07% | −54.77% | |
Redandblack_v | −14.44% | −24.04% | −19.40% | −31.30% | −25.57% | −21.46% | −39.41% | −41.84% | −43.12% | |
Soldier_v | −15.28% | −39.10% | −35.22% | −39.88% | −44.06% | −51.38% | −32.42% | −34.62% | −41.93% | |
Boxer_v | −13.52% | −51.47% | −41.99% | −41.12% | −57.49% | −56.59% | −34.31% | −42.37% | −36.54% | |
Average Results | −15.81% | −30.28% | −29.57% | −31.02% | −30.71% | −32.14% | −16.24% | −19.59% | −20.44% |
Category | Sequence | k = 5 | k = 10 | k = 15 | ||||||
---|---|---|---|---|---|---|---|---|---|---|
Y | U | V | Y | U | V | Y | U | V | ||
Solid | Basketball | 1.95% | 1.96% | 2.60% | −0.43% | −0.49% | −0.54% | −0.53% | −0.54% | −0.69% |
Dancer | 1.98% | 1.97% | 2.72% | −0.42% | −0.54% | −0.58% | −0.53% | −0.67% | −0.71% | |
Longdress | 3.05% | 4.18% | 4.02% | −0.93% | −1.13% | −1.09% | −1.10% | −1.31% | −1.29% | |
Loot | 4.77% | 6.22% | 6.19% | −1.24% | −0.92% | −0.99% | −1.51% | −1.18% | −1.29% | |
Redandblack | 3.44% | 3.64% | 4.21% | −1.18% | −1.38% | −1.37% | −1.27% | −1.42% | −1.46% | |
Soldier | 1.61% | 2.31% | 2.26% | −1.30% | −0.92% | −0.88% | −1.63% | −1.16% | −1.18% | |
Thaidancer_v | 11.17% | 16.41% | 16.82% | −3.12% | −4.39% | −4.55% | −3.11% | −4.37% | −4.54% | |
Queen | 4.02% | 4.58% | 5.08% | −0.36% | −0.51% | −0.53% | −0.73% | −0.84% | −0.80% | |
Facade | 1.34% | 1.93% | 2.18% | −0.44% | −0.60% | −0.58% | −0.35% | −0.57% | −0.52% | |
Dense | Longdress_v | 10.61% | 14.00% | 13.10% | 8.30% | 10.87% | 10.16% | 8.30% | 10.87% | 10.16% |
Loot_v | 17.12% | 20.64% | 19.98% | 12.63% | 13.77% | 13.48% | 12.63% | 13.77% | 13.48% | |
Redandblack_v | 9.47% | 12.05% | 10.73% | 7.01% | 9.05% | 7.93% | 7.02% | 9.05% | 7.94% | |
Soldier_v | 12.59% | 16.56% | 16.06% | 9.74% | 11.92% | 11.49% | 9.74% | 11.92% | 11.49% | |
Boxer_v | 7.95% | 8.99% | 8.91% | 5.34% | 5.34% | 5.16% | 5.35% | 5.35% | 5.18% | |
Average Results | 6.50% | 8.25% | 8.20% | 2.40% | 2.86% | 2.65% | 2.31% | 2.78% | 2.56% |
Category | Sequence | = 0.01 | = 0.1 | = 1 | ||||||
---|---|---|---|---|---|---|---|---|---|---|
Y | U | V | Y | U | V | Y | U | V | ||
Solid | Basketball | 2.68% | 2.47% | 3.08% | 0.59% | 0.51% | 1.17% | 1.27% | 1.29% | 1.69% |
Dancer | 2.92% | 2.70% | 3.73% | 2.71% | 2.52% | 3.40% | 1.31% | 1.30% | 1.81% | |
Longdress | 8.25% | 11.55% | 11.15% | 7.28% | 10.20% | 9.85% | 2.95% | 3.93% | 3.80% | |
Loot | 11.94% | 11.96% | 11.92% | 10.58% | 10.74% | 10.81% | 4.54% | 6.15% | 5.88% | |
Redandblack | 8.80% | 9.54% | 10.60% | 7.87% | 8.47% | 9.59% | 3.07% | 3.25% | 3.87% | |
Soldier | 12.75% | 12.82% | 12.47% | 11.39% | 11.53% | 11.27% | 4.53% | 5.53% | 5.53% | |
Thaidancer_v | 12.33% | 18.32% | 18.25% | 10.99% | 16.33% | 16.27% | 11.61% | 16.91% | 17.46% | |
Queen | 3.32% | 4.48% | 4.67% | 2.39% | 3.32% | 3.56% | 0.76% | 0.46% | 0.80% | |
Facade | 1.16% | 2.03% | 2.02% | 0.96% | 1.68% | 1.69% | 0.35% | 0.50% | 0.64% | |
Dense | Longdress_v | 0.19% | 0.21% | 0.25% | −1.61% | −1.99% | −1.89% | 60.96% | 71.45% | 69.42% |
Loot_v | −2.78% | −5.77% | −5.25% | −5.04% | −7.16% | −6.64% | 189.75% | 357.43% | 337.41% | |
Redandblack_v | 1.09% | 0.49% | 1.66% | −0.91% | −1.57% | −0.72% | 82.60% | 97.15% | 90.60% | |
Soldier_v | 1.28% | −1.71% | −1.93% | −1.35% | −3.63% | −3.77% | 93.64% | 154.82% | 165.47% | |
Boxer_v | −5.23% | −9.09% | −9.06% | −7.34% | −10.42% | −10.45% | 136.58% | 196.47% | 199.10% | |
Average Results | 4.19% | 4.29% | 4.54% | 2.75% | 2.89% | 3.15% | 42.42% | 65.47% | 64.53% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shao, Y.; Song, F.; Gao, W.; Liu, S.; Li, G. Texture-Guided Graph Transform Optimization for Point Cloud Attribute Compression. Appl. Sci. 2024, 14, 4094. https://doi.org/10.3390/app14104094
Shao Y, Song F, Gao W, Liu S, Li G. Texture-Guided Graph Transform Optimization for Point Cloud Attribute Compression. Applied Sciences. 2024; 14(10):4094. https://doi.org/10.3390/app14104094
Chicago/Turabian StyleShao, Yiting, Fei Song, Wei Gao, Shan Liu, and Ge Li. 2024. "Texture-Guided Graph Transform Optimization for Point Cloud Attribute Compression" Applied Sciences 14, no. 10: 4094. https://doi.org/10.3390/app14104094