Skip to main navigation Skip to search Skip to main content

Self-supervised Learning with Multi-view Rendering for 3D Point Cloud Analysis

  • Bach Tran
  • , Binh Son Hua
  • , Anh Tuan Tran
  • , Minh Hoai
  • Vin-AI Research

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Recently, great progress has been made in 3D deep learning with the emergence of deep neural networks specifically designed for 3D point clouds. These networks are often trained from scratch or from pre-trained models learned purely from point cloud data. Inspired by the success of deep learning in the image domain, we devise a novel pre-training technique for better model initialization by utilizing the multi-view rendering of the 3D data. Our pre-training is self-supervised by a local pixel/point level correspondence loss computed from perspective projection and a global image/point cloud level loss based on knowledge distillation, thus effectively improving upon popular point cloud networks, including PointNet, DGCNN and SR-UNet. These improved models outperform existing state-of-the-art methods on various datasets and downstream tasks. We also analyze the benefits of synthetic and real data for pre-training, and observe that pre-training on synthetic data is also useful for high-level downstream tasks. Code and pre-trained models are available at https://github.com/VinAIResearch/selfsup_pcd.git.

Original languageEnglish
Title of host publicationComputer Vision – ACCV 2022 - 16th Asian Conference on Computer Vision, Proceedings
EditorsLei Wang, Juergen Gall, Tat-Jun Chin, Imari Sato, Rama Chellappa
PublisherSpringer Science and Business Media Deutschland GmbH
Pages413-431
Number of pages19
ISBN (Print)9783031263187
DOIs
StatePublished - 2023
Event16th Asian Conference on Computer Vision, ACCV 2022 - Hybrid, Macao, China
Duration: Dec 4 2022Dec 8 2022

Publication series

NameLecture Notes in Computer Science
Volume13841 LNCS

Conference

Conference16th Asian Conference on Computer Vision, ACCV 2022
Country/TerritoryChina
CityHybrid, Macao
Period12/4/2212/8/22

Keywords

  • 3D deep learning
  • Multiple-view rendering
  • Point cloud analysis
  • Self-supervised learning

Fingerprint

Dive into the research topics of 'Self-supervised Learning with Multi-view Rendering for 3D Point Cloud Analysis'. Together they form a unique fingerprint.

Cite this