Residential College | false |
Status | 即將出版Forthcoming |
Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene | |
Zhang, Ruiyang1; Zhang, Hu2; Yu, Hang3; Zheng, Zhedong1![]() ![]() | |
2025 | |
Conference Name | 18th European Conference on Computer Vision, ECCV 2024 |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
![]() |
Volume | 15069 LNCS |
Pages | 249-266 |
Conference Date | 29 September 2024 to 4 October 2024 |
Conference Place | Milan; Italy |
Publisher | Springer Science and Business Media Deutschland GmbH |
Abstract | The unsupervised 3D object detection is to accurately detect objects in unstructured environments with no explicit supervisory signals. This task, given sparse LiDAR point clouds, often results in compromised performance for detecting distant or small objects due to the inherent sparsity and limited spatial resolution. In this paper, we are among the early attempts to integrate LiDAR data with 2D images for unsupervised 3D detection and introduce a new method, dubbed LiDAR-2D Self-paced Learning (LiSe). We argue that RGB images serve as a valuable complement to LiDAR data, offering precise 2D localization cues, particularly when scarce LiDAR points are available for certain objects. Considering the unique characteristics of both modalities, our framework devises a self-paced learning pipeline that incorporates adaptive sampling and weak model aggregation strategies. The adaptive sampling strategy dynamically tunes the distribution of pseudo labels during training, countering the tendency of models to overfit easily detected samples, such as nearby and large-sized objects. By doing so, it ensures a balanced learning trajectory across varying object scales and distances. The weak model aggregation component consolidates the strengths of models trained under different pseudo label distributions, culminating in a robust and powerful final model. Experimental evaluations validate the efficacy of our proposed LiSe method, manifesting significant improvements of +7.1% AP and +3.4% AP on nuScenes, and +8.3% AP and +7.4% AP on Lyft compared to existing techniques. |
Keyword | 2d scene Understanding Self-paced Learning Unsupervised 3d Object Detection Unsupervised Learning |
DOI | 10.1007/978-3-031-73247-8_15 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Interdisciplinary Applications ; Computer Science, Theory & Methods |
WOS ID | WOS:001353688700015 |
Scopus ID | 2-s2.0-85209988423 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Zheng, Zhedong |
Affiliation | 1.FST and ICI, University of Macau, Macao 2.CSIRO Data61, Sydney, Australia 3.Shanghai University, Shanghai, China |
First Author Affilication | Faculty of Science and Technology |
Corresponding Author Affilication | Faculty of Science and Technology |
Recommended Citation GB/T 7714 | Zhang, Ruiyang,Zhang, Hu,Yu, Hang,et al. Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene[C]:Springer Science and Business Media Deutschland GmbH, 2025, 249-266. |
APA | Zhang, Ruiyang., Zhang, Hu., Yu, Hang., & Zheng, Zhedong (2025). Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 15069 LNCS, 249-266. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment