Residential College | false |
Status | 即將出版Forthcoming |
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | |
Li, Chunliang1; Han, Wencheng2; Yin, Junbo1; Zhao, Sanyuan1; Shen, Jianbing2 | |
2025 | |
Conference Name | 18th European Conference on Computer Vision, ECCV 2024 |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
![]() |
Volume | 15090 LNCS |
Pages | 273-292 |
Conference Date | 29 September 2024 to 4 October 2024 |
Conference Place | Milan; Italy |
Abstract | Concurrent processing of multiple autonomous driving 3D perception tasks within the same spatiotemporal scene poses a significant challenge, in particular due to the computational inefficiencies and feature competition between tasks when using traditional multi-task learning approaches. This paper addresses these issues by proposing a novel unified representation, RepVF, which harmonizes the representation of various perception tasks such as 3D object detection and 3D lane detection within a single framework. RepVF characterizes the structure of different targets in the scene through a vector field, enabling a single-head, multi-task learning model that significantly reduces computational redundancy and feature competition. Building upon RepVF, we introduce RFTR, a network designed to exploit the inherent connections between different tasks by utilizing a hierarchical structure of queries that implicitly model the relationships both between and within tasks. This approach eliminates the need for task-specific heads and parameters, fundamentally reducing the conflicts inherent in traditional multi-task learning paradigms.We validate our approach by combining labels from the OpenLane dataset with the Waymo Open dataset. Our work presents a significant advancement in the efficiency and effectiveness of multi-task perception in autonomous driving, offering a new perspective on handling multiple 3D perception tasks synchronously and in parallel. The code will be available at: https://github.com/jbji/RepVF. |
Keyword | 3d Lane Detection 3d Object Detection Multi-task Method. |
DOI | 10.1007/978-3-031-73411-3_16 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85210873033 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | University of Macau |
Affiliation | 1.School of Computer Science, Beijing Institute of Technology, Beijing, China 2.SKL-IOTSC, Computer and Information Science, University of Macau, Zhuhai, China |
Recommended Citation GB/T 7714 | Li, Chunliang,Han, Wencheng,Yin, Junbo,et al. RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception[C], 2025, 273-292. |
APA | Li, Chunliang., Han, Wencheng., Yin, Junbo., Zhao, Sanyuan., & Shen, Jianbing (2025). RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 15090 LNCS, 273-292. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment