Better Monocular 3D Detectors with LiDAR from the Past
arxiv(2024)
摘要
Accurate 3D object detection is crucial to autonomous driving. Though
LiDAR-based detectors have achieved impressive performance, the high cost of
LiDAR sensors precludes their widespread adoption in affordable vehicles.
Camera-based detectors are cheaper alternatives but often suffer inferior
performance compared to their LiDAR-based counterparts due to inherent depth
ambiguities in images. In this work, we seek to improve monocular 3D detectors
by leveraging unlabeled historical LiDAR data. Specifically, at inference time,
we assume that the camera-based detectors have access to multiple unlabeled
LiDAR scans from past traversals at locations of interest (potentially from
other high-end vehicles equipped with LiDAR sensors). Under this setup, we
proposed a novel, simple, and end-to-end trainable framework, termed
AsyncDepth, to effectively extract relevant features from asynchronous LiDAR
traversals of the same location for monocular 3D detectors. We show consistent
and significant performance gain (up to 9 AP) across multiple state-of-the-art
models and datasets with a negligible additional latency of 9.66 ms and a small
storage cost.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要