PSMNet: Position-aware Stereo Merging Network for Room Layout Estimation

IEEE Conference on Computer Vision and Pattern Recognition (2022)

Abstract
In this paper, we propose a new deep learning-based method for estimating room layout given a pair of 360° panoramas. Our system, called Position-aware Stereo Merging Network or PSMNet, is an end-to-end joint layout-pose estimator. PSMNet consists of a Stereo Pano Pose (SP²) transformer and a novel Cross-Perspective Projection (CP²) layer. The stereo-view SP² transformer is used to implicitly infer correspondences between views, and can handle noisy poses. The pose-aware CP² layer is designed to render features from the adjacent view to the anchor (reference) view, in order to perform view fusion and estimate the visible layout. Our experiments and analysis validate our method, which significantly outperforms the state-of-the-art layout estimators, especially for large and complex room spaces.
Keywords
3D from multi-view and sensors, Deep learning architectures and techniques