A Linear Approach to Matching Cuboids in RGBD Images

Given a color image and depth map we match cuboidshaped objects in the scene. (a) and (b): The color image and aligned depth map from Kinect. (c): The cuboids detected by the proposed method and projected onto the color image. (d): The cuboids in the scene viewed from another perspective. These cuboids reveal important structures of the scene.

Abstract

We propose a novel linear method to match cuboids in indoor scenes using RGBD images from Kinect. Beyond depth maps, these cuboids reveal important structures of a scene. Instead of directly ﬁtting cuboids to 3D data, we ﬁrst construct cuboid candidates using superpixel pairs on a RGBD image, and then we optimize the conﬁguration of the cuboids to satisfy the global structure constraints. The optimal conﬁguration has low local matching costs, small object intersection and occlusion, and the cuboids tend to project to a large region in the image; the number of cuboids is optimized simultaneously. We formulate the multiple cuboid matching problem as a mixed integer linear program and solve the optimization efﬁciently with a branch and bound method. The optimization guarantees the global optimal solution. Our experiments on the Kinect RGBD images of a variety of indoor scenes show that our proposed method is efﬁcient, accurate and robust against object appearance variations, occlusions and strong clutter.

Paper

H. Jiang and J. Xiao.
A Linear Approach to Matching Cuboids in RGBD Images.
Proceedings of 26th IEEE Conference on Computer Vision and Pattern Recognition (CVPR2013)

Source Code

code.zip

Dataset

RGB-D cuboid Dataset

Poster

poster.pdf

Slides

slides.pptx

Bibtex

@inproceedings{RGBDcuboid,    
	Author    = {Hao Jiang and Jianxiong Xiao},    
	Title     = {A Linear Approach to Matching Cuboids in RGBD Images},    
	Booktitle = {26th IEEE Conference on Computer Vision and Pattern Recognition},    
	Year      = {2013},  
}

Acknowledgments

This research is supported by the U.S. NSF funding 1018641. Xiao is supported by Google U.S./Canada Ph.D. Fellowship.