THE THREE R’S OF COMPUTER VISION: RECOGNITION, RECONSTRUCTION AND REORGANIZATION

J. MALIK, P. ARBELAEZ, J. CARREIRA, K. FRAGKIADAKI, R. GIRSHICK, G. GKIOXARI, S. GUPTA, B. HARIHARAN, A. KAR AND S. TULSIANI

PATTERN RECOGNITION LETTERS, 2016

Abstract

We argue for the importance of the interaction between recognition, reconstruction and reorganization, and propose this interaction as a unifying framework for computer vision. In this view, recognition of objects is reciprocally linked to reorganization: bottom-up grouping processes generate candidates, which can be classified using top-down knowledge, after which the segmentations can be refined again. Recognition of 3D objects could benefit from a reconstruction of 3D structure, and 3D reconstruction can benefit from object category-specific priors. We also show that reconstruction of 3D structure from video data goes hand in hand with the reorganization of the scene. We demonstrate pipelined versions of two systems, one for RGB-D images and another for RGB images, which produce rich 3D scene interpretations in this framework.
