Computer Science Colloquium

Integrated models of scenes and objects

Antonio Torralba
Massachusetts Institute of Technology

Monday, March 5th 11:15 a.m.
Room 1302 Warren Weaver Hall
251 Mercer Street
New York, NY 10012-1185

Colloquium Information:


Richard Cole, (212) 998-3119


Human scene understanding is remarkable: with only a brief glance at an image, an abundance of information is available - spatial layout, scene category, identity of main objects in the scene, etc. In traditional computer vision, scene and object recognition are two related visual tasks generally studied separately. By devising systems that solve these tasks in an integrated fashion it is possible to build more efficient and robust recognition systems. We argue that multi-object recognition systems should be based on models which consider the relationships between different object categories during the training process. This approach provides several benefits. At the lowest level, significant computational savings can be achieved if different categories share a common set of features. More importantly, jointly trained recognition systems can use similarities between object categories to their advantage by learning features which lead to better generalization. This inter-category regularization is particularly important when few training examples are available, as is common in many vision domains. In complex, natural scenes, object recognition systems can be further improved by using contextual knowledge about the objects likely to be found in a given scene, and common spatial relationships between those objects. I will describe how scene information can be used early during the visual processing in order to improve object detection and recognition.

Refreshments will be served

top | contact