For decades, computer vision researchers have been trying to extract high level information from images. While the semantics of images is still unreachable from the signal in most real cases, users would like to express requests to image data bases using high-level queries. This gap between the user needs and the image processing capabilities will limit the use of image databases in the near to mid term future.
Meaningful applications, however, are already possible using existing scientific technology, for instance using query-by-example. The scalability of such applications stresses the need for: new indexing methods able to handle approximate measures from the image signal; approximate search methods that are efficient in high dimensional spaces; and robust search methods able to handle many partially erroneous data (outliers).
The tutorial will illustrate some limited answers to these open problems using invariant features, robust statistics and probabilistic matching. It will then focus on the long term goal of high level semantics extraction from images. This problem is as yet poorly defined: the semantics of an image is user dependent and nobody knows how to express it in a formal way. Some limited answers exist, however, and the tutorial will illustrate how learning mechanisms provide impressive initial results. Moreover learning can be linked to relevance feedback and therefore allows performing better user dependent search.
Dr. Roger Mohr (see his DB&LP entry) is a Professor of Computer Science at Institut National Polytechnique de Grenoble since 1988. He is chairing the Computer Science Engineering School since 2003. From 2000 to 2002, he joined Xerox Research Centre Europe where he led the Grenoble lab.
His research interests are mainly in the area of computer vision. Dr. Mohr has published over 120 publications, including papers in international journals such as IEEE-PAMI, Artificial Intelligence, IJCV, and Pattern Recognition. His major personal contributions are the optimal consistency algorithm for constrained problems, the introduction of projective geometry in computer vision, and image indexing through local invariant features. His present research interest is visual learning.
Dr. Mohr is member of the GRAVIR lab. Half of this lab activity is centred around vision and has a particular focus on visual learning, on human activity recognition, on three-dimensional perception with cameras and video interpretation. He is involved in many national, industrial and international projects, including the EU networks of excellence and the EU integrated projects. Roger Mohr is a Professor in Computer Science at Institut National Polytechnique de Grenoble since 1988. He is now chairing the Computer Science Engineering School since 2003. From 2000 to 2002, he joined joined Xerox Research Centre Europe where he led the Grenoble lab.