Human and Machine Perception:
Emergence, Attention and Creativity

Pavia, September 14 - 17, 1998
HMP98 Home Page

ALERTING MECHANISMS IN ATTENTIVE SYSTEMS

Giovanni Garibotto
e-mail: Giovanni.Garibotto@elsag.it

Attentive Vision represents one of the dominant paradigms in applied Computer Vision. Beside representing a natural extension of biological vision, it has been proved to be the most effective approach to solve real industrial problems. Looking for interesting points and areas in the scene is a mixture of a bottom-up process (driven by relevant perceptual features) and a top-down scheme (driven by goals and tasks to be solved).
The paper will refer some recent practical experience of this approach in order to establish a basis of discussion.
Two deeply different areas will be considered, namely that of reading technologies and that of mobile robotics, to demonstrate how close is the vision requirement no matter how far could appear the application fields.
In both cases the main requirement is the hierarchical and progressive identification of the interesting fields where to look closely in. When reading or understanding a mail piece (either a letter or a flat or a parcel) it is necessary to detect, distinguish and recognize very different features and items (a bar-code, a stamp or mail priorities, the destination address, the address of the sender, advertisement messages, possible order-mails, etc.). Actually the understanding of the mail information is becoming far more complex than just OCR reading. Context information plays a fundamental role in this game, both in telling apart the different meaning of the available information as well as in improving significantly the performance of the specific recognition task.
In mobile robotics the use of artificial visual landmarks (as traffic signs, special geometric marks and lanes on the navigation floor) has proved to be an engineering efficient approach to autonomous navigation control. The possibility to use existing natural features already present in the scene to be detected and tracked in a sequence of images is a great challenge to extend this autonomous guidance capability.
In both cases the identification of features to be tracked is a goal-driven task which is based on alerting mechanisms acting as a focus of attention for the following Computer Vision task.
This can be highly conditioned by the a-priori knowledge of the expected 3D model (like the known geometry of the landmark, its symmetry properties, its perspective and invariant properties, if available).
There are still a lot of unresolved problems and open issues in this approach, as far as control mechanisms are concerned (centralised vs distributed), the most appropriate use of visual models (high level models or low-level features), the use of feedback in multiple resolution schemes.
The paper will address some of these questions in the framework of applied computer vision with a series of concrete industrial examples, to stimulate a discussion on this matter.

URL: http://vision.unipv.it/hmp98
e-mail: hmp@ipvvis.unipv.it