Home DE ES FR


Advanced Search

Our On-Line PhDs

Submit a Thesis
My Account Register Help

About
Fields
Mathematics and Applications
Information and Communication Sciences and Technologies
Physics, Optics
Materials Science, Mechanics and Mechanical Engineering
Fluid Mechanics and Energy
Chemistry, Physical Chemistry and Chemical Engineering
Life Sciences and Engineering
Earth Sciences and Environmental Engineering
Sciences of Economy, Management and Society
Modélisation et interprétation d'images à l'aide de graphes

Lerallut, Romain (2006) Modélisation et interprétation d'images à l'aide de graphes. PhD thesis Morphologie Mathématique, CMM- Centre de morphologie mathématique p.210.

Full text available as:

- These_lerallut.pdf ( 35060 Kb )
Licence: Copyright

Abstract

Intelligent analysis and comparison of images is one of the most dynamic topic of research in both

academia and industry. Describing and comparing images automatically is a critical issue for the full

development of the «information society»

Search engines working on textual data have dramatically proved their value. However, there is

currently no similar system for image-only data. One possible explanation is that we do not really

have a language made for describing images, thus meaningful comparisons are much more difficult

than in the case of text. Nevertheless, textual search engines have shown that it is not necessary that

the machines understand what they analyse to return good results. Simple syntactic analysis methods,

coupled with composition rules are enough to drive extremely efficient systems.

To enable machines to simulate the interpretation of images, it would be necessary to create descriptors

playing the role of words in text and composition rules making it possible to compare images

like search engines compare sentences.We already have at our disposal numerous methods to detected

automatically simple objects or regions in images, by their common color, their identical motion, etc.

Furthering the analogy, these objects could be seen as syllables. The difficulty lies in grouping them

to form words, then sentences, and compare them while being robust to perturbations.

To achieve this, we use graphs to store these objects and their relationships. These can be either of

a neighboring nature, or inclusion, which leads the graphs to be either planar graphs or trees. We will

see several methods to construct either type as well as their pros and cons.

In a first step, we have used the graph-matching algorithms developed by Cristina Gomila at the

end of her PhD thesis at the CMM (1998-2001). While working with the european project MASCOT

studying the use of «metadata» to enhance video coding, we have studied in detail the algorithm and

spotted its strengths and weaknesses.We first tested replacing the core of the matching algorithm by a

better one. This resulted in slight improvements in both quality and computation time. Then we tried

to reduce our sensitivity to variations in the segmentation process by using a spectral graph-matching

algorithm. Despite good results on simple images, our tests on harder images have not succeeded. To

improve our robustness with respect to the stability of the graphs, we then prefered working on the

source material : images.

The second step of this work was the development of image-base techniques to reduce the sensitivity

of our segmentation algorithms to noise and small variations. First, we developed a class of

adaptive filtering operatiors, the «morphological amoebas», which proved extremely effective in reducing

noise in image. Second, we created a robust color gradient operator that can detect contour

lines in noisy images. These two operators have improved sometimes spectacularly the stability of our

segmentations, hence that of our graphs and in the end the quality of the results.

The next step in this work has been the modeling of objects independently from the rest of the

image. This approach was motivated by realizing that in some scenarii the content of the image outvii



side some well-defined objects is not informative. We must thus analyse directly and as precisely as

possible the objects themselves. We first supposed that the segmentation of the outline of the objects

was a solved problem, and concentrated on creating a robust signature for each object. To get it, we

modified a watershed algorithm in order to perform a top-down resegmentation of a morphological

scalespace based on levelings. We used this resegmentation to build a robust tree of embedded regions,

and we defined a distance between those trees. We tested the whole process on a commonly

used database by the indexation community.

The last step was centered around applications. First by comparing the various approches presented

in this document, concentrating in particular on the speed versus robustness compromise.

Then we search for the best combination of techniques to build a videosurveillance application. In

particular, we developed fast and robust segmentation techniques for the project PS26-27 «Intelligent

Environment» in partnership with ST Microelectronics and the ORION group of the INRIA. This aim

of this project is to build a technology demonstrator for videosurveillance applied to the detection of

accidents in hospitals or at home. Our part of the work was the detection of the outline of people in

video sequences.

Finally, by coupling these detectors to our tree-based objects descriptors, we were able to define

robust signatures for people that could be use with great profit by automatic videosurveillance systems.

Item Type:PhD Thesis (PhD)
Thesis Supervisor:Meyer, Fernand
Date:13 September 2006
Board of examiners:Jeulin, Dominique and Keriven, Renaud and Van Droogenbroeck, Marc and Thonnat, Monique and Martin, Lionel and Meyer, Fernand
Discipline:Morphologie Mathématique
Collection (Fonds):ENSMP
Department:CMM- Centre de morphologie mathématique
Subjects:1. Mathematics and Applications
Uncontrolled Keywords:Traitement image, Bruit aléatoire, Théorie graphe, Surveillance électronique, Arbre graphe, Filtre morphologique
ID Code:3298
Deposited By:Claudine Abauzit
Deposited On:31 January 2008

Statistiques de consultation

Repository Staff Only: edit this item

© ParisTech 2007 - Réalisé par RILK.com - Graphisme par Winch Communication