Automatic detection of text from natural scenes. A semantic descriptor for content based image retrieval

Retornaz, Thomas (2007) Automatic detection of text from natural scenes. A semantic descriptor for content based image retrieval. PhD thesis Morphologie Mathématique, CMM- Centre de morphologie mathématique, ENSMP p.231.

Full text available as:

- ThesisRetornaz_Final.pdf ( 62485 Kb )
Licence: Copyright

Abstract

Multimedia databases, both personal and professional, are growing continuously, and automatic solutions for managing them have become a necessity. The effort devoted by the research community to content-based image indexing is also growing, but the semantic gap is difficult to cross: the low-level descriptors used for indexing are not efficient enough for ergonomic manipulation of large, generic image databases. The text present in a scene is usually linked to the semantic context of the image and constitutes a relevant descriptor for content-based image indexing.

In this thesis we present an approach to the automatic detection of text in natural scenes that aims to handle text of different sizes, orientations, and backgrounds. The system uses a nonlinear scale space based on the ultimate opening operator (a morphological numerical residue). In a first step, we study the behaviour of this operator on real images and propose solutions to overcome its intrinsic limitations. In a second step, the operator is used in a text detection framework that also includes various text categorisation tools.

The robustness of our approach is demonstrated on two different datasets. First, we took part in the ImagEval evaluation campaign, where our approach was ranked first in the text localisation contest. Second, we produced results (using the same framework) on the free ICDAR dataset; the results obtained are comparable with the state of the art. Lastly, a demonstrator was built for EADS. For confidentiality reasons, that work could not be included in this manuscript.
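As a rough illustration of the ultimate opening named in the abstract, the sketch below applies it to a 1-D signal. It is not the implementation used in the thesis: the function names `opening_1d` and `ultimate_opening`, the use of flat segments as structuring elements, and the convention of recording the size of the opening that removes a structure are all assumptions made for this example. For each sample it keeps the largest difference between two successive openings (the residue) and the opening size at which that difference occurs.

```python
def opening_1d(signal, size):
    """Morphological opening of a 1-D signal by a flat segment of `size` samples."""
    n = len(signal)
    # Erosion: minimum over every window of `size` samples that fits in the signal.
    eroded = [min(signal[i:i + size]) for i in range(n - size + 1)]
    opened = []
    for j in range(n):
        # Dilation: maximum of the eroded values whose window covers sample j.
        lo = max(0, j - size + 1)
        hi = min(j, n - size)
        opened.append(max(eroded[lo:hi + 1]))
    return opened


def ultimate_opening(signal, max_size):
    """Return (residue, scale) for openings of size 1..max_size (hypothetical helper).

    residue[i] is the largest drop between two successive openings at sample i;
    scale[i] is the opening size that caused it (0 where nothing is removed).
    """
    n = len(signal)
    if not 2 <= max_size <= n:
        raise ValueError("max_size must lie between 2 and len(signal)")
    residue = [0] * n
    scale = [0] * n
    previous = list(signal)  # an opening of size 1 leaves the signal unchanged
    for size in range(2, max_size + 1):
        current = opening_1d(signal, size)
        for i in range(n):
            r = previous[i] - current[i]
            # On ties, keep the larger size (assumed convention).
            if r > 0 and r >= residue[i]:
                residue[i] = r
                scale[i] = size
        previous = current
    return residue, scale


# A bump of width 3 and height 5, plus a spike of width 1 and height 9.
signal = [0, 0, 5, 5, 5, 0, 0, 9, 0, 0]
residue, scale = ultimate_opening(signal, max_size=5)
print(residue)
print(scale)
```

In this toy signal each structure is reported with its full contrast and with the size of the first opening that removes it, which is the scale-selection property that makes the operator useful for locating text-like regions of unknown size.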

Item Type: PhD Thesis (PhD)
PhD Supervisor: Marcotegui Iturmendi, Beatriz
Date: 23 October 2007
Board of examiners: Jeulin, Dominique; Becker, Jean-Marie; Jolion, Jean-Michel; Cord, Matthieu; Kruger, Jorg; Reithler, Livier; Marcotegui Iturmendi, Beatriz
Doctoral School: ED 431 INFORMATION, COMMUNICATION, MODELISATION ET SIMULATION
Discipline: Morphologie Mathématique
Collection (Fonds): Mines ParisTech (ENSMP)
Institution: ENSMP
Department: CMM - Centre de morphologie mathématique
Subjects: 2. Information and Communication Sciences and Technologies; 1. Mathematics and Applications
Uncontrolled Keywords: Image databank, Banque image, Image processing, Traitement image, Automatic indexing, Indexation automatique, Numerical analysis, Analyse numérique, Descriptor, Descripteur
ID Code: 3782
Deposited By: Claudine Abauzit
Deposited On: 02 June 2008

