Русский | Українська | English DonNTU | Master's portal


Fedorov Anton

Fedorov Anton

Faculty: Computer sciences and technologies
Speciality: Software for automated systems
Theme of master's work:

"Investigation of contour segmentation methods for constructing optical character recognition system"

Scientific advisor: Fedyaev Oleg Ivanovich

Abstract

Biography

Abstract

:: Introduction ::

State-of-the-art of computer technologies’ development allows to apply recognition systems in a wide range of areas, including the field of automatic character recognition [1]. The character recognition task is still actual since she is necessary for solving at identification of cars license plates, text reading etc.


:: Urgency of a theme ::

The urgency of the car numbers recognition task is that intensity of automobile movement and quantity of car accidents recently has increased. The developed system automatically watches traffic infringers and writes infringers’ car numbers in a database.


:: The purposes and problems ::

The dissertation work purpose is research and development of methods, algorithms and programs of recognition of signs, characters, digits and Russian characters. They provide the analysis and information processing on the picture.


:: Planned practical results ::

After the dissertation work termination efficient software product is planning to receive. It is intended for implantation in the traffic monitoring system.


:: The review of the researches and the developments of the theme ::

Throughout the world there are developments on such software systems. I would like to list the most successful of them:

  1. System "Auto-Inspector" – the hardware-software complex providing recognition of moving cars numbers.

  2. The system of cars’ numbers’ optical identification "Shtrih-M" System is intended for the automatic registration of cars’ movement, recognition of car numbers.

    Differences between my work and "Shtrih-M" is that "Shtrih-M" works in narrower conditions and is unstable to interferences of different sorts [7].

At the national level, readers of car numbers are developed by the Kiev corporation "Allan". They provide high-contrast images of car numbers within all spectrum of conditions of surrounding lighting.

Representatives of the local level:

  • Poltava Sergey Alexandrovich, "Pattern recognition" (a source: the newspaper "Computer-Inform"). Unlike my operation, in system the neural networks raising speed of recognition are not used.

  • Andrey Afanasenko, "The development of hybrid specialized System of processing of image on the basis of fuzzy neural systems" [10]. In Andrey's work it is used neural net in the pattern recognition task. However neural net architecture is not considered demanded for improvement of recognition quality.


:: Alleged scientific novelty ::

Usage of developed neuronet structures and training algorithms will provide a high speed of set characters’ recognition at the expense of calculations multisequencing.

Research and choice of the most suitable methods to a specific target of boundary segmentation, binarization etc. will allow to lower recognition conditions. It is very important in the car number recognition task.


:: The basic idea of work ::

The object of research of the dissertation work is the interactive program system of character recognition, it’s internal structure. Artificial neural networks lie at the heart of system.

The task of recognition of patterns or graphic samples belongs to the class of NP-tasks. It requires search of new solution techniques. At processing of the document image it is possible to select some stages – selection of the fields containing prospective graphic images, recognition of graphic images, recognition results checking. At each stage especial processing methods are used [1].

As primary sources of the information for licence plate are used, as a rule, video- and a photo-images. Thus on them can be objects of an any kind [5]. Therefore before the direct analysis of the represented subjects it is necessary to execute a number of the preliminary operations, allowing to receive the image of objects itself without extraneous images [6].

For an effective employment of that images different approaches to decomposition of data model are used, allowing to present the general model as set of hierarchically interconnected more simple models of a different level of hierarchy [4].

One of the most widespread methods of the decision of these problems is contour segmentation.

Methods of contrast segmentation are used in many areas where objects on analyzed images possess the big complexity that causes high requirements to reliability, accuracy and reliability of research results. Use of computer equipment and mathematical methods in this area allows not only to accelerate process of material processing, but also to raise accuracy of research results [4].

Fast development of digital technique recently opens new opportunities in realization of these methods. The speedup of computer equipment allows to use complex algorithms, and owing to appearance of high-resolution color television gauges it is possible to receive and process color images. New technical opportunities allow to expand considerably a circle of researches, open new ways of the problems decision, that regards the images analysis [2].

The most typical method of contrast segmentation: a method of the certain window and a method of chain codes.

Boundary is the contrast area of the image containing sharp distinction of brightness between two adjacent pixels (as a rule, it is object boundaries). There is a set of various methods of boundaries selection [8].

At a preparatory stage of the certain window method researchers find the areas containing necessary contrast (high or, on the contrary, low). Further the window is created on the assumption of the provisional sizes and the form of required object, and it is considered quantity of sides in "suspicious" areas. If it is in the set range - the object is allocated. The range of quantity of sides gets out experimentally.

The selected image of licence plate represents a two-dimensional monochrome signal. It is divided into set of areas (patterns) of real characters images by contour boundaries. Examples of patterns are shown on fig. 1.

Licence plate characters
Figure 1 – Licence plate characters

Properties of artificial neural networks allow to use them at a stage of selected licence plates recognition (fig. 2, 3).

Visualization of selection of licence plates' images
Figure 2 – Visualization of selection of licence plates' images. Animation (Ulead GIF Animator 5), 720x288 px, 111 Kb, 4 frames with delay 150 msec between frames; the quantity of playback cycles is limited 10th.
Binarized bit image of the entry character and desirable result of recognition – a character serial number
Figure 3 – Binarized bit image of the entry character and desirable result of recognition – a character serial number

However it is required to select type of a neural network and its architecture for qualitative solution of the task of recognition of digits and Russian characters. They should provide correct reading of the information from the image and it’s analysis for the purpose of automobile license plate identification even in the presence of interferences.

Within the limits of the carried out research matching of the most widespread sorts of neural networks from the point of view of recognition efficiency is fulfilled.

The analysis has shown that neural network with back propagation training algorithm and a Kohonen network give good results.

The research object is the architecture of a neural network with back propagation training algorithm (fig. 4).

Multilayered perceptron
Figure 4 - Multilayered perceptron

The analysis of the received results has shown that the three-layer neural network with back propagation training algorithm (15x15x41) with sigmoid activation function possesses the best recognizing ability for this class of printing characters.

Kohonen neural networks [3] are for visualization and the initial ("prospecting") analysis of the data, first of all [9].

During the analysis other modification in which we take more than one neuron-winner has been used, and it has allowed to reduce time of training and to raise quality of recognition.


:: Conclusion ::

On this stage analysis of the chosen subject area is completed, the technical requirements are formed. Database structure, which has been designed, satisfies all the needs. The selected software environment is the best suitable system for this kind of development. The development of software system, based on defined requirements, has been started.

The additional analysis of recognition methods on purpose maximization of recognition quality is in the long term planned. It is supposed to solve the task for the night photographing and video shooting too.

Also functionality addition is planned.


:: References ::

  1. Аль-Рашайда Хасан Хусейн. Исследование и разработка методов локализации, идентификациии и распознавания арабских символов (на примере номерного знака автомобиля). – СПб.: ЛЭТИ, 2008 – 18 с.

  2. Антощук С., Крилов В., Бабілунга О. Ієрархічна модель контурної сегментації зображень // Праці 8-ї Міжнародн. конф. «Оброблення сигналів і зображень (УкрОБРАЗ’2006)».- Київ: НАН України – Інститут кібернетики. - 2006. - С.109

  3. Головко В.А. Нейронные сети: обучение и применение. — М.: ИПРЖР, 2001.

  4. Гонсалес Р., Вудс Р. Цифровая обработка изображений. – М.: Техносфера, 2005. – 1072 с.

  5. Дуда Р., Харт П. «Распознавание образов и анализ сцен» - М.: Мир, 1976.

  6. Копитчук М.Б., Олещук О.В. Попередня обробка зображень // Праці 6-ї Міжнародн. конф. «Оброблення сигналів і зображень (УкрОБРАЗ’2002)».- Київ: НАН України – Інститут кібернетики. - 2002. - С.127-130.

  7. Уоссерман Д. Нейрокомпьютерная техника: Теория и практика. — М.: Мир, 1992.

  8. Методы компьютерной обработки изображений / под ред. Сойфера В.А. – 2-е изд., испр. – М.: ФИЗМАТЛИТ, 2003. – 784 с.

  9. Распознавание изображений (источник: газета «Компьютер-Информ») / Портал магистров ДонНТУ, — http://www.masters.donntu.ru/2006/fvti/poltava/library/article5.htm

  10. Афанасенко А.В., «Разработка гибридной специализированной системы распознавания образов на базе нечетких нейронных сетей». [Электронный ресурс] / Портал магистров ДонНТУ, — http://www.masters.donntu.ru/2003/kita/afanasenko/diss/index.htm


Biography | Abstract

© Фёдоров А.В., ДонНТУ, 2010