Abstract:
«The analysis of methods and models of data integration on the basis of the ontological approach»
Content
1. A brief overview of the abstract
2. Conclusion
A brief overview of the abstract
Data integration involves integrating data from multiple sources and provides data to users in a unified manner. This process becomes essential as commercial tasks (when two similar companies need to merge their databases) and scientific (combining the results of the study on different bioinformatics repositories, for example). The role of data integration increases when the volume increases and the need to share data. This has been the focus of much theoretical work, but many problems remain unresolved.
By integrating data, there are two alternative approaches: the old - syntax, and the new - semantic. The first is based on the external similarity connecting data, the second to the content. For example, if we follow the first, connecting two regular tables, assuming that the temperature values are expressed in one scale, and if we could somehow arrange semantic storage, then could expect in the "High" are data and metadata, i.e. a physical quantity and an indication on what scale it measured, and have been able to combine data sets recorded and the Celsius and Fahrenheit. Semantic integration is based on knowledge and taking into account the nature of the data. Of course, data storage along with metadata creates additional challenges, but provides more convenience.
In the current economic conditions, a key element for the success of any company is the data, and how each company can manage. Business decisions that companies take today determine their ability to survive the current difficult economic conditions and to stay on the market. But the success of each of these solutions depends on timely, comprehensive and accurate data.
However, such data is no easy task. Data is typically scattered throughout the Corporation - in applications, databases on personal computers in the form of pdf files, spreadsheets, and text documents. In addition, some data may be located outside the Corporation - annexes "cloud computing" software-as-a-service providers (software as a service, SaaS) and business process outsourcing, as well as with trading partners. Obviously, a successful business requires the integration of all this data.
It is therefore a very promising development to date is the data representation by semantic links, namely on the basis of ontological methods. Further, the integration of data makes it easier, thereby reducing the cost of development and implementation, as well as supporting and operating software.
With regard to the issue of integration was actually quite some time to consolidate data from different sources. You create the files actually were part of the data, content, and how to use them are uncertain. Even you can create automated data storage, which has not withdrawn the consolidation of reporting tasks. Integration trends are provided with a number of fundamental IT-technologies that create the possibility of implementing business processes information based on data obtained from different sources and converted a whole. Realization of integration of heterogeneous information sources provides a competitive advantage to the company. While integration is not one-dimensional. A the concept of "integration of information resources" is defined multidimensional and, moreover, is ambiguous in one aspect of their consideration.
First of all, it is necessary to review the development of integration of information resources as a movement from the traditional syntactic "contextual" interpretation of data consolidation, which does not allow to provide the meaning to "intention," that is, semantic, semantic, allowing more efficient use of data programs that provide the functionality of the company's information system. This is timely, because, in fact, let's talk about the prospects for implementing relevant today "technology knowledge management (Knowledge Management), which is defined as the set of technologies that support the processes of generation, transmission, processing and use of data within the company.
The meaning of semantic approach to the integration of information resources is to integrate the data and context data storage along with metadata, which would take into account the nature of the data. Emergence of technology integration is due; above all, the needs of users of corporate information systems that seek for decision making are not isolated events, and knowledge on the status of the managed object as a whole.
Conclusion
Ontological approach requires a description of the classes and relations between them. One of the important advantages of using ontologies is that these descriptions are not limited to one task, such as data replication. The same description are used to implement the integration of on-the-fly, the data store to retrieve the instances of classes, in order to ensure the quality of data for the restriction of access to resources, to realize the management of middle-layer and for security.
Product of semantic integration, one way or another, involves a work of integrating ontologies, which, in turn, describing the same or area which overlaps the interest can be significantly different levels of detail, as well as the approach to the conceptualization of the subject area. On this basis, to address the problem of semantic heterogeneity in a wide range of information sources should be as fully reflected their semantics by setting arbitrary ontologies and establish relations between different ontologies.
Semantic storage can also be used to Exchange data between agents to supplement their Semantic representations, generating new information resources that contain aggregate information. To achieve this goal, plan to define a special type of so-called cross intermediary tasks. Also, there is a need to develop common principles of interaction of agents, in order to be able not only to search for the user, but also joint information processing.
In the future it will also be necessary to formalize the basic procedures, such as search and registration agent, adding terms in the thesaurus. This will greatly facilitate the further development of prototype software system integration.
References
1. Сила платформы. Как платформа интеграции данных способна помочь IT-отделам снизить расходы, повысить эффективность и увеличить прибыль бизнеса // Informatica, май 2009 г.
2. Интеграционные платформы - современный способ наиболее эффективной интеграции данных [Электронный ресурс]. - Режим доступа: http://www.iso.ru/print/rus/document6250.phtml
3. Fusionsoft SemanticNet: Платформа семантической разработки и интеграции информационных систем. [Электронный ресурс]. - Режим доступа: http://fusionsoft-online.com/platform.php?lang=rus
4. Обзор SemanticNet. [Электронный ресурс]. - Режим доступа: http://fusionsoft-online.com/overview.php
5. Система интеграции структурированных данных c неограниченным масштабированием и семантическим интерфейсом доступа. [Электронный ресурс]. - Режим доступа: http://fusionsoft-online.com/articles-semantic-integration-system.php
6. Перспективы построения глобальной инфраструктуры объектной интеграции систем, дополняющей SOA в части управления данными. [Электронный ресурс]. - Режим доступа: http://fusionsoft-online.com/articles-global-object-integration.php
7. DataXtend. [Электронный ресурс]. - Режим доступа: http://www.progress-tech.ru/products/dataxtend/
8. Ломов П. А., Шишаев М. Г. Интеграция данных на основе онтологий для обеспечения информационной поддержки принятия решений. [Электронный ресурс]. - Режим доступа: http://crider.rork.ru/index.php?option=com_content&task=view&id=15&Itemid=31
9. Ломов П. А., Шишаев М. Г. Интеграция онтологий с использованием тезауруса для осуществления семантического поиска [Электронный ресурс]. - Режим доступа: http://www.jitcs.ru/index.php?option=com_content&view=article&id=264
10. Ломов П. А., Шишаев М. Г. Интеграция семантически связанных информационных ресурсов на основе онтологий [Электронный ресурс]. - Режим доступа: http://crider.rork.ru/index.php?option=com_content&task=view&id=14&Itemid=31
11. Ломов П.А., Путилов В.А., Маслобоев А.В. Поддержка интеллектуальности пользовательскогоинтерфейса системы распределенного семантическогопоиска: проблемы и решения. [Электронный ресурс]. - Режим доступа: http://vestnik.mstu.edu.ru/v13_3_n40/articles/09_maslo.pdf
12. Демченко В.В., Шевченко Д.А. Сравнение средств разработки знание-ориентированных систем для интеграции распределенных приложений. [Электронный ресурс]. - Режим доступа: http://www.nbuv.gov.ua/portal/natural/urss/2010_2/23-26Demchenko.pdf
13. Деркач А.В. Применение онтологии для автоматизации весового проектирования сложных технических объектов [Электронный ресурс]. - Режим доступа: http://www.nbuv.gov.ua/portal/natural/Kzms/2008/2008_st18.pdf
14. Берко А.H. Применение онтологий для семантической интеграции данных. [Электронный ресурс]. - Режим доступа: http://www.nbuv.gov.ua/portal/natural/Vnulp/Komp-nauky/2009_650/03.pdf
15. Вороной А.С. Виртуальная организация агентов для разработки онтологической базы знаний системы компьютерного обучения. [Электронный ресурс]. - Режим доступа: http://www.nbuv.gov.ua/portal/natural/ii/2010_3/AI_2010_3%5C7%5C00_Voronoy.pdf
16. Вороной А.С., Егошина А.А. Средства интеграции онтологий предметных областей для создания баз знаний интеллектуальных учебных сред. [Электронный ресурс]. - Режим доступа: http://www.nbuv.gov.ua/portal/natural/ii/2010_2/AI_2010_2%5C2%5C00_Voronoy_Egoshina.pdf