Abstract

Introduction

Popular purchases of goods through online stores. The user does not need to leave the house, but just go to the Internet, go to any online store and order any product.

Popularization of the sale of goods through the online store has led to an increase in the number of online stores. Such a policy has caused competition between online stores.

1. Actuality theme

The user sets himself the goal - to find the most suitable product for himself according to different criteria (price, characteristics, etc.), but the method of selection of this product is made by the user manually. As a result, one can highlight a number of shortcomings, such as:

To exclude these shortcomings, data collection systems from other online stores have been introduced. These systems eliminate two drawbacks: manual search on online stores and compare them with criteria (for example, sorting by price of one kind of product). But these systems have their disadvantages:

The main task of the system will be aimed at eliminating shortcomings in existing systems by optimizing the search for various user queries and issuing qualitative responses. Consider the task of searching and selecting goods.

2. The purpose and tasks of the study, the planned results

The user will log in. In the search bar, enter your request and confirm it. After that, the system will begin processing the request. It will accept the request (input), make a sample of goods and display a list of selected goods (output data) with minimal information (price, name, characteristics, etc.) and a link to the order of this product.

Based on the input and output data, the following algorithm for the search and selection of goods in the system is presented:

The user will log in. In the search box, it will indicate your request and confirm it. After that, the system will begin processing the request. It will accept the (input) request, sample the goods and display the list of selected products ( output ) with minimal information (price, name, characteristics, etc.) and a link to ordering this item.

Based on the input and output data, an algorithm for searching and selecting goods in the system is presented:

  1. Indexing the content of the site.
  2. Enter the request by the user and confirm.
  3. The system of the request excludes the official parts of the language
  4. The resulting string is split into an array of words translated into the base form
  5. The search for every word in the resulting array is carried out in the index,
  6. Search results are ranked, ordered and returned to the user.

Also, with insufficient or incomplete list of goods, the user can expand search options. Extend search options by adding a new restriction to your previously queried request. The system will re-execute the algorithm for a better answer. In order to improve search efficiency, the system maintains a statistical search log for users.

The purpose of this work is to increase the effectiveness of the information retrieval process, depending on the request, as well as to shorten the time for obtaining qualitative and accurate results of queries.

To solve this goal, it is necessary to solve a number of problems:

To solve the problems, it is necessary to analyze the existing methods, to highlight their advantages, disadvantages and choose the most promising ones.

CONCLUSION

As a result of a survey of topical methods and algorithms of fuzzy search, the dignity and non-compliance of these methods were noted.
Also, we select the indexing algorithms. They have a complex implementation, and also require a prepared dictionary for comparisons, but much faster perform comparisons of lines of large dimension, and more precisely determine the similarity of these lines. These algorithms with their modification are most suitable for realization of the tasks set within the studied area.
When writing this abstract the master's work is not completed yet. Final completion: June 2019. Full text of the work and materials on the topic can be obtained from the author or his manager after the specified date.

SOURCES

  1. Нечёткий поиск в тексте и словаре [Электронный ресурс]. – Режим доступа: https://sohabr.net/habr/post/1149...
  2. Алгоритмы приблизительного сравнения текста [Электронный ресурс]. – Режим доступа: http://alexandrerodichevski.chiappani...
  3. Вычисление редакционного расстояния [Электронный ресурс]. – Режим доступа: https://habr.com/post/117063/
  4. Comparison of Jaro-Winkler and Ratcliff/Obershelp algorithms in spell check [Электронный ресурс]. – Режим доступа: ib-extended-essay-ilovepdf-compressed.pdf
  5. Классификация современных алгоритмов нечеткого словарного поиска  [Электронный ресурс]. – Режим доступа: http://rcdl.ru/doc/2004/paper27.pdf
  6. Программаная реализация Алгоритма Левенштейна [Электронный ресурс]. – Режим доступа: https://moluch.ru/archive/19/1966/
  7. Вычисления Левенштейна  [Электронный ресурс]. – Режим доступа: https://docviewer.yandex.ua/view/0/...
  8. ВЫЯВЛЕНИЕ ДУБЛИКАТОВ В РАЗНОРОДНЫХ БИБЛИОГРАФИЧЕСКИХ ИСТОЧНИКАХ  [Электронный ресурс]. – Режим доступа: https://nsu.ru/xmlui/bitstream/handle/nsu/7131/10.pdf
  9. Применение алгоритмов нечеткого поиска в PHP  [Электронный ресурс]. – Режим доступа: https://habr.com/ru/post/115394/
  10. Індекс (бази даних)  [Электронный ресурс]. – Режим доступа: https://ru.wikipedia.org/wiki/Индекс_(базы_данных)