IDENTIFYING KEYWORDS ON THE BASIS OF CONTENT MONITORING METHOD IN UKRAINIAN TEXTS
DOI:
https://doi.org/10.15588/1607-3274-2016-1-9
Keywords:
text, Ukrainian, algorithm, content monitoring, keywords, content analysis, Porter stemmer, linguistic analysis, pars
Abstract
The paper addresses the development of algorithmic support for content monitoring processes in order to identify keywords in Ukrainian text. A formal justification of content monitoring in text using the Porter stemmer is considered. The stemming modification is based on known results in the classification of morpheme and word-formation structures of derivatives in the Ukrainian language, the identification of affix combinatorics patterns, the modeling of the structural organization of verbs and suffixal nouns, and morphonological modifications in verb inflection and in the word formation and inflection of adjectives. The method is decomposed, and algorithmic support and software are developed for its basic structural components of text content analysis. Theoretical means of improving the performance indicators of keyword search, including keyword density in the text, are identified. Using the developed software, results of experimental testing of the proposed content monitoring method for keyword identification in scientific texts of a technical profile are obtained. On the chosen experimental base of 100 works, the best results by the density criterion are achieved by analyzing an article without its initial mandatory information and without the reference list, but with specified blocked words and verification against a refining thematic dictionary.
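As a rough illustration of the density criterion mentioned in the abstract, the following minimal sketch stems each token, drops blocked words, and reports the share of remaining tokens that fall under each stem. The suffix list, the blocked-word set, and the names stem and keyword_density are hypothetical placeholders chosen for this example. They are not the authors' modified Porter stemmer, which relies on a much richer model of Ukrainian morphology.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny suffix list (assumption for illustration only);
# a real Ukrainian stemmer needs far more rules than this.
SUFFIXES = ("ами", "ями", "ів", "ах", "ях", "ом", "ою", "ий",
            "а", "и", "і", "у", "я", "о", "е")

# Hypothetical blocked-word (stop-word) set.
BLOCKED_WORDS = {"і", "та", "в", "у", "на", "з", "до", "що", "це", "для"}


def stem(word, min_stem=3):
    """Strip the longest matching suffix, keeping at least min_stem characters."""
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[:-len(suffix)]
    return word


def keyword_density(text, blocked=BLOCKED_WORDS, thematic=None, top_n=5):
    """Return up to top_n (stem, density) pairs, most frequent first.

    Density is the stem's count divided by the number of non-blocked tokens.
    If thematic is given (a set of stems), only stems in that set are reported.
    """
    # Runs of letters, optionally joined by an apostrophe (as in "комп'ютер").
    tokens = re.findall(r"[^\W\d_]+(?:['ʼ’][^\W\d_]+)*", text.lower())
    stems = [stem(t) for t in tokens if t not in blocked]
    total = len(stems)
    if total == 0:
        return []
    counts = Counter(stems)
    if thematic is not None:
        counts = Counter({s: c for s, c in counts.items() if s in thematic})
    return [(s, c / total) for s, c in counts.most_common(top_n)]


if __name__ == "__main__":
    sample = "Контент і аналіз контенту для моніторингу контенту"
    for s, d in keyword_density(sample):
        print(f"{s}: {d:.2f}")
```

Passing a set of stems as thematic loosely mimics the thematic dictionary check: stems outside the set are not reported, while the denominator still counts all non-blocked tokens.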
License
Copyright (c) 2016 O. V. Bisikalo, V. A. Vysotska
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Creative Commons Licensing Notifications in the Copyright Notices
The journal allows the authors to hold the copyright without restrictions and to retain publishing rights without restrictions.
The journal allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles.
The journal allows reuse and remixing of its content in accordance with the Creative Commons license CC BY-SA.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License CC BY-SA that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.