Automation of searching for terms in the explanatory dictionary

Authors

DOI:

https://doi.org/10.15276/opu.3.62.2020.11

Keywords:

interpretation of the term, electronic dictionary, system of labels, mathematical model of the explanatory dictionary, dictionary of the subject area

Abstract

This paper proposes an approach for automating the search for interpretations of terms from a specific domain in explanatory dictionaries and on the Internet. A mathematical model of the explanatory dictionary is developed, based on the structure of the dictionary entry. A methodology is developed for configuring a dictionary entry analyzer for a dictionary that has not been used before. A methodology for the automated search for one-word and multiword terms in electronic dictionaries is also developed; it scans dictionary entries for the term using regular expressions. Automating the search on an Internet resource with the browser automation tool Selenium is proposed. Two methods are developed for the automated analysis of search results according to subject area. If the dictionary entry contains a stylistic label indicating the area in which a polysemantic term is used, the results are corrected by filtering out definitions that do not correspond to the subject area. If there is no stylistic label, the search results are filtered by screening definitions for occurrences of the search terms. The creation of a Dictionary bank for storing the settings used to search electronic dictionaries is proposed. A software product was developed that supports searching in added and built-in electronic dictionaries and on an Internet resource. Working with the program requires the involvement of an expert to correct and verify the results. The effectiveness of the approach is confirmed experimentally using groups of English, Russian and Ukrainian terms from different subject areas. Formulas for the time spent on searching are proposed to assess the effectiveness of the developed methods. The results showed that the time spent on searching in the automated mode was reduced by about a factor of 5 compared with the manual mode. It is shown that adding an explanatory dictionary of a specialized subject area gives the most definite interpretation of terms during the search.
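
The two ideas described in the abstract, finding one-word and multiword terms with regular expressions and then filtering definitions by subject area, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Sense` and `Entry` classes, the function names, the sample entries, and the label `biol.` are all hypothetical, and the sketch assumes the dictionary entries have already been parsed into headwords and labelled senses. The fallback filter reads "occurrence of search terms" as "the definition mentions at least one other term of the domain", which is an assumption about the method.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Sense:
    label: Optional[str]  # stylistic label such as "biol.", or None if absent
    definition: str


@dataclass
class Entry:
    headword: str
    senses: List[Sense]


def term_pattern(term: str) -> re.Pattern:
    """Build a case-insensitive pattern for a one-word or multiword term.

    Words are escaped and joined with \\s+ so that any run of whitespace
    between the words of a multiword term is accepted.
    """
    words = [re.escape(word) for word in term.split()]
    return re.compile(r"(?<!\w)" + r"\s+".join(words) + r"(?!\w)", re.IGNORECASE)


def find_entries(entries: List[Entry], term: str) -> List[Entry]:
    """Return the entries whose headword is exactly the searched term."""
    pattern = term_pattern(term)
    return [e for e in entries if pattern.fullmatch(e.headword.strip())]


def filter_senses(entry: Entry, domain_label: str, domain_terms: List[str]) -> List[Sense]:
    """Keep only the senses that appear to belong to the subject area.

    If the entry carries stylistic labels, keep the senses whose label equals
    domain_label. Otherwise keep the senses whose definition mentions at
    least one of domain_terms (assumed reading of the label-free method).
    """
    if any(sense.label for sense in entry.senses):
        return [s for s in entry.senses if s.label == domain_label]
    patterns = [term_pattern(t) for t in domain_terms]
    return [s for s in entry.senses if any(p.search(s.definition) for p in patterns)]


# Hypothetical, hand-written sample data (not taken from any real dictionary).
SAMPLE = [
    Entry("cell", [
        Sense("biol.", "The smallest structural unit of a living organism."),
        Sense(None, "A small room in a prison."),
    ]),
    Entry("stem cell", [
        Sense("biol.", "An undifferentiated cell that can give rise to other cell types."),
    ]),
    Entry("vector", [
        Sense(None, "A quantity having both magnitude and direction."),
        Sense(None, "An organism that transmits a pathogen."),
        Sense(None, "A DNA molecule used to carry genetic material into a cell."),
    ]),
]
```

With this sample, `find_entries(SAMPLE, "stem cell")` selects the multiword entry, `filter_senses` on the `cell` entry uses the label, and on the unlabelled `vector` entry it falls back to domain-term occurrence. As the abstract notes, results of this kind would still need an expert to correct and verify them.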

Author Biography

Alexey B. Kungurtsev, Odessa Polytechnic National University

PhD, Prof.

Published

2020-11-12

How to Cite

[1]
Kungurtsev, A.B., Novikova, N. and Kozhushan, M. 2020. Automation of searching for terms in the explanatory dictionary. Proceedings of Odessa Polytechnic University. 3(62) (Nov. 2020), 91–100. DOI: https://doi.org/10.15276/opu.3.62.2020.11.

Issue

Section

Information technology. Automation