APPLICATION OF INFORMATION TECHNOLOGIES FOR SEMANTIC TEXT PROCESSING

Authors

DOI:

https://doi.org/10.37943/AITU.2020.75.91.002

Keywords:

computer linguistics, semantics, text, method, expert system, machine text analysis.

Abstract

An expert system for text analysis based on the heuristic knowledge of an expert linguist is proposed. Methods of linguistic analysis of the text through the use of computer technology have been further developed. Data verification was performed on the example of the Germanic language group. The algorithm of the system operation is given. The sequence of actions of the text analysis process is described. Research relates to the subject of computational linguistics and helps to automate text analysis processes. The main purpose of the research is to improve the machine’s understanding of the semantic structure of the text by finding current connections between the main members of the sentence, current connections between secondary members of the sentence, the best concept of the current word and the function of the current word in the sentence. Semantic networks are used in the software solution. The Java programming shell, such as NetBeans IDE 8.1, and the CLIPS shell, were used to create the software product. The main logical connections and structure of the program are described in the article. Methods and relations are considered on the example of the Germanic group of languages. All languages of the Germanic group are similar because they have a direct line of words which makes them even more similar: subject + predicate + subordinate clauses. Thus, to reflect the structure of the Germanic group of languages, it is sufficient to consider one of them. Namely, English, as it is the most common (1.5 billion people), international, has the largest vocabulary among the group (500 thousand words) and, in our opinion, the most complex.

Author Biographies

O. Kravchenko, Taras Shevchenko National University of Kyiv, Ukraine

Candidate of Technical Sciences, Department of Information Systems and
Technology
kravchenko_ov@gmail.com, orcid.org/0000-0002-9669-2579
Taras Shevchenko National University of Kyiv, Ukraine

Zh. Plakasova, Cherkassy State Technological University, Ukraine

Senior Lecturer, Department of Automated Systems Software
zh.plakasova@chdtu.edu.ua, orcid.org/0000-0003-3911-2600
Cherkassy State Technological University, Ukraine

M. Gladka, Taras Shevchenko National University of Kyiv, Ukraine

Assistant Department Department of Information Systems and Technology
miragladka@gmail.com., orcid.org/0000-0001-5233-2021
Taras Shevchenko National University of Kyiv, Ukraine

А. Karapetyan, Cherkassy State Technological University, Ukraine

Candidate of Technical Sciences, Department of Information Technology
Design
anait.r.karapetyan@gmail.com., orcid.org/0000-0002-7412-3252
Cherkassy State Technological University, Ukraine

S. Besedina, Bohdan Khmelnytsky National University of Cherkasy, Ukraine

Candidate of Technical Sciences, Associate Professor, Department of
Information Technologies
besedina_sv@ukr.net, https://orcid.org/0000-0002-5391-643X
Bohdan Khmelnytsky National University of Cherkasy, Ukraine

References

Kocherhan, M.P. (2020, June 6). Vstup do movoznavstva. Resource access mode: https://pidruchniki.com/1222090548043/dokumentoznavstvo/ vstup_do_movoznavstva (in Ukrainian).

Karpilovsʹka, A.E. (2006). Vstup do prykladnoyi linhvistyky: komp'yuterna linhvistyka. – Donetsʹk: Yuho-Vostok, 187. (in Ukrainian)

Kenzhaev, A.D. (2020, June 6) Machine translation: history and modernity. Resource access mode: https://lomonosov-msu.ru/archive/Lomonosov_2014/2568/2200_72719_187154.pdf. (in Russian).

Meyye, A. (2016). Osnovnyye osobennosti germanskoy gruppy yazykov.Per. s fr. Izd. Stereotip.URSS, 168.

Slovari i sistemy mashinnogo perevoda (2020). Resource access mode: http//www.itland.com.ua/products/sect.php.section.

Ivanov, O. V. (2009). Kompʺyuternyy kontent-analiz: problemy ta perspektyvy vyrishennya. Metodolohiya, teoriya ta praktyka sotsiolohichnoho analizu suchasnoho suspilʹstva, 15, 335-340.

Monroe, B.L., & Schrodt, P. A. (2008) Introduction to the Special Issue: The Statistical Analysis of Political Text. Political Analysis, 16, 351–355.

Marchenko, O.O. (2015). Systema analizu koreferentnykh zv'yazkiv u tekstakh// Shtuchnyy intelekt, № 3-4 [Electronic resource]: Resource access mode: http://dspace.nbuv.gov.ua/bitstream/handle/123456789/117200/01/Marchenko.pdf?sequence

Deep Semantic Analysis of Text James F. Allen1,2 Mary Swift1 Will de Beaumont [Electronic resource]: Resource access mode: https://www.aclweb.org/anthology/W08-2227.pdf (accessed 01.06.2020)

Klapur, A. (2007) Semantic analy p sis of text and speech [Electronic resource]: Resource access mode: https://www.cs.tut.fi/sgn/ arg/klap/introduction-semantics.pdf (accessed 01.06.2020)

Dandelion API. Semantic Text Analytics as a service. [Electronic resource]: Resource access mode: https://dandelion.eu/. (accessed 01.06.2020)

Nikolayeva, S.Yu. (2018) Zmist fakhovoho vyprobuvannya do aspirantury zi spetsialʹnosti 011 osvitni/pedahohichni nauky dlya spetsializatsiyi "Teoriya ta metodyka navchannya: hermansʹki/romansʹki movy"// International Scientific and Practical Conference World Science И-во: ROST (Dubai), 4(30), 52-59.

Downloads

Published

2020-06-30

How to Cite

Kravchenko, O., Plakasova, Z., Gladka, M., Karapetyan А., & Besedina, S. (2020). APPLICATION OF INFORMATION TECHNOLOGIES FOR SEMANTIC TEXT PROCESSING. Scientific Journal of Astana IT University, 18–31. https://doi.org/10.37943/AITU.2020.75.91.002

Issue

Section

Information Technologies
betpas
pendik escort anadolu yakasi escort bostanci escort kadikoy escort kartal escort kurtkoy escort umraniye escort
maltepe escort ataşehir escort ataşehir escort ümraniye escort pendik escort kurtköy escort anadolu yakası escort üsküdar escort şerifali escort kartal escort gebze escort kadıköy escort bostancı escort göztepe escort kadıköy escort bostancı escort üsküdar escort ataşehir escort maltepe escort kurtköy escort anadolu yakası escort ataşehir escort beylikdüzü escort