Machine learning methods for solving semantics and context problems in processing textual data

Keywords: Machine learning, natural language processing, semantics, context, textual data, neural networks, transformers, BERT, GPT-3, data analysis, sentiment analysis, semantic analysis

Abstract

Topicality. As machine learning capabilities expand and impact many aspects of modern life, such as natural language processing, understanding semantics and context in textual data is becoming increasingly important. Semantics and context play a significant role in the ability of machines to understand human language. They are central elements in various applications such as machine translation, sentiment analysis, spam detection, voice recognition, and others. However, these aspects are often neglected or underestimated when processing textual data. Despite significant progress in this area, the problem of semantics and context remains unresolved, which reduces the efficiency and accuracy of many machine learning systems.

Goal: The main goal of this article is to investigate the problem of understanding semantics and context in machine learning in the textual data processing. The article aims to identify the main challenges associated with understanding semantics and context, and how they affect various aspects of text processing. Additionally, current techniques and approaches used in the field of machine learning for solving those problems have been analyzed and their limitations identified.

Research methods. Analysis, explanation, classification.

The results. It has been found that despite significant advances in machine learning technologies, problems of semantics and context in processing textual data are still existing. They affect the quality and accuracy of decisions made by machine learning based systems, which can lead to incorrect analysis and distortion of data. It has been found that even modern transformer-based models can face challenges in understanding semantics and context, especially in complex and multi-valued scenarios.

Conclusions. On the basis of the conducted research, it has been concluded that the problem of semantics and context in the processing of textual data is significant and requires further study. The existing methods and technologies show high results in some cases, but may be insufficient in others, especially complex ones. It is proposed to continue research in this area, to develop new methods and approaches that will be able to effectively solve these problems. It is also important to study how different contextual factors affect the semantics of textual data and how these effects can be taken into account when designing and using machine learning systems.

Downloads

Download data is not yet available.

Author Biographies

Ihor Malyha, Kharkiv National University of V.N. Karazin,4 Svobody Square, Kharkiv, Ukraine, 61077

PhD student

Serhiy Shmatkov, Kharkiv National University of V.N. Karazin,4 Svobody Square, Kharkiv, Ukraine, 61077

Doctor of science, professor; Head of the Department of Theoretical Theoretical and Applied System Engineering

References

/

References

Published
2022-12-26
How to Cite
Malyha, I., & Shmatkov, S. (2022). Machine learning methods for solving semantics and context problems in processing textual data. Bulletin of V.N. Karazin Kharkiv National University, Series «Mathematical Modeling. Information Technology. Automated Control Systems», 56, 35-42. https://doi.org/10.26565/2304-6201-2022-56-03
Section
Статті