No. 4 (25) - 2024 / 2024-12-31
This article analyzes machine learning algorithms for sentiment analysis of Kazakh-language text and identifies the most effective ones. As the volume of Kazakh-language content on social networks, news sites, and online stores grows, so does the need for tools and methods that process Kazakh-language data to extract valuable information about people's opinions and views. The dataset used in the study was therefore collected from real online stores and news sites. It contains 1500 records, of which 80% (1200 records) were used to train the algorithms and 20% (300 records) to test them. The algorithms considered are logistic regression, multinomial naive Bayes, support vector machine (SVM), XGBoost, and the long short-term memory (LSTM) deep learning model. The algorithms were tested as the dataset grew from 500 to 1500 records, and individual, ensemble, and augmented variants of the algorithms were implemented and tested. The results are reported in terms of algorithm accuracy.
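As a rough illustration of the evaluation protocol described in the abstract (an 80/20 train/test split, repeated as the dataset grows from 500 to 1500 records, scored by accuracy), the following sketch uses only the Python standard library. It is not the authors' code. The helper names (`train_test_split`, `MultinomialNB`, `accuracy`) are hypothetical, the classifier is a minimal reimplementation of multinomial naive Bayes with Laplace smoothing, the intermediate size of 1000 records is an assumption (the abstract gives only the endpoints), and the six short Kazakh phrases are invented placeholders rather than records from the study's dataset.

```python
import math
import random
from collections import Counter, defaultdict


def train_test_split(records, test_fraction=0.2, seed=0):
    """Shuffle a copy of the records and return (train, test)."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


class MultinomialNB:
    """Multinomial naive Bayes over whitespace tokens, Laplace-smoothed."""

    def fit(self, records):
        self.class_counts = Counter(label for _, label in records)
        self.word_counts = defaultdict(Counter)
        for text, label in records:
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            score = math.log(n / total)  # log prior of the class
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for w in text.lower().split():
                if w in self.vocab:  # words unseen in training are skipped
                    score += math.log((self.word_counts[label][w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model, records):
    """Fraction of records whose predicted label matches the true label."""
    correct = sum(model.predict(text) == label for text, label in records)
    return correct / len(records)


# Invented placeholder phrases (assumption), NOT the study's data.
positive = ["өте жақсы тауар", "сапасы жақсы", "маған ұнады"]
negative = ["өте жаман тауар", "сапасы жаман", "маған ұнамады"]
records = [(t, "pos") for t in positive] * 250 + [(t, "neg") for t in negative] * 250
random.Random(1).shuffle(records)

results = []
for size in (500, 1000, 1500):  # 1000 is an assumed intermediate step
    train, test = train_test_split(records[:size])
    model = MultinomialNB().fit(train)
    results.append((size, len(train), len(test), accuracy(model, test)))

for size, n_train, n_test, acc in results:
    print(f"records={size} train={n_train} test={n_test} accuracy={acc:.3f}")
```

Because the placeholder data is trivially separable, the accuracy printed here says nothing about the accuracies reported in the study. The sketch only shows how the split sizes and the accuracy metric fit together. The other algorithms named in the abstract (logistic regression, SVM, XGBoost, LSTM) would plug into the same loop through the same `fit`/`predict` interface.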