Creating a dataset of Russian texts for emotion analysis using Robert Plutchik's model

Abstract

Sklyarov M.A., Levshin D.V., Zubkov A.V.

Incoming article date: 15.02.2025

The aim of this research is to make sentiment analysis of Russian-language texts more fine-grained by developing a dataset with an extensive set of emotion categories. The paper reviews the main methods of sentiment analysis and the main models of emotion. A software system for decentralized data labeling was developed and is described. The novelty of the work is that, for the first time, an emotion model with more than 8 emotion classes, namely R. Plutchik's model, is used to determine the emotional tone of Russian-language texts. The result is a new dataset for the study and analysis of emotions. It consists of 24,435 unique records labeled with 32 emotion classes, which makes it one of the most diverse and detailed datasets in the field. A neural network was trained on this dataset to determine the set of emotions of a text's author at the time of writing. The dataset opens the way for further research in this area; one promising task is improving the performance of neural networks trained on it.

Keywords: sentiment, analysis, model, Robert Plutchik, emotions, markup, text
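
The abstract does not list the 32 emotion classes. One plausible reading, offered here only as an assumption, is the standard layout of Plutchik's wheel: eight primary emotions at three intensity levels each (24 labels) plus the eight primary dyads (8 labels). The sketch below enumerates such a label set and encodes a set of emotions as a multi-hot vector, the usual target format when a text may carry several emotions at once. The names INTENSITIES, PRIMARY_DYADS, LABELS and encode are illustrative and are not taken from the paper.

```python
# Assumption: the 32 classes follow Plutchik's wheel (24 intensity labels
# plus 8 primary dyads). The paper's actual label set may differ.

# Eight primary emotions, each as (mild, basic, intense).
INTENSITIES = {
    "joy": ("serenity", "joy", "ecstasy"),
    "trust": ("acceptance", "trust", "admiration"),
    "fear": ("apprehension", "fear", "terror"),
    "surprise": ("distraction", "surprise", "amazement"),
    "sadness": ("pensiveness", "sadness", "grief"),
    "disgust": ("boredom", "disgust", "loathing"),
    "anger": ("annoyance", "anger", "rage"),
    "anticipation": ("interest", "anticipation", "vigilance"),
}

# Primary dyads: blends of two adjacent primary emotions on the wheel.
PRIMARY_DYADS = {
    "love": ("joy", "trust"),
    "submission": ("trust", "fear"),
    "awe": ("fear", "surprise"),
    "disapproval": ("surprise", "sadness"),
    "remorse": ("sadness", "disgust"),
    "contempt": ("disgust", "anger"),
    "aggressiveness": ("anger", "anticipation"),
    "optimism": ("anticipation", "joy"),
}

# Flat, ordered label list: intensity labels first, then dyads.
LABELS = [name for levels in INTENSITIES.values() for name in levels]
LABELS += list(PRIMARY_DYADS)
LABEL_INDEX = {name: i for i, name in enumerate(LABELS)}


def encode(emotions):
    """Return a multi-hot vector with a 1 at each listed emotion's index."""
    vector = [0] * len(LABELS)
    for emotion in emotions:
        vector[LABEL_INDEX[emotion]] = 1
    return vector


if __name__ == "__main__":
    print(len(LABELS))
    print(encode(["joy", "love"]))
```

A multi-hot target of this kind pairs naturally with a per-class sigmoid output, so a single text can be assigned several emotions; whether the authors' network is built this way is not stated in the abstract.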