Using the word similarity method to evaluate text vectorization algorithms

Abstract

Saygin A.A., Fedosin S.A.

Article received: 26.05.2024

The article briefly describes existing methods for vectorizing natural-language texts. It describes how such methods can be evaluated by the word similarity method, explains how the evaluation data were selected, and presents a comparative analysis of several vectorizer models, comparing the results of evaluating their performance.

Keywords: natural language processing, vectorization, word-form embedding, semantic similarity, correlation
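
The word similarity evaluation named in the abstract is commonly carried out by correlating a model's similarity scores for word pairs with human ratings of the same pairs. The sketch below illustrates that general idea only, not the authors' procedure. It assumes cosine similarity as the model score and Spearman rank correlation as the agreement measure (the abstract mentions correlation without naming a coefficient). The function names, the toy vectors, and the ratings are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def evaluate(embeddings, rated_pairs):
    """Correlate model similarities with human ratings.

    Pairs containing a word missing from `embeddings` are skipped
    (an assumption; other policies are possible).
    """
    model_scores, human_scores = [], []
    for word_a, word_b, rating in rated_pairs:
        if word_a in embeddings and word_b in embeddings:
            model_scores.append(cosine(embeddings[word_a], embeddings[word_b]))
            human_scores.append(rating)
    return spearman(model_scores, human_scores)


# Hypothetical toy data, for illustration only.
toy_embeddings = {
    "cat": [1.0, 0.0],
    "dog": [0.9, 0.1],
    "car": [0.0, 1.0],
    "bus": [0.2, 0.8],
}
toy_pairs = [
    ("cat", "dog", 9.0),
    ("car", "bus", 8.0),
    ("cat", "car", 1.0),
    ("dog", "bus", 2.0),
]

score = evaluate(toy_embeddings, toy_pairs)
print(f"Spearman correlation: {score:.3f}")
```

Under this scheme, a higher correlation means the vectorizer orders word pairs more like the human raters do. This allows several models to be compared on the same set of rated pairs.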