Autores
Quiroz Mercado Job Isaias
Barrón Fernández Ricardo
Ramírez Salinas Marco Antonio
Título Semantic Similarity Estimation Using Vector Symbolic Architectures
Tipo Revista
Sub-tipo JCR
Descripción IEEE Access
Resumen For many natural language processing applications, estimating similarity and relatedness between words are key tasks that serve as the basis for classification and generalization. Currently, vector semantic models (VSM) have become a fundamental language modeling tool. VSMs represent words as points in a high-dimensional space and follow the distributional hypothesis of meaning, which assumes that semantic similarity is related to the context. In this paper, we propose a model whose representations are based on the semantic features associated with a concept within the ConceptNet knowledge graph. The proposed model is based on a vector symbolic architecture framework, which defines a set of arithmetic operations to encode the semantic features within a single high-dimensional vector. In addition to word distribution, these vector representations consider several types of information. Moreover, owing to the properties of high-dimensional spaces, they have the additional advantage of being interpretable. We analyze the model's performance on the SimLex-999 dataset, a dataset where commonly used distributional models (e.g., word2vec or GloVe) perform poorly. Our results are similar to those of other hybrid models, and they surpass several state-of-the-art distributional and knowledge-based models. 
Observaciones DOI 10.1109/ACCESS.2020.3001765
Lugar New Jersey
País Estados Unidos
No. de páginas 109120-109132
Vol. / Cap. v. 8
Inicio 2020-06-11
Fin
ISBN/ISSN