Autores
Nuñez Prado César Jesús
Sidorov Grigori
Gelbukh Alexander
Título Similarity Metrics for Automatic Verb Sense Disambiguation Using a Corpus based on WordNet; [Métricas de similitud para la desambiguación automática del sentido de los verbos sobre un corpus basado en WordNet]
Tipo Revista
Sub-tipo CONACYT
Descripción Computacion y Sistemas
Resumen This research explores the sense disambiguation of polysemous verbs in the definitions of a digital dictionary (WordNet) through two similarity metrics: Cosine Similarity and the Modified Simplified Lesk Algorithm. These methods were applied with the aim of identifying the three senses of greatest correspondence for each verb, thus facilitating the selection of the most appropriate sense according to the context in which it appears. Verb sense disambiguation is a fundamental task in the field of Natural Language Processing (NLP), with broad applications including automatic translation, information retrieval and semantic analysis, and it has been evaluated in competitions such as SensEval and SemEval. In this research, WordNet was employed to build a dataset that includes verbs with multiple definitions, representing a common and challenging scenario in lexical disambiguation. To validate the results, a manual evaluation by experts was used, allowing to establish a reliable reference on the correct meaning of each verb. Accuracy results were 82.78% for the Cosine Similarity method and 73.12% for the Modified Simplified Lesk Algorithm, evidencing the relative effectiveness of each method and providing a starting point for improving disambiguation models in advanced PLN tasks. © 2024 Instituto Politecnico Nacional. All rights reserved.
Observaciones DOI 10.13053/CyS-28-4-4695
Lugar Ciudad de México
País Mexico
No. de páginas 1727-1740
Vol. / Cap. v. 28 no. 4
Inicio 2024-10-01
Fin
ISBN/ISSN