SABER

Autores
Nuñez Prado César Jesús
Sidorov Grigori
Gelbukh Alexander

Título	Similarity Metrics for Automatic Verb Sense Disambiguation Using a Corpus based on WordNet; [Métricas de similitud para la desambiguación automática del sentido de los verbos sobre un corpus basado en WordNet]
Tipo	Revista
Sub-tipo	CONACYT
Descripción	Computacion y Sistemas
Resumen	This research explores the sense disambiguation of polysemous verbs in the definitions of a digital dictionary (WordNet) through two similarity metrics: Cosine Similarity and the Modified Simplified Lesk Algorithm. These methods were applied with the aim of identifying the three senses of greatest correspondence for each verb, thus facilitating the selection of the most appropriate sense according to the context in which it appears. Verb sense disambiguation is a fundamental task in the field of Natural Language Processing (NLP), with broad applications including automatic translation, information retrieval and semantic analysis, and it has been evaluated in competitions such as SensEval and SemEval. In this research, WordNet was employed to build a dataset that includes verbs with multiple definitions, representing a common and challenging scenario in lexical disambiguation. To validate the results, a manual evaluation by experts was used, allowing to establish a reliable reference on the correct meaning of each verb. Accuracy results were 82.78% for the Cosine Similarity method and 73.12% for the Modified Simplified Lesk Algorithm, evidencing the relative effectiveness of each method and providing a starting point for improving disambiguation models in advanced PLN tasks. © 2024 Instituto Politecnico Nacional. All rights reserved.
Observaciones	DOI 10.13053/CyS-28-4-4695
Lugar	Ciudad de México
País	Mexico
No. de páginas	1727-1740
Vol. / Cap.	v. 28 no. 4
Inicio	2024-10-01
Fin
ISBN/ISSN