Autores
Sidorov Grigori
Gelbukh Alexander
Lavín Villa Moisés Eduardo
Chanona Hernández Liliana
Título Automatic Term Extraction Using Log-Likelihood Based Comparison with General Reference Corpus
Tipo Congreso
Sub-tipo SCOPUS
Descripción Lecture Notes in Computer Science
Resumen In the paper we present a method that allows an extraction of single-word terms for a specific domain. At the next stage these terms can be used as candidates for multi-word term extraction. The proposed method is based on comparison with general reference corpus using log-likelihood similarity. We also perform clustering of the extracted terms using k-means algorithm and cosine similarity measure. We made experiments using texts of the domain of computer science. The obtained term list is analyzed in detail.
Observaciones 15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010; Code 81373ISBN: 3642138802;978-364213880-5
Lugar Cardiff
País Reino Unido
No. de páginas 248-255
Vol. / Cap. 6177
Inicio 2010-06-23
Fin 2010-06-25
ISBN/ISSN 3642138802;978-36421