SABER

Título	Automatic Term Extraction Using Log-Likelihood Based Comparison with General Reference Corpus
Tipo	Congreso
Sub-tipo	SCOPUS
Descripción	Lecture Notes in Computer Science
Resumen	In the paper we present a method that allows an extraction of single-word terms for a specific domain. At the next stage these terms can be used as candidates for multi-word term extraction. The proposed method is based on comparison with general reference corpus using log-likelihood similarity. We also perform clustering of the extracted terms using k-means algorithm and cosine similarity measure. We made experiments using texts of the domain of computer science. The obtained term list is analyzed in detail.
Observaciones	15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010; Code 81373ISBN: 3642138802;978-364213880-5
Lugar	Cardiff
País	Reino Unido
No. de páginas	248-255
Vol. / Cap.	6177
Inicio	2010-06-23
Fin	2010-06-25
ISBN/ISSN	3642138802;978-36421