Título |
Automatic Term Extraction Using Log-Likelihood Based Comparison with General Reference Corpus |
Tipo |
Congreso |
Sub-tipo |
SCOPUS |
Descripción |
Lecture Notes in Computer Science |
Resumen |
In the paper we present a method that allows an extraction of single-word terms for a specific domain. At the next stage these terms can be used as candidates for multi-word term extraction. The proposed method is based on comparison with general reference corpus using log-likelihood similarity. We also perform clustering of the extracted terms using k-means algorithm and cosine similarity measure. We made experiments using texts of the domain of computer science. The obtained term list is analyzed in detail. |
Observaciones |
15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010; Code 81373ISBN: 3642138802;978-364213880-5 |
Lugar |
Cardiff |
País |
Reino Unido |
No. de páginas |
248-255 |
Vol. / Cap. |
6177 |
Inicio |
2010-06-23 |
Fin |
2010-06-25 |
ISBN/ISSN |
3642138802;978-36421 |