| Título |
Author Clustering using Hierarchical Clustering Analysis. Notebook for PAN at CLEF 2017 |
| Tipo |
Congreso |
| Sub-tipo |
Memoria |
| Descripción |
18th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2017 |
| Resumen |
This paper presents our approach to the Author Clustering task at PAN 2017. We performed a hierarchical clustering analysis of different document features: typed and untyped character n-grams, and word n-grams.We experimented with two feature representation methods, log-entropy model, and tf-idf; while tuning minimum frequency threshold values to reduce the dimensionality. Our system was ranked 1st in both subtasks, author clustering and authorship-link ranking. |
| Observaciones |
CEUR Workshop Proceedings, v. 1866 |
| Lugar |
Dublin |
| País |
Irlanda |
| No. de páginas |
7 p. |
| Vol. / Cap. |
|
| Inicio |
2017-09-11 |
| Fin |
2017-09-14 |
| ISBN/ISSN |
|