Authors
Markov Ilia
Sidorov Grigori
Title CIC-GIL approach to author profiling in Spanish tweets: Location and occupation
Type Conference
Subtype Proceedings
Description 3rd Workshop on Evaluation of Human Language Technologies for Iberian Languages, IberEval 2018
Abstract We present the CIC-GIL approach to the author profiling (AP) task at MEX-A3T 2018. The task consists of two subtasks: identifying authors' location (6-way) and occupation (8-way) in a corpus of Mexican Spanish tweets. For location identification, we used a logistic regression classifier trained on typed character n-grams, function-word n-grams, and regionalisms; for occupation identification, we used typed character n-grams with several modifications. Our best run achieved a macro-averaged F1 score of 73.63% for location and 48.94% for occupation identification. The results are competitive with those of other participating teams; in particular, our best run was ranked fourth in the shared task. © 2018 CEUR-WS. All Rights Reserved.
Notes CEUR Workshop Proceedings, v. 2150
Place Seville
Country Spain
Pages 97-101
Vol. / Ch.
Start 2018-09-18
End
ISBN/ISSN