Autores
Aroyehun Segun Taofeek
Gelbukh Alexander
Sidorov Grigori
Título Two-Level Classifier for Detecting Categories of Offensive Language
Tipo Libro
Sub-tipo Indefinido
Descripción Recent Developments and the New Directions of Research, Foundations, and Applications 
Resumen We explore the task of offensive content classification on the HASOC 2021 shared task dataset. Our approach is based on two-level classification scheme (corresponding to Subtasks 1A and 1B respectively). The first level is a binary classification (offensive or not). The second level further classifies only the offensive instances. The classifier at each level is a fine-tuned transformer model. Our model on the English dataset achieves an overall best macro F1 score of 0.831 and 0.666 on Subtasks 1A and 1B, respectively. The model performance on Hindi and Marathi is competitive: macro F1 score of 0.778 for Hindi Subtask 1A and 0.553 for Hindi Subtask 1B (fourth place on the leaderboard), and a macro F1 score of 0.847 for Marathi Subtask 1A (13th on the leaderboard). © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Observaciones DOI 10.1007/978-3-031-23476-7_20 Studies in Fuzziness and Soft Computing, v. 423
Lugar Cham
País Suiza
No. de páginas 225-232
Vol. / Cap. STUDFUZZ v, 423
Inicio 2023-06-27
Fin
ISBN/ISSN 9783031234750