Autores
Gelbukh Alexander
Título NLP-NITMZ@DPIL-FIRE2016: Language independent paraphrases detection
Tipo Congreso
Sub-tipo Memoria
Descripción 2016 Forum for Information Retrieval Evaluation, FIRE 2016
Resumen In this paper we describe the detailed information of NLP-NITMZ system on the participation of DPIL1 shared task at Forum for Information Retrieval Evaluation (FIRE 2016). The main aim of DPIL shared task is to detect paraphrases in Indian Languages. Paraphrase detection is an important part in the field of Information Retrieval, Document Summarization, Question Answering, Plagiarism Detection etc. In our approach, we used language independent feature-set to detect paraphrases in Indian languages. Features are mainly based on lexical based similarity. Our system's three features are: Jaccard Similarity, length normalized Edit Distance and Cosine Similarity. Finally, these feature-set are trained using Probabilistic Neural Network (PNN) to detect the paraphrases. With our feature-set, we achieved 88.13% average accuracy in Sub-Task 1 and 71.98% average accuracy in Sub-Task 2.
Observaciones CEUR Workshop Proceedings, v. 1737
Lugar Kolkata
País India
No. de páginas 256-259
Vol. / Cap.
Inicio 2016-12-07
Fin 2016-12-10
ISBN/ISSN