Autores
Tusell Rey Claudia Caridad
Yáñez Márquez Cornelio
Título Instance Selection for Hybrid and Incomplete Data based on Clustering
Tipo Revista
Sub-tipo CONACYT
Descripción International Journal of Combinatorial Optimization Problems and Informatics
Resumen This paper presents the HICCS algorithm, a novel clustering approach that handles hybrid and incomplete data. HICCS improves clustering by using compact sets as initial clusters, employing holotypes to measure intergroup dissimilarity, and merging clusters based on similarity in an order-independent manner. Additionally, it incorporates a user-defined similarity function, making it adaptable to various real-world domains. Furthermore, we introduce the IS-HICCS algorithm for instance selection, which reduces the instance set without compromising classifier accuracy, highlighting clustering's potential to enhance supervised classification models. We evaluate HICCS and IS-HICCS on synthetic and real-life datasets, showing their statistically superior performance compared to other clustering and instance selection methods, respectively.
Observaciones DOI 10.61467/2007.1558.2025.v16i3.845
Lugar Juitepec
País Mexico
No. de páginas 405-419
Vol. / Cap. v. 16 no. 3
Inicio 2025-07-14
Fin
ISBN/ISSN