SABER

Autores
Tusell Rey Claudia Caridad
Yáñez Márquez Cornelio

Título	Instance Selection for Hybrid and Incomplete Data based on Clustering
Tipo	Revista
Sub-tipo	CONACYT
Descripción	International Journal of Combinatorial Optimization Problems and Informatics
Resumen	This paper presents the HICCS algorithm, a novel clustering approach that handles hybrid and incomplete data. HICCS improves clustering by using compact sets as initial clusters, employing holotypes to measure intergroup dissimilarity, and merging clusters based on similarity in an order-independent manner. Additionally, it incorporates a user-defined similarity function, making it adaptable to various real-world domains. Furthermore, we introduce the IS-HICCS algorithm for instance selection, which reduces the instance set without compromising classifier accuracy, highlighting clustering's potential to enhance supervised classification models. We evaluate HICCS and IS-HICCS on synthetic and real-life datasets, showing their statistically superior performance compared to other clustering and instance selection methods, respectively.
Observaciones	DOI 10.61467/2007.1558.2025.v16i3.845
Lugar	Juitepec
País	Mexico
No. de páginas	405-419
Vol. / Cap.	v. 16 no. 3
Inicio	2025-07-14
Fin
ISBN/ISSN