Guiding Feature Selection with Complexity Measures

D. Moctezuma, C. Lancho Martín

Feature selection aims to identify a subset of features that achieves performance comparable to or better than that obtained using the full feature set. The goal is to remove redundant or noisy variables, retaining only the informative ones, thereby enabling the development of more reliable ML models.

This objective is closely related to the concept of data complexity. Complexity measures characterize factors that reflect the intrinsic difficulty of a dataset. In supervised classification, aspects such as class distribution, decision boundary geometry, and noise levels can significantly impact learning performance and complexity measures aim to quantify them.

Although several state-of-the-art studies have applied complexity measures to feature selection with positive results, a comprehensive analysis of how to systematically exploit them is still lacking. In this work, we address this gap by conducting a global analysis and proposing guidelines for their use in feature selection.

Keywords: feature selection complexity measures

Scheduled

GT SW II: Inteligencia Artificial y Aprendizaje Automático

September 5, 2026 10:00 AM

Aula 21

Other papers in the same session

Enhancing NLP Models with GenAI

N. Madrueño Sierro, I. Martín de Diego, A. Fernández Isabel

Estadística aplicada: innovación y análisis avanzado scon Power BI

M. E. Escorihuela Sahún

Algoritmos de clasificación para la detección y el control de plagas a partir de imágenes multiespectrales

P. Belén Galipienso, M. J. Nueda Roldán, D. Arcos Limiñana

Guiding Feature Selection with Complexity Measures

Other papers in the same session

Cookie policy