V. Moreno, L. Fernández Piana

Feature selection is a fundamental challenge in high-dimensional regression, where identifying the most relevant variables is essential for both interpretability and predictive performance. Although LASSO is widely used for this purpose, its reliance on a single tuning parameter $\lambda$ often limits its ability to recover the true active set. We address this issue by introducing a feature importance score, denoted $I^*$, that leverages the full LASSO path and aggregates information across multiple regularization levels.
We develop two complementary algorithms for computing and applying this score: LASSO.PATH, which efficiently constructs $I^*$, and LASSO.PATH.BISEC, which selects variables by estimating an optimal data-driven threshold. Through simulations and real-data analyses, we demonstrate that our approach improves feature selection accuracy and reduces false discoveries compared with traditional LASSO-based methods.

Keywords: LASSO path, variable selection, feature importance

Scheduled

Data Analysis
September 4, 2026  3:30 PM
Aula 20


Other papers in the same session


Cookie policy

We use cookies in order to be able to identify and authenticate you on the website. They are necessary for the correct functioning of it, and therefore they can not be disabled. If you continue browsing the website, you are agreeing with their acceptance, as well as our Privacy Policy.

Additionally, we use Google Analytics in order to analyze the website traffic. They also use cookies and you can accept or refuse them with the buttons below.

You can read more details about our Cookie Policy and our Privacy Policy.