M. Álvarez Martín, E. Boj del Val, A. Grané Chávez

In this work we explore distance-based methodologies for data imputation in complex data sets of mixed-type data. Our proposal is based on the use of robust distances, calculated with the dbrobust R package, which allows combining numerical and categorical variables while reducing the influence of outliers. In particular, we analyze several real data sets with varying percentages of missing data and evaluate the efficiency and computing time of our proposal and some competitors. The results show that robust methods offer efficient missing value imputation.

Keywords: data imputation, dbrobust, mixed-type data, robust distances

Scheduled

GT AMyC I: Advances in Distance-Based Methods
September 4, 2026  9:00 AM
Aula 28


Other papers in the same session


Cookie policy

We use cookies in order to be able to identify and authenticate you on the website. They are necessary for the correct functioning of it, and therefore they can not be disabled. If you continue browsing the website, you are agreeing with their acceptance, as well as our Privacy Policy.

Additionally, we use Google Analytics in order to analyze the website traffic. They also use cookies and you can accept or refuse them with the buttons below.

You can read more details about our Cookie Policy and our Privacy Policy.