Designing an application for classifying Time Use Survey texts

A. Ortín Gómez, M. J. Nueda Roldán, M. Siles Molina

The EMEGAE initiative addresses gender inequality in the Spanish Higher Education and Research System (SESIE). It focuses on the analysis of the distribution of tasks between women and men, as well as on the valuation of these tasks, considering whether they are adequately recognized, made visible, or remunerated within academic and research institutions.

In this work, we present an application developed to support the collection, processing and automatic categorization of data provided by survey participants, as well as the Natural Language Processing (NLP) methods on which the solution is based. We first applied few-shot learning techniques using Large Language Models (LLMs), and subsequently explored more scalable and interpretable approaches, such as vector search based on a domain-adapted embeddings model.
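As a rough illustration of the vector search idea mentioned above, the following minimal sketch assigns a free-text activity description to the category of its most similar labeled example, using cosine similarity. Everything in it is an assumption made for illustration: the category names, the example texts, and the `embed` function, which is a toy bag-of-words stand-in for a domain-adapted embeddings model. It is not the authors' implementation.

```python
import math
from collections import Counter

# Hypothetical labeled examples; categories and texts are illustrative only.
LABELED = [
    ("teaching a lecture to undergraduate students", "teaching"),
    ("grading exams and student assignments", "teaching"),
    ("writing a research paper draft", "research"),
    ("running experiments in the laboratory", "research"),
    ("attending a department committee meeting", "management"),
    ("filling in administrative forms", "management"),
]


def embed(text):
    """Toy stand-in for an embeddings model: a sparse word-count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(text, index=LABELED):
    """Return (label, score) of the most similar labeled example."""
    query = embed(text)
    best_label, best_score = None, -1.0
    for example, label in index:
        score = cosine(query, embed(example))
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


if __name__ == "__main__":
    print(classify("grading student exams"))
    print(classify("writing a paper"))
```

Because each prediction can be traced back to the specific labeled example it matched, an approach of this kind is easy to inspect, and new categories can be added by extending the index rather than by rewriting prompts.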

Keywords: text classification, Time Use Survey, NLP, LLM

Scheduled

Data Analysis in Social Sciences II

September 3, 2026, 11:10

Room 21

