A. Ortín Gómez, M. J. Nueda Roldán, M. Siles Molina

The EMEGAE initiative addresses gender inequality in the Spanish Higher Education and Research System (SESIE). It analyzes how tasks are distributed between women and men and how those tasks are valued, considering whether they are adequately recognized, made visible, or remunerated within academic and research institutions.

In this work, we present an application developed to support the collection, processing, and automatic categorization of data provided by survey participants, together with the Natural Language Processing (NLP) methods on which the solution is based. We first apply few-shot learning techniques using Large Language Models (LLMs) and then explore more scalable and interpretable approaches, such as vector search based on a domain-adapted embeddings model.
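As a rough illustration of the vector search idea, the sketch below assigns a free-text survey response to the category of its most similar labeled example. It is not the authors' implementation: the category names, the example texts, and the `embed` function (a toy bag-of-words vector standing in for a domain-adapted embeddings model) are all assumptions made for the example.

```python
import math
from collections import Counter

# Hypothetical labeled examples; texts and category names are illustrative only.
LABELED = [
    ("teaching undergraduate classes", "teaching"),
    ("preparing lecture materials", "teaching"),
    ("writing a research paper", "research"),
    ("running laboratory experiments", "research"),
    ("attending department committee meetings", "management"),
    ("filling in administrative forms", "management"),
]


def embed(text):
    """Toy stand-in for an embeddings model: a sparse word-count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Embed the labeled examples once; this list plays the role of the vector index.
INDEX = [(embed(text), label) for text, label in LABELED]


def classify(response):
    """Return (label, score) of the labeled example most similar to the response."""
    query = embed(response)
    scored = [(cosine(query, vector), label) for vector, label in INDEX]
    best_score, best_label = max(scored, key=lambda pair: pair[0])
    return best_label, best_score


label, score = classify("writing a paper with colleagues")
print(label, round(score, 2))
```

Because each prediction can be traced back to the specific labeled example it matched, this kind of nearest-neighbor lookup is easy to inspect, and new categories can be added by extending the index without retraining a model.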

Keywords: Text classification, Time Use Survey, NLP, LLM

Scheduled

Data Analysis in CCSS II
September 3, 2026, 11:10 AM
Aula 21

