ViVoLab, Aragón Institute for Engineering Research (I3A)
University of Zaragoza, Spain
Pablo Gimeno was born in Valencia, Spain (1994). He received the Bachelor degree in Telecommunication Engineering and the Master degree in Telecommunication Engineering from the University of Zaragoza, Spain in the years 2016 and 2018 respectively.
He is currently pursuing his PhD under the supervision of Dr. Alfonso Ortega. His research interests span the areas of audio and speech processing, audio segmentation, speech activity detection and machine learning applied to audio processing. So far, part of his work has been published to different international conferences and prestigious peer-reviewed journals.
He also collaborates actively in teaching courses related to signal processing for the Bachelor degree in Telecommunication Engineering (Audio & Image Processing, Signal Processing laboratory) and the Master degree in Telecommunication Engineering (Speech Technologies).
A Study on the Use of wav2vec Representations for Multiclass Audio Segmentation Conferencia
Proceedings of XII Jornadas en Tecnología del Habla and VIII Iberian SLTech (Iberspeech), 2022.
Unsupervised Adaptation of Deep Speech Activity Detection Models to Unseen Domains Artículo de revista
En: Applied Sciences, vol. 12, no. 4, pp. 1832, 2022.
Multimodal Diarization Systems by Training Enrollment Models as Identity Representations Artículo de revista
En: Applied Sciences, vol. 12, no. 3, pp. 1141, 2022.
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021 Conferencia
Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), vol. 2021-September, 2021.
Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data Artículo de revista
En: IEEE Signal Processing Letters, vol. 28, pp. 1135-1139, 2021.