2022
Unsupervised Adaptation of Deep Speech Activity Detection Models to Unseen Domains Journal article
In: Applied Sciences, vol. 12, no. 4, pp. 1832, 2022.
aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems Journal article
In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 772-784, 2022.
Multimodal Diarization Systems by Training Enrollment Models as Identity Representations Journal article
In: Applied Sciences, vol. 12, no. 3, pp. 1141, 2022.
2021
The Domain Mismatch Problem in the Broadcast Speaker Attribution Task Journal article
In: Applied Sciences, vol. 11, no. 18, pp. 8521, 2021.
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021 Conference paper
Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), vol. 2021-September, 2021.
Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems Conference paper
Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), vol. 2021-September, 2021.
Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data Journal article
In: IEEE Signal Processing Letters, vol. 28, pp. 1135-1139, 2021.
Proceedings of XI Jornadas en Tecnología del Habla and VII Iberian SLTech (Iberspeech), vol. 2021-March, 2021.
ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge Conference paper
Proceedings of XI Jornadas en Tecnología del Habla and VII Iberian SLTech (Iberspeech), vol. 2021-March, 2021.
Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge Conference paper
Proceedings of XI Jornadas en Tecnología del Habla and VII Iberian SLTech (Iberspeech), vol. 2021-March, 2021.
Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification Conference paper
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2021-June, 2021.
Progressive Loss Functions for Speech Enhancement with Deep Neural Networks Journal article
In: EURASIP Journal on Audio, Speech, and Music Processing, vol. 2021, no. 1, pp. 1-16, 2021.
2020
Partial AUC Optimisation Using Recurrent Neural Networks for Music Detection with Limited Training Data Conference paper
Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), vol. 2020-October, 2020.
Training Speaker Enrollment Models by Network Optimization Conference paper
Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), vol. 2020-October, 2020.
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions Conference paper
Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), vol. 2020-October, 2020.
Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification Journal article
In: Computer Speech & Language, vol. 63, pp. 101078, 2020.
Knowledge Distillation and Random Erasing Data Augmentation for Text-Dependent Speaker Verification Conference paper
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2020-May, 2020.
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data Journal article
In: EURASIP Journal on Audio, Speech, and Music Processing, vol. 2020, pp. 1–19, 2020.
2019
Unsupervised adaptation of PLDA models for broadcast diarization Journal article
In: EURASIP Journal on Audio, Speech, and Music Processing, vol. 2019, no. 1, pp. 1–13, 2019.
Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media Journal article
In: Applied Sciences, vol. 9, no. 24, pp. 5412, 2019.