Publications by Juan Manuel Martin Doñas (18)

2024

  1. EMPHASIS: Empowering Decision Making with Higher Productivity by Means of HyperAutomation

    CEUR Workshop Proceedings

  2. Stream-based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2023

  1. An Online Diarization Approach for Streaming Applications Based on Tree-Clustering and Bayesian Resegmentation

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. The Vicomtech partial deepfake detection and location system for the 2023 ADD Challenge

    CEUR Workshop Proceedings

  3. When Whisper Meets TTS: Domain Adaptation Using only Synthetic Speech Data

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2022

  1. Cascade or Direct Speech Translation? A Case Study

    Applied Sciences (Switzerland), Vol. 12, No. 3

  2. The Vicomtech Audio Deepfake Detection System Based on Wav2vec2 for the 2022 ADD Challenge

    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

2020

  1. Asteroid: The PyTorch-based audio source separation toolkit for researchers

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

  2. Online Multichannel Speech Enhancement Based on Recursive EM and DNN-Based Speech Presence Estimation

    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 28, pp. 3080-3094

  3. Silent speech interfaces for speech restoration: A review

    IEEE Access, Vol. 8, pp. 177995-178021

2019

  1. An extended Kalman filter for RTF estimation in dual-microphone smartphones

    European Signal Processing Conference

  2. Dual-channel speech enhancement based on extended Kalman filter relative transfer function estimation

    Applied Sciences (Switzerland), Vol. 9, No. 12

  3. Multi-channel block-online source extraction based on utterance adaptation

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

2018

  1. A deep learning loss function based on the perceptual evaluation of the speech quality

    IEEE Signal Processing Letters, Vol. 25, No. 11, pp. 1680-1684

  2. A postfiltering approach for dual-microphone smartphones

    4th International Conference, IberSPEECH 2018

  3. Unscented Transform-Based Dual-Channel Noise Estimation: Application to Speech Enhancement on Smartphones

    2018 41st International Conference on Telecommunications and Signal Processing, TSP 2018

2017

  1. Dual-channel DNN-based speech enhancement for smartphones

    2017 IEEE 19th International Workshop on Multimedia Signal Processing, MMSP 2017

2016

  1. Deep neural network-based noise estimation for robust ASR in dual-microphone smartphones

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)