Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders
Mostafa Sadeghi, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, 28, pp.1788-1800. ⟨ 10.1109/TASLP.2020.3000593 ⟩. ⟨ hal-02364900v3 ⟩
A Recurrent Variational Autoencoder for Speech Enhancement
Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. A Recurrent Variational Autoencoder for Speech Enhancement. IEEE International Conference on Acoustic Speech and Signal Processing (ICASSP), May 2020, Barcelona, Spain. pp.1-7, ⟨ 10.1109/ICASSP40776.2020.9053164 ⟩. ⟨ hal-02329000v2 ⟩
Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers
Yutong Ban, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers. IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2019, 42, pp.1-17. ⟨ 10.1109/TPAMI.2019.2953020 ⟩. ⟨ hal-01950866v2 ⟩
Audio-Visual Variational Fusion for Multi-Person Tracking with Robots
Xavier Alameda-Pineda, Soraya Arias, Yutong Ban, Guillaume Delorme, Laurent Girin, et al.. Audio-Visual Variational Fusion for Multi-Person Tracking with Robots. ACMMM 2019 - 27th ACM International Conference on Multimedia, Oct 2019, Nice, France. pp.1059-1061, ⟨ 10.1145/3343031.3350590 ⟩. ⟨ hal-02354514 ⟩
Bayesian time-domain multiple sound source localization for a stochastic machine
Raphael Frisch, Marvin Faix, Jacques Droulez, Laurent Girin, Emmanuel Mazer. Bayesian time-domain multiple sound source localization for a stochastic machine. EUSIPCO 2019 - 27th European Signal Processing Conference, Sep 2019, A Coruna, Spain. pp.1-5, ⟨ 10.23919/EUSIPCO.2019.8902666 ⟩. ⟨ hal-02377220 ⟩
Notes on the use of variational autoencoders for speech and audio spectrogram modeling
Laurent Girin, Fanny Roche, Thomas Hueber, Simon Leglaive. Notes on the use of variational autoencoders for speech and audio spectrogram modeling. DAFx 2019 - 22nd International Conference on Digital Audio Effects, Sep 2019, Birmingham, United Kingdom. pp.1-8. ⟨ hal-02349385 ⟩
Audio-noise Power Spectral Density Estimation Using Long Short-term Memory
Xiaofei Li, Simon Leglaive, Laurent Girin, Radu Horaud. Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2019, 26 (6), pp.918-922. ⟨ 10.1109/LSP.2019.2911879 ⟩. ⟨ hal-02100059 ⟩
Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin. Autoencoders for music sound modeling : a comparison of linear, shallow, deep, recurrent and variational models. SMC 2019 - 16th Sound & Music Computing Conference, May 2019, Malaga, Spain. ⟨ hal-02349406 ⟩
Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering
Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud. Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2019, 27 (9), pp.1365-1377. ⟨ 10.1109/TASLP.2019.2919183 ⟩. ⟨ hal-01969041 ⟩
Speech enhancement with variational autoencoders and alpha-stable distributions
Simon Leglaive, Umut Simsekli, Antoine Liutkus, Laurent Girin, Radu Horaud. Speech enhancement with variational autoencoders and alpha-stable distributions. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.541-545, ⟨ 10.1109/ICASSP.2019.8682546 ⟩. ⟨ hal-02005106 ⟩
Prénom NOM | Date d'entrée en thèse | Sujet | Ecole doctorale |
GEORGES Marc-Antoine | 01/10/2019 | Hybrid Bayesian and deep neural modeling for weakly supervised | EDISCE |
STEPHENSON Brooke | 06/01/2020 | Incremental sequence-to-sequence mapping for speech generation using deep neural networks | EEATS |
Grenoble Images Parole Signal Automatique laboratoire
UMR 5216 CNRS - Grenoble INP - Université Joseph Fourier - Université Stendhal