ENS - Ecole Normale Supérieure
Publications

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Kokkinos, I., Schatz, T., Synnaeve, G. & Dupoux, E. (2018). Learning Filterbanks from Raw Speech for Phoneme Recognition. In ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing.

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2018). End-to-End Speech Recognition From the Raw Waveform. In Interspeech 2018. doi:10.21437/Interspeech.2018-2414

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Usunier, N. & Dupoux, E. (2016). Joint Learning of Speaker and Phonetic Similarities with Siamese Networks. In INTERSPEECH-2016, 1295-1299.

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Versteegh, M. & Dupoux, E. (2016). A Deep Scattering Spectrum - Deep Siamese Network Pipeline For Unsupervised Acoustic Modeling. In ICASSP-2016, 4965-4969.

Reviewed conference proceeding  

Versteegh, M., Anguera, X., Jansen, A. & Dupoux, E. (2016). The Zero Resource Speech Challenge 2015: Proposed Approaches and Results. In SLTU-2016, Procedia Computer Science, Vol. 81, 67-72.

Reviewed conference proceeding  

Versteegh, M., Thiollière, R., Schatz, T., Cao, X., Anguera, X., Jansen, A. & Dupoux, E. (2015). The Zero Resource Speech Challenge 2015. In INTERSPEECH-2015, 3169-3173.

Reviewed conference proceeding  

Varadarajan, B., Khudanpur, S. & Dupoux, E. (2008). Unsupervised Learning of Acoustic Subword Units. In Proceedings of ACL-08: HLT, 165-168.

Reviewed conference proceeding  

Titeux, H., Riad, R., Cao, X., Hamilakis, N., Madden, K., Cristia, A., Bachoud-Levi, A. & Dupoux, E. (2020). Seshat: A tool for managing and verifying annotation campaigns of audio data. In LREC - 12th Language Resources and Evaluation Conference, Marseille, France.

Reviewed conference proceeding  

Thual, A., Dancette, C., Karadayi, J., Benjumea, J. & Dupoux, E. (2018). A K-nearest neighbours approach to unsupervised spoken term discovery. In IEEE Spoken Language Technology SLT-2018.

Reviewed conference proceeding  

Thiollière, R., Dunbar, E., Synnaeve, G., Versteegh, M. & Dupoux, E. (2015). A Hybrid Dynamic Time Warping-Deep Neural Network Architecture for Unsupervised Acoustic Modeling. In INTERSPEECH-2015, 3179-3183.

Reviewed conference proceeding  

Synnaeve, G., Versteegh, M. & Dupoux, E. (2014). Learning words from images and speech. In NIPS Workshop on Learning Semantics.

Reviewed conference proceeding  

Synnaeve, G. & Dupoux, E. (2016). A temporal coherence loss function for learning unsupervised acoustic embeddings. In SLTU-2016, Procedia Computer Science, Vol. 81, 95-100.

Reviewed conference proceeding  

Synnaeve, G., Schatz, T. & Dupoux, E. (2014). Phonetics embedding learning with side information. In IEEE Spoken Language Technology Workshop, 106-111. doi:10.1109/slt.2014.7078558

Reviewed conference proceeding  

Synnaeve, G., Dautriche, I., Boerschinger, B., Johnson, M. & Dupoux, E. (2014). Unsupervised word segmentation in context. In Proceedings of 25th International Conference on Computational Linguistics (CoLing), 2326-2334.

Reviewed conference proceeding  

Schatz, T., Turnbull, R., Bach, F. & Dupoux, E. (2017). A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability. In INTERSPEECH-2017.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Cao, X., Bach, F., Hermansky, H. & Dupoux, E. (2014). Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise. In INTERSPEECH-2014, 915-919.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Bach, F., Jansen, A., Hermansky, H. & Dupoux, E. (2013). Evaluating speech features with the Minimal-Pair ABX task: Analysis of the classical MFC/PLP pipeline. In INTERSPEECH-2013, 1781-1785.

Reviewed conference proceeding  

Rivière, M., Mazaré, P., Joulin, A. & Dupoux, E. (2020). Unsupervised pretraining transfers well across languages. In ICASSP-2020.

Reviewed conference proceeding  

Riad, R., Dancette, C., Karadayi, J., Zeghidour, N., Schatz, T. & Dupoux, E. (2018). Sampling strategies in Siamese Networks for unsupervised speech representation learning. In Interspeech 2018.

Reviewed conference proceeding  

Riad, R., Bachoud-Levi, A., Rudzicz, F. & Dupoux, E. (2020). Identification of primary and collateral tracks in stuttered speech. In Proceedings of The 12th Language Resources and Evaluation Conference (LREC), European Language Resources Association, 1681–1688.

Book review  

Pacherie, E. (2004). Looking for the Agent in Action. Trends in Cognitive Sciences, 8, 2, 54–55.

Reviewed conference proceeding  

Ondel, L., Godard, P., Besacier, L., Larsen, E., Hasegawa-Johnson, M., Scharenborg, O., Dupoux, E., Burget, L., Yvon, F. & Khudanpur, S. (2018). Bayesian Models For Unit Discovery On a Very Low Resource Language. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Reviewed conference proceeding  

Ogawa, T., Mallidi, S., Dupoux, E., Cohen, J., Feldman, N. & Hermansky, H. (2016). A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation. In ICPR.

Reviewed conference proceeding  

Minagawa-Kawai, Y., Naoi, N., Nishijima, N., Kojima, S. & Dupoux, E. (2007). Developmental changes in cerebral responses to native and non-native vowels: a NIRS study. In Proceedings of the International Congress of Phonetic Sciences XVI, Saarbrücken, 1877–1880.

Reviewed conference proceeding  

Michon, E., Dupoux, E. & Cristia, A. (2015). Salient dimensions in implicit phonotactic learning. In INTERSPEECH-2015, 2665-2669.

Reviewed conference proceeding  

Michel, P., Räsänen, O., Thiollière, R. & Dupoux, E. (2017). Blind phoneme segmentation with temporal prediction errors. In Proceedings of ACL: Student Research Workshop.

Reviewed conference proceeding  

Ludusan, B., Mazuka, R., Bernard, M., Cristia, A. & Dupoux, E. (2017). The Role of Prosody and Speech Register in Word Segmentation: A Computational Modelling Perspective. In ACL 2017, Vol. 2, 178-183. doi:10.18653/v1/P17-2028

Reviewed conference proceeding  

Ludusan, B. & Dupoux, E. (2016). Automatic syllable segmentation using broad phonetic class information. In SLTU-2016, Procedia Computer Science, Vol. 81, 101-106.

Reviewed conference proceeding  

Ludusan, B., Caranica, A., Cucu, H., Buzo, A., Burileanu, C. & Dupoux, E. (2015). Exploring multi-language resources for unsupervised spoken term discovery. In Speech Technology and Human-Computer Dialogue (SpeD), 2015 International Conference on, 1-6.

Reviewed conference proceeding  

Ludusan, B., Seidl, A., Dupoux, E. & Cristia, A. (2015). Motif discovery in infant- and adult-directed speech. In Proceedings of CogACLL2015, 93-102. doi:10.18653/v1/W15-2413