ENS - Ecole Normale Supérieure
Back to top

Publications

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Kokkinos, I., Schatz, T., Synnaeve, G. & Dupoux, E. (2018). Learning Filterbanks from Raw Speech for Phoneme Recognition. In ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing.

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2018). End-to-End Speech Recognition From the Raw Waveform. In Interspeech 2018. doi:10.21437/Interspeech.2018-2414

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Usunier, N. & Dupoux, E. (2016). Joint Learning of Speaker and Phonetic Similarities with Siamese Networks. In INTERSPEECH-2016, 1295-1299.

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Versteegh, M. & Dupoux, E. (2016 ). A Deep Scattering Spectrum - Deep Siamese Network Pipeline For Unsupervised Acoustic Modeling. In ICASSP-2016, 4965-4969.

Reviewed conference proceeding  

Versteegh, M., Anguera, X., Jansen, A. & Dupoux, E. (2016). The Zero Resource Speech Challenge 2015: Proposed Approaches and Results. , Vol. 81: In SLTU-2016 Procedia Computer Science, 67-72.

Reviewed conference proceeding  

Versteegh, M., Thiollière, R., Schatz, T., Cao, X., Anguera, X., Jansen, A. & Dupoux, E. (2015). The Zero Resource Speech Challenge 2015. In INTERSPEECH-2015, 3169-3173.

Reviewed conference proceeding  

Varadarajan, B., Khudanpur, S. & Dupoux, E. (2008 ). Unsupervised Learning of Acoustic Subword Units. In Proceedings of ACL-08: HLT, 165-168.

Reviewed conference proceeding  

Titeux, H., Riad, R., Cao, X., Hamilakis, N., Madden, K., Cristia, A., Bachoud-Levi, A. & Dupoux, E. (2020). Seshat: A tool for managing and verifying annotation campaigns of audio data. In LREC - 2th Language Resources and Evaluation Conference, Marseille, France.

Reviewed conference proceeding  

Thual, A., Dancette, C., Karadayi, J., Benjumea, J. & Dupoux, E. (2018). A K-nearest neighbours approach to unsupervised spoken term discovery. In EEE Spoken Language Technology SLT-2018.

Reviewed conference proceeding  

Thiollière, R., Dunbar, E., Synnaeve, G., Versteegh, M. & Dupoux, E. (2015 ). A Hybrid Dynamic Time Warping-Deep Neural Network Architecture for Unsupervised Acoustic Modeling. In INTERSPEECH-2015, 3179-3183.

Reviewed conference proceeding  

Synnaeve, G., Versteegh, M. & Dupoux, E. (2014). Learning words from images and speech. In NIPS Workshop on Learning Semantics.

Reviewed conference proceeding  

Synnaeve, G. & Dupoux, E. (2016). A temporal coherence loss function for learning unsupervised acoustic embeddings. , Vol. 81: In SLTU-2016 Procedia Computer Science, 95-100.

Reviewed conference proceeding  

Synnaeve, G., Schatz, T. & Dupoux, E. (2014 ). Phonetics embedding learning with side information. In IEEE Spoken Language Technology Workshop, 106 - 111. doi:10.1109/slt.2014.7078558

Reviewed conference proceeding  

Synnaeve, G., Dautriche, I., Boerschinger, B., Johnson, M. & Dupoux, E. (2014). Unsupervised word segmentation in context. In Proceedings of 25th International Conference on Computational Linguistics (CoLing), 2326-2334.

Reviewed conference proceeding  

Schatz, T., Turnbull, R., Bach, F. & Dupoux, E. (2017). A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability. In INTERSPEECH-2017.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Cao, X., Bach, F., Hynek, H. & Dupoux, E. (2014 ). Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise. In INTERSPEECH-2014, 915-919.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Bach, F., Jansen, A., Hynek, H. & Dupoux, E. (2013 ). Evaluating speech features with the Minimal-Pair ABX task: Analysis of the classical MFC/PLP pipeline. In INTERSPEECH-2013, 1781-1785.

Reviewed conference proceeding  

Schatz, T., Feldman, N., Goldwater, S., Cao, X. & Dupoux, E. (2021). Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input. In National Academy of Sciences (Eds.), Vol. 118: In Proceedings of the National Academy of Sciences, 7. doi:10.31234/osf.io/fc4wh

Reviewed conference proceeding  

Rivière, M., Mazaré, P., Joulin, A. & Dupoux, E. (2020). Unsupervised pretraining transfers well across languages. In IEEE (Eds.), In ICASSP-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). doi:10.1109/ICASSP40776.2020.9054548

Reviewed conference proceeding  

Rivière, M. & Dupoux, E. (2021). Towards unsupervised learning of speech features in the wild. In 2021 IEEE Spoken Language Technology Workshop (SLT), 156-163.

Reviewed conference proceeding  

Riad, R., Dancette, C., Karadayi, J., Zeghidour, N., Schatz, T. & Dupoux, E. (2018). Sampling strategies in Siamese Networks for unsupervised speech representation learning. In Interspeech 2018.

Reviewed conference proceeding  

Riad, R., Bachoud-Levi, A., Rudzicz , F. & Dupoux, E. (2020). Identification of primary and collateral tracks in stuttered speech. , Vol. Proceedings of The 12th Language Resources and Evaluation Conference: In LREC, European Language Resources Association, 1681–1688.

Reviewed conference proceeding  

Ondel, L., Godard, P., Besacier, L., Larsen, E., Hasegawa-Johnson, M., Scharenborg, O., Dupoux, E., Burget, L., Yvon, F. & Khudanpur, S. (2018). Bayesian Models For Unit Discovery On a Very Low Resource Language. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Reviewed conference proceeding  

Ogawa, T., Mallidi, S. , Dupoux, E., Cohen, J., Feldman, N. & Hermansky, H. (2016). A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation. In ICPR.

Reviewed conference proceeding  

Minagawa-Kawai, Y., Naoi, N., Nishijima, N., Kojima, S. & Dupoux, E. (2007). Developmental changes in cerebral responses to native and non-native vowels: a NIRS study. In Proceedings of the International Conference of Phonetic Sciences XVI, Saarbrucken, 1877–1880.

Reviewed conference proceeding  

Michon, E., Dupoux, E. & Cristia, A. (2015). Salient dimensions in implicit phonotactic learning. In INTERSPEECH-2015, 2665-2669.

Reviewed conference proceeding  

Michel, P., Räsänen, O., Thiollière, R. & Dupoux, E. (2017). Blind phoneme segmentation with temporal prediction errors. In Proceedings of ACL: Student Research Workshop.

Reviewed conference proceeding  

Ludusan, B., Mazuka, R., Bernard, M., Cristia, A. & Dupoux, E. (2017). The Role of Prosody and Speech Register in Word Segmentation: A Computational Modelling Perspective. , Vol. 2: In ACL 2017, 178-183. doi:10.18653/v1/P17-2028

Reviewed conference proceeding  

Ludusan, B. & Dupoux, E. (2016). Automatic syllable segmentation using broad phonetic class information. , Vol. 81: In SLTU-2016 Procedia Computer Science, 101-106.

Reviewed conference proceeding  

Ludusan, B., Caranica, A., Cucu, H., Buzo, A., Burileanu, C. & Dupoux, E. (2015 ). Exploring multi-language resources for unsupervised spoken term discovery. In Speech Technology and Human-Computer Dialogue (SpeD), 2015 International Conference on, 1-6.