ENS - Ecole Normale Supérieure
Back to top

Publications

Reviewed conference proceeding  

Rivière, M. & Dupoux, E. (2021). Towards unsupervised learning of speech features in the wild. In 2021 IEEE Spoken Language Technology Workshop (SLT), 156-163.

Reviewed conference proceeding  

Schatz, T., Feldman, N., Goldwater, S., Cao, X. & Dupoux, E. (2021). Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input. In National Academy of Sciences (Eds.), Vol. 118: In Proceedings of the National Academy of Sciences, 7. doi:10.31234/osf.io/fc4wh

Reviewed conference proceeding  

Titeux, H., Riad, R., Cao, X., Hamilakis, N., Madden, K., Cristia, A., Bachoud-Levi, A. & Dupoux, E. (2020). Seshat: A tool for managing and verifying annotation campaigns of audio data. In LREC - 2th Language Resources and Evaluation Conference, Marseille, France.

Reviewed conference proceeding  

de Seyssel, M. & Dupoux, E. (2020). Does bilingual input hurt? A simulation of language discrimination and clustering using i-vectors. In CogSci 2020-42nd Annual Virtual Meeting of the Cognitive Science Society.

Reviewed conference proceeding  

Rivière, M., Mazaré, P., Joulin, A. & Dupoux, E. (2020). Unsupervised pretraining transfers well across languages. In IEEE (Eds.), In ICASSP-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). doi:10.1109/ICASSP40776.2020.9054548

Reviewed conference proceeding  

Riad, R., Bachoud-Levi, A., Rudzicz , F. & Dupoux, E. (2020). Identification of primary and collateral tracks in stuttered speech. , Vol. Proceedings of The 12th Language Resources and Evaluation Conference: In LREC, European Language Resources Association, 1681–1688.

Non-reviewed conference proceeding  

Kahn, J., Rivière, M., Zheng, W., Kharitonov, E., Xu, Q., Mazaré, P., Karadayi, J., Lipchinsky , V., Collobert, R., Fuegen, C., Likhomanenko, T., Synnaeve, G., Joulin, A., Mohamed, A. & Dupoux, E. (2020). Libri-Light: A Benchmark for ASR with Limited or No Supervision. In Acoustics Speech and Signal Processing (ICASSP) ICASSP 2020 - 2020 IEEE International Conference, 2020 IEEE International Conference , 7414-7418. doi:10.1109/ICASSP40776.2020.9052942

Non-reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Bouchacourt, F., Dupoux, E. & Baroni, M. (2020). Compositionality and Generalization in Emergent Languages. In Proceedings of ACL 2020 (58th Annual Meeting of the Association for Computational Linguistics), East Stroudsburg PA: ACL.

Non-reviewed conference proceeding  

Berthet, M., Benjumea, J., Millet, J., Cäsar, C., Zuberbühler, K. & Dunbar, E. (2020). Can emotional communication be semantically rich? The case of titi monkeys. In “An integrative approach to the study of language evolution: Re-drawing the boundaries of language”. Workshop at the Evolang XIII, Brussels, Belgium.

Reviewed conference proceeding  

Kharitonov, E., Rivière, M., Synnaeve, G., Wolf, L., Mazaré, P., Douze, M. & Dupoux, E. (2020). Data Augmenting Contrastive Learning of Speech Representations in the Time Domain. In IEEE (Eds.), In 2021 IEEE Spoken Language Technology Workshop (SLT, 215-222.

Reviewed conference proceeding  

Er Jiang, B., Dunbar, E., Sonderegger, M., Clayards, M. & Dupoux, E. (2020). Modelling Perceptual Effects of Phonology with ASR Systems. In CogSci 2020-42nd Annual Virtual Meeting of the Cognitive Science Society.

Reviewed conference proceeding  

Fourtassi, A. & Dupoux, E. (2019). Phoneme learning is influenced by the taxonomic organization of the semantic referents. In Cognitive Science Society (Eds.), In Proceedings of the Cognitive Science Conference, 323-324.

Reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Dupoux, E. & Baroni, M. (2019). Anti-efficient encoding in emergent communication. In NeuRIPS.

Reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Lazaric, A., Dupoux, E. & Baroni, M. (2019). Word-order biases in deep-agent emergent communication. In ACL 2019.

Reviewed conference proceeding  

Dunbar, E., Algayres, R., Karadayi, J., Bernard, M., Benjumea, J., Cao, X., Miskic, L., Dugrain, C., Ondel, L., Black, A., Besacier, L., Sakti, S. & Dupoux, E. (2019). The Zero Resource Speech Challenge 2019: TTS without T. In INTERSPEECH-2019.

Non-reviewed conference proceeding  

Wilcox, E. & Spector, B. (2019). The Role of Prior Beliefs in The Rational Speech Act Model of Pragmatics: Exhaustivity as a Case Study. In CogSci, 3099-3105.

Non-reviewed conference proceeding  

Anvari, A., Maldonado, M. & Soria Ruiz, A. (2019). The Puzzle of Reflexive Belief Construction in Spanish. In Espinal, M. T.; Castroviejo, E.; Leonetti, M.; McNally, L.; and Real-Puigdollers, C. (Eds.), In Proceedings of the 23rd Sinn und Bedeutung Conference, 57–74.

Non-reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2019). End-to-End Speech Recognition from the raw waveform. In Interspeech-2018.

Reviewed conference proceeding  

Thual, A., Dancette, C., Karadayi, J., Benjumea, J. & Dupoux, E. (2018). A K-nearest neighbours approach to unsupervised spoken term discovery. In EEE Spoken Language Technology SLT-2018.

Reviewed conference proceeding  

Holzenberger, N., Du, M., Karadayi, J., Riad, R. & Dupoux, E. (2018). Learning Word Embeddings: Unsupervised Methods for Fixed-size Representations of Variable-length Speech Segments. In Interspeech 2018. doi:10.21437/Interspeech.2018-2364

Reviewed conference proceeding  

Riad, R., Dancette, C., Karadayi, J., Zeghidour, N., Schatz, T. & Dupoux, E. (2018). Sampling strategies in Siamese Networks for unsupervised speech representation learning. In Interspeech 2018.

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Kokkinos, I., Schatz, T., Synnaeve, G. & Dupoux, E. (2018). Learning Filterbanks from Raw Speech for Phoneme Recognition. In ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing.

Reviewed conference proceeding  

Cao, X., Dakhlia, C., Del Carmen, P., Jaouani, M., Ould-Arbi, M. & Dupoux, E. (2018). Baby Cloud, a technological platform for parents and researchers. In LREC 2018 - 11th edition of the Language Resources and Evaluation Conference.

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2018). End-to-End Speech Recognition From the Raw Waveform. In Interspeech 2018. doi:10.21437/Interspeech.2018-2414

Reviewed conference proceeding  

Ondel, L., Godard, P., Besacier, L., Larsen, E., Hasegawa-Johnson, M., Scharenborg, O., Dupoux, E., Burget, L., Yvon, F. & Khudanpur, S. (2018). Bayesian Models For Unit Discovery On a Very Low Resource Language. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Reviewed conference proceeding  

Défossez, A., Zeghidour, N., Usunier, N., Bottou, L. & Bach, F. (2018). SING: Symbol-to-Instrument Neural Generator. In Conference on Neural Information Processing Systems (NIPS), Dec 2018, Montréal, Canada.

Reviewed conference proceeding  

Räsänen, O., Thiollière, R. & Dupoux, E. (2017). Blind phoneme segmentation with temporal prediction errors. In Proceedings of ACL: Student Research Workshop.

Reviewed conference proceeding  

Schatz, T., Turnbull, R., Bach, F. & Dupoux, E. (2017). A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability. In INTERSPEECH-2017.

Reviewed conference proceeding  

Larsen, E., Dupoux, E. & Cristia, A. (2017). Relating unsupervised word segmentation to reported vocabulary acquisition. In INTERSPEECH-2017, 2198-2202. doi:10.21437/Interspeech.2017-937

Reviewed conference proceeding  

Chaabouni, R., Dunbar, E., Zeghidour, N. & Dupoux, E. (2017). Learning weakly supervised multimodal phoneme embeddings. In INTERSPEECH-2017, 2218-2222. doi:10.21437/Interspeech.2017-1689