ENS - Ecole Normale Supérieure
Back to top

Publications

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2018). End-to-End Speech Recognition From the Raw Waveform. In Interspeech 2018. doi:10.21437/Interspeech.2018-2414

Reviewed conference proceeding  

Cao, X., Dakhlia, C., Del Carmen, P., Jaouani, M., Ould-Arbi, M. & Dupoux, E. (2018). Baby Cloud, a technological platform for parents and researchers. In LREC 2018 - 11th edition of the Language Resources and Evaluation Conference.

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Kokkinos, I., Schatz, T., Synnaeve, G. & Dupoux, E. (2018). Learning Filterbanks from Raw Speech for Phoneme Recognition. In ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing.

Reviewed conference proceeding  

Défossez, A., Zeghidour, N., Usunier, N., Bottou, L. & Bach, F. (2018). SING: Symbol-to-Instrument Neural Generator. In Conference on Neural Information Processing Systems (NIPS), Dec 2018, Montréal, Canada.

International Journal article  

Bouton, S., Chambon, V., Tyrand, R., Guggisberg, A., Seeck, M., Karkar, S., Van De Ville, D. & Giraud, A. (2018). Focal versus distributed temporal cortex activity for speech sound category assignment. Proceedings of the National Academy of Sciences of the United States of America, . doi:10.1073/pnas.1714279115

International Journal article  

Dupoux, E. (2018). Cognitive Science in the era of Artificial Intelligence: A roadmap for reverse-engineering the infant language-learner. Cognition, 173, 43-59. doi:10.1016/j.cognition.2017.11.008

Non-reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2019). End-to-End Speech Recognition from the raw waveform. In Interspeech-2018.

Reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Dupoux, E. & Baroni, M. (2019). Anti-efficient encoding in emergent communication. In NeuRIPS.

International Journal article  

Jacquemot, C., Lalanne, C., Sliwinski, A., Piccinini, P., Dupoux, E. & Bachoud-Levi, A. (2019). Improving language evaluation in neurological disorders: The French Core Assessment of Language Processing (CALAP). Psychological assessment, 31(5), 622-630. doi:10.1037/pas0000683

International Journal article  

Cristia, A., Dupoux, E., Gurven, M. & Stieglitz, J. (2019). Child-Directed Speech Is Infrequent in a Forager-Farmer Population: A Time Allocation Study. Child development, 90(3), 759-773. doi:10.1111/cdev.12974

Reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Lazaric, A., Dupoux, E. & Baroni, M. (2019). Word-order biases in deep-agent emergent communication. In ACL 2019.

International Journal article  

Cristia, A., Dupoux, E., Ratner, N. & Soderstrom, M. (2019). Segmentability Differences Between Child-Directed and Adult-Directed Speech: A Systematic Test With an Ecologically Valid Corpus. Open mind : discoveries in cognitive science, 3, 13-22. doi:10.1162/opmi_a_00022

Reviewed conference proceeding  

Dunbar, E., Algayres, R., Karadayi, J., Bernard, M., Benjumea, J., Cao, X., Miskic, L., Dugrain, C., Ondel, L., Black, A., Besacier, L., Sakti, S. & Dupoux, E. (2019). The Zero Resource Speech Challenge 2019: TTS without T. In INTERSPEECH-2019.

Reviewed conference proceeding  

Fourtassi, A. & Dupoux, E. (2019). Phoneme learning is influenced by the taxonomic organization of the semantic referents. In Cognitive Science Society (Eds.), In Proceedings of the Cognitive Science Conference, 323-324.

Reviewed conference proceeding  

Er Jiang, B., Dunbar, E., Sonderegger, M., Clayards, M. & Dupoux, E. (2020). Modelling Perceptual Effects of Phonology with ASR Systems. In CogSci 2020-42nd Annual Virtual Meeting of the Cognitive Science Society.

Reviewed conference proceeding  

Rivière, M., Mazaré, P., Joulin, A. & Dupoux, E. (2020). Unsupervised pretraining transfers well across languages. In IEEE (Eds.), In ICASSP-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). doi:10.1109/ICASSP40776.2020.9054548

International Journal article  

Bernard, M., Thiollière, R., Saksida, A., Loukatou, G., Larsen, E., Johnson, M., Fibla, L., Dupoux, E., Daland, R., Cao, X. & Cristia, A. (2020). WordSeg: Standardizing unsupervised word form segmentation from text. Behavior research methods, 52(1), 264-278. doi:10.3758/s13428-019-01223-3

Non-reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Bouchacourt, F., Dupoux, E. & Baroni, M. (2020). Compositionality and Generalization in Emergent Languages. In Proceedings of ACL 2020 (58th Annual Meeting of the Association for Computational Linguistics), East Stroudsburg PA: ACL.

Non-reviewed conference proceeding  

Kahn, J., Rivière, M., Zheng, W., Kharitonov, E., Xu, Q., Mazaré, P., Karadayi, J., Lipchinsky , V., Collobert, R., Fuegen, C., Likhomanenko, T., Synnaeve, G., Joulin, A., Mohamed, A. & Dupoux, E. (2020). Libri-Light: A Benchmark for ASR with Limited or No Supervision. In Acoustics Speech and Signal Processing (ICASSP) ICASSP 2020 - 2020 IEEE International Conference, 2020 IEEE International Conference , 7414-7418. doi:10.1109/ICASSP40776.2020.9052942

Reviewed conference proceeding  

Riad, R., Bachoud-Levi, A., Rudzicz , F. & Dupoux, E. (2020). Identification of primary and collateral tracks in stuttered speech. , Vol. Proceedings of The 12th Language Resources and Evaluation Conference: In LREC, European Language Resources Association, 1681–1688.

Reviewed conference proceeding  

Titeux, H., Riad, R., Cao, X., Hamilakis, N., Madden, K., Cristia, A., Bachoud-Levi, A. & Dupoux, E. (2020). Seshat: A tool for managing and verifying annotation campaigns of audio data. In LREC - 2th Language Resources and Evaluation Conference, Marseille, France.

Reviewed conference proceeding  

de Seyssel, M. & Dupoux, E. (2020). Does bilingual input hurt? A simulation of language discrimination and clustering using i-vectors. In CogSci 2020-42nd Annual Virtual Meeting of the Cognitive Science Society.

Reviewed conference proceeding  

Kharitonov, E., Rivière, M., Synnaeve, G., Wolf, L., Mazaré, P., Douze, M. & Dupoux, E. (2021). Data Augmenting Contrastive Learning of Speech Representations in the Time Domain. In IEEE (Eds.), In 2021 IEEE Spoken Language Technology Workshop (SLT, 215-222.

Reviewed conference proceeding  

Riochet, R., Castro, M. , Bernard, M., Lerer, A., Fergus, R., Izard, V. & Dupoux, E. (2021). Intphys 2019: A benchmark for visual intuitive physics understanding. In IEEE Transactions on Pattern Analysis and Machine Intelligence. doi:10.1109/TPAMI.2021.3083839

International Journal article  

Riad, R., Karadayi, J., Bachoud-Levi, A. & Dupoux, E. (2021). Learning spectro-temporal representations of complex sounds with parameterized neural networks. The Journal of the Acoustical Society of America, 150, 353. doi:10.1121/10.000548210.1121/10.0005482

International Journal article  

Lakhotia, K. , Kharitonov, E., Hsu, W. , Adi, Y. , Polyak, A. , Bolte, B. , Nguyen, T. , Copet, J. , Baevski, A., Mohamed, A. & Dupoux, E. (2021). On Generative Spoken Language Modeling from Raw Audio. Transactions of the Association for Computational Linguistics, 9, 1336-1354. doi:10.1162/tacl_a_00430

International Journal article  

Ludusan, B., Morii, M., Minagawa, Y. & Dupoux, E. (2021). The effect of different information sources on prosodic boundary perception. JASA Express Letters , 1(115203 ). doi:10.1121/10.0007150

Reviewed conference proceeding  

Schatz, T., Feldman, N., Goldwater, S., Cao, X. & Dupoux, E. (2021). Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input. In National Academy of Sciences (Eds.), Vol. 118: In Proceedings of the National Academy of Sciences, 7. doi:10.31234/osf.io/fc4wh

Reviewed conference proceeding  

Rivière, M. & Dupoux, E. (2021). Towards unsupervised learning of speech features in the wild. In 2021 IEEE Spoken Language Technology Workshop (SLT), 156-163.

International Journal article  

Ludusan, B., Mazuka, R. & Dupoux, E. (2021). Does infant-directed speech help phonetic learning? A machine learning investigation. Cognitive Science, 45(5), e12946. doi:10.1111/cogs.12946