ENS - Ecole Normale Supérieure

Publications

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Kokkinos, I., Schatz, T., Synnaeve, G. & Dupoux, E. (2018). Learning Filterbanks from Raw Speech for Phoneme Recognition. In ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing.

Reviewed conference proceeding  

Défossez, A., Zeghidour, N., Usunier, N., Bottou, L. & Bach, F. (2018). SING: Symbol-to-Instrument Neural Generator. In Conference on Neural Information Processing Systems (NIPS), Dec 2018, Montréal, Canada.

Non-reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2019). End-to-End Speech Recognition from the raw waveform. In Interspeech-2018.

Non-reviewed conference proceeding  

Zuk, N., Di Liberto, G. & Lalor, E. (2019). Linear-nonlinear Bernoulli modeling for quantifying temporal coding of phonemes in brain responses to continuous speech. In 2019 Conference on Cognitive Computational Neuroscience, Berlin, Germany.

Reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Dupoux, E. & Baroni, M. (2019). Anti-efficient encoding in emergent communication. In NeurIPS.

Reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Lazaric, A., Dupoux, E. & Baroni, M. (2019). Word-order biases in deep-agent emergent communication. In ACL 2019.

Reviewed conference proceeding  

Dunbar, E., Algayres, R., Karadayi, J., Bernard, M., Benjumea, J., Cao, X., Miskic, L., Dugrain, C., Ondel, L., Black, A., Besacier, L., Sakti, S. & Dupoux, E. (2019). The Zero Resource Speech Challenge 2019: TTS without T. In INTERSPEECH-2019.

Reviewed conference proceeding  

Fourtassi, A. & Dupoux, E. (2019). Phoneme learning is influenced by the taxonomic organization of the semantic referents. In Cognitive Science Society (Eds.), Proceedings of the Cognitive Science Conference, 323-324.

Non-reviewed conference proceeding  

Zakharov, D., Dogonasheva, O. & Gutkin, B. (2020). Role of Pyramidal Cell M-current in Weak Pyramidal/Interneuronal Gamma Cluster Formation. In 2020 4th Scientific School on Dynamics of Complex Networks and their Application in Intellectual Robotics (DCNAIR), Innopolis, Russia, IEEE. doi:10.1109/DCNAIR50402.2020.9216942

Non-reviewed conference proceeding  

Thoret, E., Andrillon, T., Gauriau, C., Léger, D. & Pressnitzer, D. (2020). Sleep deprivation impacts speech spectro-temporal modulations. In e-FA2020 (e-Forum Acusticum 2020), Lyon, France.

Reviewed conference proceeding  

Er Jiang, B., Dunbar, E., Sonderegger, M., Clayards, M. & Dupoux, E. (2020). Modelling Perceptual Effects of Phonology with ASR Systems. In CogSci 2020-42nd Annual Virtual Meeting of the Cognitive Science Society.

Reviewed conference proceeding  

Rivière, M., Mazaré, P., Joulin, A. & Dupoux, E. (2020). Unsupervised pretraining transfers well across languages. In IEEE (Eds.), ICASSP-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). doi:10.1109/ICASSP40776.2020.9054548

Non-reviewed conference proceeding  

Chaabouni, R., Kharitonov, E., Bouchacourt, F., Dupoux, E. & Baroni, M. (2020). Compositionality and Generalization in Emergent Languages. In Proceedings of ACL 2020 (58th Annual Meeting of the Association for Computational Linguistics), East Stroudsburg PA: ACL.

Non-reviewed conference proceeding  

Kahn, J., Rivière, M., Zheng, W., Kharitonov, E., Xu, Q., Mazaré, P., Karadayi, J., Lipchinsky, V., Collobert, R., Fuegen, C., Likhomanenko, T., Synnaeve, G., Joulin, A., Mohamed, A. & Dupoux, E. (2020). Libri-Light: A Benchmark for ASR with Limited or No Supervision. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7414-7418. doi:10.1109/ICASSP40776.2020.9052942

Reviewed conference proceeding  

Riad, R., Bachoud-Levi, A., Rudzicz, F. & Dupoux, E. (2020). Identification of primary and collateral tracks in stuttered speech. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC), European Language Resources Association, 1681–1688.

Reviewed conference proceeding  

Titeux, H., Riad, R., Cao, X., Hamilakis, N., Madden, K., Cristia, A., Bachoud-Levi, A. & Dupoux, E. (2020). Seshat: A tool for managing and verifying annotation campaigns of audio data. In LREC - 12th Language Resources and Evaluation Conference, Marseille, France.

Reviewed conference proceeding  

de Seyssel, M. & Dupoux, E. (2020). Does bilingual input hurt? A simulation of language discrimination and clustering using i-vectors. In CogSci 2020-42nd Annual Virtual Meeting of the Cognitive Science Society.

Non-reviewed conference proceeding  

Zakharov, D., Dogonasheva, O. & Gutkin, B. (2021). Bistability of globally synchronous and chimera states in a ring of phase oscillators coupled by a cosine kernel. In 2021 5th Scientific School Dynamics of Complex Networks and their Applications (DCNA), 211-214. doi:10.1109/DCNA53427.2021.9586968

Non-reviewed conference proceeding  

Dogonasheva, O., Gutkin, B. & Zakharov, D. (2021). Calculation of travelling chimera speeds for dynamical systems with ring topologies. In 5th Scientific School Dynamics of Complex Networks and their Applications (DCNA), 61-64. doi:10.1109/DCNA53427.2021.9586903

Reviewed conference proceeding  

Kharitonov, E., Rivière, M., Synnaeve, G., Wolf, L., Mazaré, P., Douze, M. & Dupoux, E. (2021). Data Augmenting Contrastive Learning of Speech Representations in the Time Domain. In IEEE (Eds.), 2021 IEEE Spoken Language Technology Workshop (SLT), 215-222.

Reviewed conference proceeding  

Riochet, R., Castro, M., Bernard, M., Lerer, A., Fergus, R., Izard, V. & Dupoux, E. (2021). Intphys 2019: A benchmark for visual intuitive physics understanding. In IEEE Transactions on Pattern Analysis and Machine Intelligence. doi:10.1109/TPAMI.2021.3083839

Reviewed conference proceeding  

Graves, J., Egré, P., Pressnitzer, D. & de Gardelle, V. (2021). An implicit representation of stimulus ambiguity in pupil size. In Proceedings of the National Academy of Sciences, Vol. 18, e2107997118. doi:10.1073/pnas.2107997118

Non-reviewed conference proceeding  

Caucheteux, C., Gramfort, A. & King, J. (2021). Disentangling syntax and semantics in the brain with deep networks. In International Conference on Machine Learning, PMLR, Vol. 139, 1336-1348.

Reviewed conference proceeding  

Schatz, T., Feldman, N., Goldwater, S., Cao, X. & Dupoux, E. (2021). Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input. In Proceedings of the National Academy of Sciences, Vol. 118, 7. doi:10.31234/osf.io/fc4wh

Reviewed conference proceeding  

Rivière, M. & Dupoux, E. (2021). Towards unsupervised learning of speech features in the wild. In 2021 IEEE Spoken Language Technology Workshop (SLT), 156-163.

Reviewed conference proceeding  

Dunbar, E., Bernard, M., Hamilakis, N., Nguyen, T., de Seyssel, M., Rozé, P., Rivière, M., Kharitonov, E. & Dupoux, E. (2021). The Zero Resource Speech Challenge 2021: Spoken language modelling. In Conference of the International Speech Communication Association, Brno, Czech Republic. doi:10.1109/TPAMI.2021.3083839

Non-reviewed conference proceeding  

Riochet, R., Ynocente Castro, M., Bernard, M., Lerer, A., Fergus, R., Izard, V. & Dupoux, E. (2022). IntPhys 2019: A Benchmark for Visual Intuitive Physics Understanding. In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 44, 5016-5025. doi:10.1109/TPAMI.2021.3083839

Non-reviewed conference proceeding  

Riad, R., Titeux, H., Lemoine, L., Montillot, J., Sliwinski, A., Hamet Bagnou, J., Cao, X., Bachoud-Levi, A. & Dupoux, E. (2022). A comparison study on patient-psychologist voice diarization. In Ninth Workshop on Speech and Language Processing for Assistive Technologies (SLPAT-2022), Dublin, Ireland: Association for Computational Linguistics, 30-36. doi:10.18653/v1/2022.slpat-1.4

Reviewed conference proceeding  

Siriwardena, Y. M., Marion, G. & Shamma, S. (2022). The Mirrornet: Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 946-950. doi:10.1109/ICASSP43922.2022.9747358