ENS - Ecole Normale Supérieure
Back to top

Publications

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2018). End-to-End Speech Recognition From the Raw Waveform. In Interspeech 2018. doi:10.21437/Interspeech.2018-2414

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Kokkinos, I., Schatz, T., Synnaeve, G. & Dupoux, E. (2018). Learning Filterbanks from Raw Speech for Phoneme Recognition. In ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing.

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Usunier, N. & Dupoux, E. (2016). Joint Learning of Speaker and Phonetic Similarities with Siamese Networks. In INTERSPEECH-2016, 1295-1299.

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Versteegh, M. & Dupoux, E. (2016 ). A Deep Scattering Spectrum - Deep Siamese Network Pipeline For Unsupervised Acoustic Modeling. In ICASSP-2016, 4965-4969.

Reviewed conference proceeding  

Versteegh, M., Anguera, X., Jansen, A. & Dupoux, E. (2016). The Zero Resource Speech Challenge 2015: Proposed Approaches and Results. , Vol. 81: In SLTU-2016 Procedia Computer Science, 67-72.

Reviewed conference proceeding  

Versteegh, M., Thiollière, R., Schatz, T., Cao, X., Anguera, X., Jansen, A. & Dupoux, E. (2015). The Zero Resource Speech Challenge 2015. In INTERSPEECH-2015, 3169-3173.

Reviewed conference proceeding  

Varadarajan, B., Khudanpur, S. & Dupoux, E. (2008 ). Unsupervised Learning of Acoustic Subword Units. In Proceedings of ACL-08: HLT, 165-168.

Reviewed conference proceeding  

Turnbull, R. & Peperkamp, S. (2019). Across-language priming in bilinguals: does English bet prime French bête? In S. Calhoun, P. Escudero, M. Tabain & P. Warren (Eds.), In Proceedings of the 19th International Congress of Phonetic Sciences, Canberra, Australia, Melbourne, Australia, 1367-1371.

Reviewed conference proceeding  

Turnbull, R. & Peperkamp, S. (2017). What governs a language's lexicon? Determining the organizing principles of phonological neighbourhood networks. In Complex Networks & Their Applications V. Proceedings of the 5th International Workshop on Complex Networks and their Applications, 83-94. doi:10.1007/978-3-319-50901-3_7

Reviewed conference proceeding  

Titeux, H., Riad, R., Cao, X., Hamilakis, N., Madden, K., Cristia, A., Bachoud-Levi, A. & Dupoux, E. (2020). Seshat: A tool for managing and verifying annotation campaigns of audio data. In LREC - 2th Language Resources and Evaluation Conference, Marseille, France.

Reviewed conference proceeding  

Thual, A., Dancette, C., Karadayi, J., Benjumea, J. & Dupoux, E. (2018). A K-nearest neighbours approach to unsupervised spoken term discovery. In EEE Spoken Language Technology SLT-2018.

Reviewed conference proceeding  

Thiollière, R., Dunbar, E., Synnaeve, G., Versteegh, M. & Dupoux, E. (2015 ). A Hybrid Dynamic Time Warping-Deep Neural Network Architecture for Unsupervised Acoustic Modeling. In INTERSPEECH-2015, 3179-3183.

Reviewed conference proceeding  

Synnaeve, G. & Dupoux, E. (2016). A temporal coherence loss function for learning unsupervised acoustic embeddings. , Vol. 81: In SLTU-2016 Procedia Computer Science, 95-100.

Reviewed conference proceeding  

Synnaeve, G., Dautriche, I., Boerschinger, B., Johnson, M. & Dupoux, E. (2014). Unsupervised word segmentation in context. In Proceedings of 25th International Conference on Computational Linguistics (CoLing), 2326-2334.

Reviewed conference proceeding  

Synnaeve, G., Versteegh, M. & Dupoux, E. (2014). Learning words from images and speech. In NIPS Workshop on Learning Semantics.

Reviewed conference proceeding  

Synnaeve, G., Schatz, T. & Dupoux, E. (2014 ). Phonetics embedding learning with side information. In IEEE Spoken Language Technology Workshop, 106 - 111. doi:10.1109/slt.2014.7078558

Reviewed conference proceeding  

Schlenker, P., Chemla, E., Arnold, K., Lemasson, A., Ouattara, K., Keenan, S., Stephan, C., Ryder, R. & Zuberbühler, K. (2013). Towards a formal analysis of primate alarm calls. In 23rd Semantics and Linguistics Theory Conference.

Reviewed conference proceeding  

Schlenker, P., Chemla, E., Arnold, K., Lemasson, A., Ouattara, K., Keenan, S., Stephan, C., Ryder, R. & Zuberbühler, K. (2013). Dialectal variation in the meanings of Campbell’s monkey alarm calls. In XIXth International Congress of Linguists (ICL19)-Workshop" Language variation at the interface of psycholinguistics and sociolinguistics".

Reviewed conference proceeding  

Schatz, T., Feldman, N., Goldwater, S., Cao, X. & Dupoux, E. (2021). Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input. In National Academy of Sciences (Eds.), Vol. 118: In Proceedings of the National Academy of Sciences, 7. doi:10.31234/osf.io/fc4wh

Reviewed conference proceeding  

Schatz, T., Turnbull, R., Bach, F. & Dupoux, E. (2017). A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability. In INTERSPEECH-2017.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Cao, X., Bach, F., Hynek, H. & Dupoux, E. (2014 ). Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise. In INTERSPEECH-2014, 915-919.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Bach, F., Jansen, A., Hynek, H. & Dupoux, E. (2013 ). Evaluating speech features with the Minimal-Pair ABX task: Analysis of the classical MFC/PLP pipeline. In INTERSPEECH-2013, 1781-1785.

Reviewed conference proceeding  

Scharenborg, O., Besacier, L., Black, A., Hasegawa-Johnson, M., Metze, F., Neubig, G., Stuker, S., Godard, P., Muller, M., Ondel, L., Palaskar, S., Arthur, P., Ciannella, F., Du, M., Larsen, E., Merkx, D., Riad, R., Wang, L. & Dupoux, E. (2018). Linguistic Unit Discovery From Multi-Modal Inputs In Unwritten Languages: Summary Of The “Speaking Rosetta” Jsalt 2017 Workshop. In Jsalt 2017 Workshop.

Reviewed conference proceeding  

Rivière, M. & Dupoux, E. (2021). Towards unsupervised learning of speech features in the wild. In 2021 IEEE Spoken Language Technology Workshop (SLT), 156-163.

Reviewed conference proceeding  

Rivière, M., Mazaré, P., Joulin, A. & Dupoux, E. (2020). Unsupervised pretraining transfers well across languages. In IEEE (Eds.), In ICASSP-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). doi:10.1109/ICASSP40776.2020.9054548

Reviewed conference proceeding  

Riad, R., Bachoud-Levi, A., Rudzicz , F. & Dupoux, E. (2020). Identification of primary and collateral tracks in stuttered speech. , Vol. Proceedings of The 12th Language Resources and Evaluation Conference: In LREC, European Language Resources Association, 1681–1688.

Reviewed conference proceeding  

Riad, R., Dancette, C., Karadayi, J., Zeghidour, N., Schatz, T. & Dupoux, E. (2018). Sampling strategies in Siamese Networks for unsupervised speech representation learning. In Interspeech 2018.

Reviewed conference proceeding  

Peperkamp, S., Hegde , M. & Carbajal, J. (2019). Liquid deletion in French child-directed speech. In Proceedings of INTERSPEECH, Graz, Austria, ISCA, 3574-3578.

Reviewed conference proceeding  

Peperkamp, S. & Iturralde Zurita, A. (2019). Compensation for French liquid deletion during auditory sentence processing. In Proceedings of INTERSPEECH, Graz, Austria, ISCA, 1951-1955.

Reviewed conference proceeding  

Peperkamp, S. & Bouchon, C. (2011). The relation between perception and production in L2 phonological processing. In Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), 168-171.