We release the USC Long Single-Speaker (LSS) dataset containing real-time MRI video of vocal tract dynamics and simultaneous audio obtained during speech production. This unique dataset contains roughly one hour of video and audio data from a single native speaker of American English, making it one of the longest publicly available single-speaker datasets of real-time MRI speech data. Along with the raw articulatory and acoustic data, we release derived representations suitable for a range of downstream tasks. These include video cropped to the vocal tract region, sentence-level splits of the data, restored and denoised audio, and region-of-interest time series. We also benchmark this dataset on articulatory synthesis and phoneme recognition tasks, providing baselines that future research can aim to improve upon.
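A region-of-interest time series is typically derived by averaging pixel intensity inside a fixed vocal-tract region for every video frame. The sketch below illustrates the general technique on synthetic data; the array shapes, mask placement, and function name are assumptions for illustration, not the dataset's actual processing pipeline.

```python
import numpy as np

def roi_timeseries(frames: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Mean pixel intensity inside `mask` for each frame.

    frames: (T, H, W) grayscale video; mask: (H, W) boolean region.
    Returns a (T,) time series, one value per frame.
    """
    if not mask.any():
        raise ValueError("mask selects no pixels")
    # Boolean indexing flattens the masked pixels per frame: (T, n_pixels).
    return frames[:, mask].mean(axis=1)

# Synthetic example: 5 frames of 8x8 video, ROI in the top-left quadrant.
rng = np.random.default_rng(0)
frames = rng.random((5, 8, 8))
mask = np.zeros((8, 8), dtype=bool)
mask[:4, :4] = True  # hypothetical region, e.g. near the velum
ts = roi_timeseries(frames, mask)
```

In practice, one such scalar trace per anatomically defined region (lips, tongue tip, velum, and so on) yields a compact articulatory representation of the video.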