SRPP: Studying speech rate cross-linguistically: Resource building and case studies on final lengthening and pause probabilities

Frank Seifart (Leibniz-Zentrum Allgemeine Sprachwissenschaft/ZAS, Berlin)

04 June 2021, 14h00-15h30

In the first part of this talk, I will introduce DoReCo, an initiative to create a multilingual reference corpus consisting of at least 10,000 words for each of at least 50 languages. DoReCo extracts narrative texts from fieldwork-based language documentation collections; these texts are already transcribed, translated into a major language, and morphologically analyzed. Within DoReCo, we convert these data to a common file format and time-align them at the phoneme level using the MAUS software. In the second part of this talk, I will present two cross-linguistic studies on a subset of this corpus. One study investigates word lengthening as a function of utterance-final position. Another, still ongoing, study investigates pause probabilities before nouns vs. verbs and relates the findings to the fact that, typologically, there are fewer prefixes on nouns than on verbs.
