A dataset containing formant values, amplitude, articulation rate, and following segment data for 10 New Zealand English (NZE) monophthongs, along with participant demographics.
qb_vowels
A data frame with 26331 rows and 14 variables:
Anonymised speaker code (char).
Wells lexical sets for 10 NZE monophthongs. Values: DRESS, FLEECE, GOOSE, KIT, LOT, NURSE, START, STRUT, THOUGHT, TRAP, FOOT (char).
First formant in Hz, extracted from vowel mid-point using LaBB-CAT interface with Praat.
Second formant in Hz, extracted from vowel mid-point using LaBB-CAT interface with Praat.
Age category of speaker. Values: 18-25, 26-35, 36-45, ..., 76-85 (char).
Gender of participant. Values: M, F (char).
New Zealand ethnic category of participant. Values: NZ mixed ethnicity, NZ European, Other (char).
CELEX frequency of word from which vowel token is taken.
Anonymised word id (char).
Time in seconds at which vowel segment starts.
Length of vowel in seconds.
Articulation rate of utterance from which token is taken.
Category of following segment. NB: tokens followed by liquids have already been removed. Levels: labial, velar, other (factor).
Maximum amplitude of word from which vowel token is taken, generated by LaBB-CAT interface with Praat.
Original data was generated for Wilson Black et al. (2023).
Wilson Black, Joshua, Jennifer Hay, Lynn Clark & James Brand (2023): The overlooked effect of amplitude on within-speaker vowel variation. Linguistics Vanguard. Walter de Gruyter GmbH. 9(1). 173–189. doi:10.1515/lingvan-2022-0086