From videos to TextGrids
2024-03-28
Phonetic
Extraction and
Alignment of
Subtitled
YouTube
Videos
#! /bin/sh
SPPAS (Bigi 2012; Bigi and Hirst 2012)
P2FA (Yuan and Liberman 2008)
PRAAT (Boersma and Weenink 2019)
R (R Core Team 2023)
yt-dlp (yt-dlp 2022)
ffmpeg (Developers 2021)
the Longman Pronunciation Dictionary (Wells 2008)
2 aligners are better than just one
Step 2 prevents cascading alignment errors
Added values:
Data aligned by SPPAS.
Nb of TextGrids: 453
Total length of the videos: 172:39:22
Data on monophthongs.
References: Deterding (1997)
References: Hillenbrand et al. (1995)
Next are the formant tracks for monophthongs.
Data on diphthongs.
Next are the formant tracks for diphthongs.
Data aligned by P2FA.
Data on monophthongs.
References: Deterding (1997)
References: Hillenbrand et al. (1995)
Next are the formant tracks for monophthongs.
Data on diphthongs.
Next are the formant tracks for diphthongs.
Let’s now compare the data obtained with the two aligners.
but potential issues with Wells (2008)
create interactive website
upload interactive diagnoses
adrienmeli@gmail.com
ALOES 2024 pre-conference workshop