Inducing phonetic distances from dialect variation

Authors

  • Martijn Wieling Department of Humanities Computing, University of Groningen
  • Eliza Margaretha Department of Humanities Computing, University of Groningen
  • John Nerbonne Department of Humanities Computing, University of Groningen

Abstract

In this study we attempt to derive phonetic distances from alternative dialectal pronunciations used in different geographical varieties. We use two dialect atlases each containing the phonetic transcriptions of the same set of words at hundreds of sites. We collect the sound correspondences through alignment with the Levenshtein distance algorithm, and then apply an information-theoretic measure, pointwise mutual information, assigning smaller segment distances to segments which frequently correspond. We iterate alignment and information-theoretic distance assignment until both stabilize and we evaluate the quality of the phonetic distances obtained by comparing them to acoustic vowel distances. For both Dutch and German, we find strong correlations between the induced phonetic distances and the acoustic distances, illustrating the usefulness of the method in deriving valid phonetic distances from dialectal pronunciations.

Downloads

Published

2011-12-01

How to Cite

Wieling, M., Margaretha, E., & Nerbonne, J. (2011). Inducing phonetic distances from dialect variation. Computational Linguistics in the Netherlands Journal, 1, 109–118. Retrieved from https://clinjournal.org/clinj/article/view/10

Issue

Section

Articles

Most read articles by the same author(s)