The CLIN27 Shared Task: Translating Historical Text to Contemporary Language for Improving Automatic Linguistic Annotation

Authors

  • Erik Tjong Kim Sang Meertens Institute
  • Marcel Bollmann Ruhr-Universität Bochum
  • Remko Boschker University of Groningen
  • Francisco Casacuberta Universitat Polit`ecnica de Val`encia
  • Feike Dietz Utrecht University
  • Stefanie Dipper Ruhr-Universität Bochum
  • Miguel Domingo University of Zagreb
  • Rob van der Goot University of Groningen
  • Marjo van Koppen Utrecht University
  • Nikola Ljubeši´c Jožef Stefan Institute
  • Robert Östling Department of Linguistics, Stockholm University
  • Florian Petran Ruhr-Universität Bochum
  • Eva Pettersson University of Uppsala
  • Yves Scherrer University of Geneva
  • Marijn Schraagen Utrecht University
  • Leen Sevens KU Leuven
  • Jörg Tiedemann University of Helsinki
  • Tom Vanallemeersch KU Leuven
  • Kalliopi Zervanou Utrecht University

Abstract

The CLIN27 shared task evaluates the effect of translating historical text to modern text with the goal of improving the quality of the output of contemporary natural language processing tools applied to the text. We focus on improving part-of-speech tagging analysis of seventeenth-century Dutch. Eight teams took part in the shared task. The best results were obtained by teams employing character-based machine translation. The best system obtained an error reduction of 51% in comparison with the baseline of tagging unmodified text. This is close to the error reduction obtained by human translation (57%).

Downloads

Published

2017-12-01

How to Cite

Tjong Kim Sang, E., Bollmann, M., Boschker, R., Casacuberta, F., Dietz, F., Dipper, S., Domingo, M., van der Goot, R., van Koppen, M., Ljubeši´c, N., Östling, R., Petran, F., Pettersson, E., Scherrer, Y., Schraagen, M., Sevens, L., Tiedemann, J., Vanallemeersch, T., & Zervanou, K. (2017). The CLIN27 Shared Task: Translating Historical Text to Contemporary Language for Improving Automatic Linguistic Annotation. Computational Linguistics in the Netherlands Journal, 7, 53–64. Retrieved from https://clinjournal.org/clinj/article/view/68

Issue

Section

Articles