Skip to main content Skip to main navigation

Publication

Subjective Text Complexity Assessment for German

Laura Seiffe; Fares Kallel; Sebastian Möller; Babak Naderi; Roland Roller
In: Proceedings of the Language Resources and Evaluation Conference. International Conference on Language Resources and Evaluation (LREC-2022), Marseille, France, Pages 707-714, European Language Resources Association, 6/2022.

Abstract

For different reasons, text can be difficult to read and understand for many people, especially if the text's language is too complex. In order to provide suitable text for the target audience, it is necessary to measure its complexity. In this paper we describe subjective experiments to assess the readability of German text. We compile a new corpus of sentences provided by a German IT service provider. The sentences are annotated with the subjective complexity ratings by two groups of participants, namely experts and non-experts for that text domain. We then extract an extensive set of linguistically motivated features that are supposedly interacting with complexity perception. We show that a linear regression model with a subset of these features can be a very good predictor of text complexity.

Projekte

Weitere Links