Data type: Text (2026)
Availability: Restricted
Publication Date: May 8, 2026
Licence: Licence not relevant, dataset not available to others
The dataset comprises the text corpus of the dissertation project "Kurzes Erzählen im 21. Jahrhundert. Entstehungsbedingungen und Aneignungspotenziale kurzer Erzählformen in der Gegenwart". It contains 1,615 German-language short narrative texts published between 2008 and 2023 in books, in literary journals (Das GRAMM, Sinn und Form, Sprache im technischen Zeitalter), in the context of literary competitions (Münchner/Deutscher Kurzgeschichtenwettbewerb, Bachmannpreis, Open Mike, Schreibwettbewerb des Literaturhaus Zürich, Moerser Literaturpreis), and on an online platform (Fanfiktion.de). The texts are provided in two versions, both as UTF-8 encoded .txt files: as full texts (in the directory textkorpus_kurze_erzaehlformen\volltexte) and in a form pre-processed for topic modeling (file names carry the suffix _pp; in the directory textkorpus_kurze_erzaehlformen\pp_spacy_nonames). In addition, the dataset includes the trained topic model, the scripts used to pre-process and evaluate the texts, and a number of diagnostic and result files.
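For users with access, the two-directory layout described above could be traversed roughly as follows. This is a minimal sketch, not one of the scripts shipped with the dataset: the function names pair_corpus_files and read_text are hypothetical, and it assumes that each pre-processed file is named after its full-text counterpart with _pp appended before the .txt extension (the description only states that the suffix _pp is used).

```python
from pathlib import Path


def pair_corpus_files(root):
    """Map each full-text file name to its pre-processed counterpart.

    `root` is the path to the textkorpus_kurze_erzaehlformen directory.
    ASSUMPTION: the pre-processed version of <name>.txt is <name>_pp.txt.
    Files without a matching pre-processed version map to None.
    """
    root = Path(root)
    full_dir = root / "volltexte"
    pp_dir = root / "pp_spacy_nonames"

    pairs = {}
    for full_path in sorted(full_dir.glob("*.txt")):
        pp_path = pp_dir / f"{full_path.stem}_pp.txt"
        pairs[full_path.name] = pp_path if pp_path.exists() else None
    return pairs


def read_text(path):
    """Read one corpus file; the dataset description specifies UTF-8."""
    return Path(path).read_text(encoding="utf-8")
```

Calling pair_corpus_files("textkorpus_kurze_erzaehlformen") returns a dictionary keyed by full-text file name. Entries whose value is None indicate that the assumed naming scheme does not hold for that file and should be checked by hand.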