The Graphic Narrative Corpus (GNC): Design, Annotation, and Analysis for the Digital Humanities

Dunst A, Hartel R, Laubrock J (2018)


Publication Type: Conference contribution

Publication year: 2018

Publisher: IEEE Computer Society

Book Volume: 3

Pages Range: 15-20

Conference Proceedings Title: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR

Event location: Kyoto, JPN

ISBN: 9781538635865

DOI: 10.1109/ICDAR.2017.286

Abstract

Developed for an interdisciplinary DH project, the Graphic Narrative Corpus (GNC) is the first digital corpus of graphic novels, memoirs, and non-fiction written in English. It currently includes 160 book-length titles and will grow to around 250 graphic narratives by 2018. In contrast to collections such as Manga109, the eBDtheque, and the Iyyer corpus, the GNC was conceived to serve both the research needs of humanities and social science scholars and as a data set for computational analysis. The GNC has been constructed as a stratified monitor corpus that balances different historical periods, geographical origin, literary genres, and the gender and ethnic background of authors. Based on an extension of John Walsh's XML-dialect CBML and editor software developed for the corpus, annotation combines a focus on the first ten pages of each title and sample annotation of full-length books. XML-annotation currently includes visual objects, as well as word-image and character relations (panels, characters, balloons, captions, text, interaction types). In addition, we also provide eye-tracking data for annotated titles. Information about the corpus and sample visualizations can be found at: https://groups.uni-paderborn.de/graphic-literature/gncorpus/corpus.php.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Dunst, A., Hartel, R., & Laubrock, J. (2018). The Graphic Narrative Corpus (GNC): Design, Annotation, and Analysis for the Digital Humanities. In Proceedings of the International Conference on Document Analysis and Recognition, ICDAR (pp. 15-20). Kyoto, JPN: IEEE Computer Society.

MLA:

Dunst, Alexander, Rita Hartel, and Jochen Laubrock. "The Graphic Narrative Corpus (GNC): Design, Annotation, and Analysis for the Digital Humanities." Proceedings of the 2nd International Workshop on Comics Analysis, Processing and Understanding, MANPU 2017, Kyoto, JPN IEEE Computer Society, 2018. 15-20.

BibTeX: Download