On Modelling Corpus Citations in Computational Lexical Resources

Khan AF, Ionov M, Chiarcos C, Romary L, Sérasset G, Kabashi B (2024)


Publication Type: Conference contribution

Publication year: 2024

Publisher: European Language Resources Association (ELRA)

Page range: 12385-12394

Conference Proceedings Title: 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings

Event location: Hybrid, Torino, ITA

ISBN: 9782493814104

Abstract

In this article we look at how two different standards for lexical resources, TEI and OntoLex, deal with corpus citations in lexicons. We will focus on how corpus citations in retrodigitised dictionaries can be modelled using each of the two standards since this provides us with a suitably challenging use case. After looking at the structure of an example entry from a legacy dictionary, we examine the two approaches offered by the two different standards by outlining an encoding for the example entry using both of them (note that this article features the first extended discussion of how the Frequency Attestation and Corpus (FrAC) module of OntoLex deals with citations). After comparing the two approaches and looking at the advantages and disadvantages of both, we argue for a combination of both. In the last part of the article we discuss different ways of doing this, giving our preference for a strategy which makes use of RDFa.

How to cite

APA:

Khan, A.F., Ionov, M., Chiarcos, C., Romary, L., Sérasset, G., & Kabashi, B. (2024). On Modelling Corpus Citations in Computational Lexical Resources. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue (Eds.), 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp. 12385-12394). Hybrid, Torino, ITA: European Language Resources Association (ELRA).

MLA:

Khan, Anas Fahad, et al. "On Modelling Corpus Citations in Computational Lexical Resources." Proceedings of the Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024, Hybrid, Torino, ITA. Ed. Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue, European Language Resources Association (ELRA), 2024. 12385-12394.
