This repository contains pointers to various datasets of adposition/case supersenses in multiple languages.
The name of the language links to annotation guidelines if available. The guidelines version reflected in the corpus appears {in curly braces}.
-
STREUSLE: Online reviews section of the English Web Treebank, obtained from the Universal Dependencies project. STREUSLE is fully annotated with SNACS (as well as other lexical semantic annotations) and served as the primary reference corpus in developing the English SNACS guidelines. {EN v2.5} Citation: Schneider et al. ACL 2018
-
The Little Prince: Annotation is underway.
-
PASTRIE (Reddit international English). {EN v2.5} Citation: Kranzlein et al. LAW 2020
- 小王子(Xiǎo Wáng Zǐ) [The Little Prince]. {based on EN v2.3} Citation: Peng et al. LREC 2020
- 어린왕자(Erin Wangca) [The Little Prince]. {KO v0.9 based on EN v2.5} Citation: Hwang et al. DMR 2020
- Der kleine Prinz [The Little Prince]. {based on EN v2.0 (superset)/2.5 (revised subset)} Citation (paper includes guidelines): Prange and Schneider Künstliche Intelligenz 2021
- Hindi Little Prince: Annotation is underway.
- Hwang, Jena D., Hanwool Choe, Na-Rae Han, and Nathan Schneider. "K-SNACS: Annotating Korean adposition semantics." In Proceedings of the Second International Workshop on Designing Meaning Representations, pp. 53-66. 2020.
- Kranzlein, Michael, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, and Nathan Schneider. "PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English." In Proceedings of the 14th Linguistic Annotation Workshop, pp. 105-116. 2020.
- Peng, Siyao, Yang Liu, Yilun Zhu, Austin Blodgett, Yushi Zhao, and Nathan Schneider. "A Corpus of Adpositional Supersenses for Mandarin Chinese." In Proceedings of The 12th Language Resources and Evaluation Conference, pp. 5986-5994. 2020.
- Prange, Jakob, and Nathan Schneider. "Draw mir a sheep: a supersense-based analysis of German case and adposition semantics." KI-Künstliche Intelligenz. 2021.
- Schneider, Nathan, Jena D. Hwang, Vivek Srikumar, Jakob Prange, Austin Blodgett, Sarah Moeller, Aviram Stern, Adi Shalev, and Omri Abend. "Comprehensive Supersense Disambiguation of English Prepositions and Possessives." In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 185-196. 2018.