eng GerCo: German Adjective-Noun Collocations Dataset
eng The dataset contains 4732 adjective-noun pairs extracted from the DWDS corpora [1] using the Wortprofil application [2]. Two experts annotated each phrase as either a collocation or a non-collocation. One of the annotators further classified the non-collocations as free phrases, idioms, named entities, or terms. If you use this dataset for research purposes, please cite the following paper: Yana Strakatova, Neele Falk, Isabel Fuhrmann, Daniela Rossmann, Erhard Hinrichs. All That Glitters is Not Gold: A Gold Standard of Adjective-Noun Collocations for German. 2019. References: [1] DWDS – Digitales Wörterbuch der deutschen Sprache. Das Wortauskunftssystem zur deutschen Sprache in Geschichte und Gegenwart, hrsg. v. d. Berlin-Brandenburgischen Akademie der Wissenschaften. [2] DWDS-Wortprofil, erstellt durch das Digitale Wörterbuch der deutschen Sprache.
2017-03-14
1
4f2a62c1-35e2-4c7e-ab4b-7680c8a5f2cb
8cefa5dd-f5fb-4527-8acb-88cc6824eb48
4732 phrases