eng Royal Society Corpus Universal Dependency Parsed
eng This is a Universal Dependency parsed version of the The Royal Society Corpus (RSC) 6.0 Open In the preparation of the corpus, "good sentences" were extracted from RSC V6.0 Open, excluding sentences with the following features (a) beginning with a word in lower case and the sentence preceding them (incomplete), (b) sentences with less than 8 tokens (too short), (c) as well as sentences lacking a verb (verbless), (d) being in a language different from English. The downloadable corpus has the following annotations word lemma upos — Part-of-Speech using Universal Depencies pos — Part-of-Speech using PennTreebank tagset ufeat — Universal Features (morphological annotation) parent — the parent of a token in the dependency tree urel — Universal Relation dl — Dependency length srp — Surprisal srp_avg — Average Surprisal
Englisch
236 Megabyte
public
0d8a5b8e-6dc9-46ff-ab9b-7471cd3a20cf
c1c9b626-0a08-4962-9a02-04fd60f7cd5f
vorhanden
CLARIND-UdS: Repositorium für Sprachressourcen an der Universität des Saarlandes
corpus
Sprachwissenschaften
geschrieben