Manas-UdS Kyrgyz Corpus

The Manas-UdS Kyrgyz Corpus is an annotated corpus of the Kyrgyz language. Part one comprises 1,205,888 words from 84 literary texts in five genres: novel, novelette, epic, minor epic, and fairy tale. The corpus is annotated with lemma and part-of-speech tags and with rich per-text metadata. The texts were sourced from the Bizdin Muras foundation, which promotes the development of the Kyrgyz language (http://bizdin.kg). Part two adds Kyrgyz proverbs (also from the Bizdin Muras foundation) and approximately 1 million words of newspaper text generously provided by Erkin-Too, the official state newspaper of the Kyrgyz Republic (https://erkin-too.kg/).

Kyrgyz

2.2 million words

public

df29f8c5-ade7-4d1f-af56-5cb89d257e67

c1c9b626-0a08-4962-9a02-04fd60f7cd5f

CLARIND-UdS: Repository for Language Resources at Saarland University

corpus

Linguistics

written

2023

1959-2021
