"PolDiLemma" Middle Polish Diachrone Lemmatised Corpus eng
The PolDiLemma corpus is a diachronic corpus of political, religious, scientific and historical texts by various authors of the Middle Polish period (16th-18th century). It contains ca. 7 million tokens in total. The period is characterised by the slow development of a supra-regional standard language: a process of standardisation based on the variety used by the Polish nobility and shaped by Latin and other foreign languages as well as by various social and regional varieties. All texts are freely licensed and were gathered from the Federacja Bibliotek Cyfrowych (Digital Library Federation). The Middle Polish texts illustrate the history of the language and offer first-hand evidence of the development of Polish in its historical context. Studying the history of the language is also a way to become familiar with aspects of the history of Poland in general, and it helps to build valuable methodological knowledge in diachronic linguistics and philology. eng
Czech
German
Latin
Polish
4.2 million words
public
available
CLARIND-UdS: Repository for Language Resources at Saarland University
corpus
Linguistics
written
e48d64a4-5893-480c-aeed-962a9bc8c526
hdl:11858/00-246C-0000-0023-8C44-B
2025-05-12T07:45:47.276Z
collection
c1c9b626-0a08-4962-9a02-04fd60f7cd5f