Textual collection

Published Read-only DTA/DWDS

    Contentual information

    The element is a mandatory field
    Wikipedia-Korpus deu
    The element is a mandatory field
    Das Wikipedia-Korpus enthält Volltexte aus den Artikeln aus der deutschsprachigen Wikipedia auf der Basis des Datenbank-Abzugs vom 1.7.2025. Texte aus anderen Seitenarten (Diskussionen etc.) wurden... Das Wikipedia-Korpus enthält Volltexte aus den Artikeln aus der deutschsprachigen Wikipedia auf der Basis des Datenbank-Abzugs vom 1.7.2025. Texte aus anderen Seitenarten (Diskussionen etc.) wurden nicht aufgenommen. Das Korpus wurde so konzipiert, dass es sich insbesondere für die Recherche nach (fach)sprachlichen lexikografischen Belegen eignet. Deshalb wurden bei der automatischen Korpuskuration die Artikeltexte so weit wie möglich um Textteile bereinigt, die nicht satzförmig oder nicht deutschsprachig sind. Dies betrifft unter anderem Überschriften, Tabellen, Literaturangaben und fremdsprachliche Zitate, soweit diese in den Wikipedia-Quellen als solche ausgezeichnet sind. deu
    Optional field, specification not mandatory
    3021449 Dokumente, 1348643724 Tokens
    The element is a mandatory field
    Multiple entries are permitted
    academic
    The element is a mandatory field
    Multiple entries are permitted
    Content is validated according to the data model
    The element is a mandatory field
    Multiple entries are permitted
    geschrieben
    Optional field, specification not mandatory
    Multiple entries are permitted
    The element is a mandatory field
    Multiple entries are permitted
    text
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    2006-2025
    The element is a mandatory field
    vorhanden
    Optional field, specification not mandatory
    Multiple entries are permitted

    Optional field, specification not mandatory
    text (Rohtext)
    Optional field, specification not mandatory
    komplett
    Optional field, specification not mandatory
    manuell

    Optional field, specification not mandatory
    lemma (Lemmatisierung, text)
    Optional field, specification not mandatory
    komplett
    Optional field, specification not mandatory
    automatisch

    Optional field, specification not mandatory
    orth (Orthographic transcription of (mostly) spoken resources, text)
    Optional field, specification not mandatory
    komplett
    Optional field, specification not mandatory
    automatisch

    Optional field, specification not mandatory
    norm (Orthographic normalization of (mostly) spoken resources, text)
    Optional field, specification not mandatory
    komplett
    Optional field, specification not mandatory
    automatisch

    Optional field, specification not mandatory
    pos (UD17)
    Optional field, specification not mandatory
    komplett
    Optional field, specification not mandatory
    automatisch
    Optional field, specification not mandatory
    Multiple entries are permitted
    Textsammlung
    Optional field, specification not mandatory
    Multiple entries are permitted
    Belletristik
    Gebrauchsliteratur
    Wissenschaft
    Optional field, specification not mandatory
    Multiple entries are permitted
    Sprachwissenschaften
    Optional field, specification not mandatory
    Multiple entries are permitted

    Size_length

    Optional field, specification not mandatory
    Multiple entries are permitted

    The element is a mandatory field
    3021449
    The element is a mandatory field
    Dokumente
    Optional field, specification not mandatory
    Multiple entries are permitted

    The element is a mandatory field
    1348643724
    The element is a mandatory field
    Token
    Optional field, specification not mandatory
    Multiple entries are permitted

    Technical information

    The element is a mandatory field
    Content is validated according to the data model
    Optional field, specification not mandatory
    Content is validated according to the data model
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted

    Organizational information

    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted

    The element is a mandatory field
    The element is a mandatory field
    Multiple entries are permitted
    Verantwortliche Institution
    Optional field, specification not mandatory

    The element is a mandatory field
    The element is a mandatory field
    Multiple entries are permitted
    Verantwortliche Institution
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted
    Digitales Wörterbuch der deutschen Sprache

    Registry Metadata

    The element is a mandatory field
    The element is a mandatory field
    The element is a mandatory field
    May 3, 2026, 3:03:47 AM
    The element is a mandatory field
    18
    The element is a mandatory field
    August 5, 2025, 10:00:27 AM