Lexical Resource

Published Invalid Read-only TALAR

    Bibliographical Metadata

    The element is a mandatory field
    Multiple entries are permitted
    Vector representations of German words and compounds eng
    Optional field, specification not mandatory
    Multiple entries are permitted

    The element is a mandatory field
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    Multiple entries are permitted

    Verwaltungsinformationen

    The element is a mandatory field
    Multiple entries are permitted
    Content is validated according to the data model
    The element is a mandatory field
    2017-03-14
    Optional field, specification not mandatory
    Multiple entries are permitted

    The element is a mandatory field
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    Multiple entries are permitted

    Relationale_metadaten

    Optional field, specification not mandatory
    Multiple entries are permitted

    Metadaten_zum_lebenszyklus

    The element is a mandatory field
    1
    The element is a mandatory field
    Produktiv

    Rechtliche_metadaten

    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted

    Lexikologische_metadaten

    Type of the resource
    The element is a mandatory field
    Multiple entries are permitted
    Wörterbuch
    Language of the objects
    Optional field, specification not mandatory
    Multiple entries are permitted
    Language of the resource description
    Optional field, specification not mandatory
    Multiple entries are permitted
    The element is a mandatory field
    The element is a mandatory field
    Multiple entries are permitted
    The element is a mandatory field
    Multiple entries are permitted
    Written
    Optional field, specification not mandatory
    Multiple entries are permitted
    The element is a mandatory field
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    Optional field, specification not mandatory
    The element is a mandatory field
    Multiple entries are permitted
    50 dimensions
    100 dimensions
    200 dimensions
    300 dimensions

    Technische_metadaten

    Optional field, specification not mandatory
    Multiple entries are permitted
    Content is validated according to the data model
    Optional field, specification not mandatory
    Content is validated according to the data model
    Optional field, specification not mandatory
    Multiple entries are permitted
    Content is validated according to the data model
    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted

    Klartextbeschreibung

    Optional field, specification not mandatory
    Multiple entries are permitted
    Word representations used in Dima(2015), Dima (2019). The vectors were generated from the decow14ax corpus (https://corporafromtheweb.org/), ~10 billion words of raw text. Corpus pre-processing: wo... Word representations used in Dima(2015), Dima (2019). The vectors were generated from the decow14ax corpus (https://corporafromtheweb.org/), ~10 billion words of raw text. Corpus pre-processing: words lowercased, punctuation removed, each number was replaced by the string 'NUMBER'. Embeddings trained using a minimum word frequency of 100, leading to a vocabulary 1,029,270 words. The vocabulary file 'decow14ax_all_min_100.vocab' contains these word representations and their frequency in the support corpus. 'decow14ax_full.vocab' contains the full vocabulary generated for the corpus (no cut-off). The embeddings were trained with GloVe, for 15 iterations, using a 10-word symmetric window of text (20 words surrounding a particular word). The files are suffixed with the dimensionality of the vector representations: 50 dimensional, 100 dimensional, 200 dimensional and 300 dimensional. MAX_ITER=15 WINDOW_SIZE=10 BINARY=0 NUM_THREADS=8 X_MAX=100 eng

    Raumbezogene_metadaten

    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Multiple entries are permitted
    Content is validated according to the data model
    Optional field, specification not mandatory
    Multiple entries are permitted
    Optional field, specification not mandatory
    Optional field, specification not mandatory

    Registry Metadata

    The element is a mandatory field
    The element is a mandatory field
    The element is a mandatory field
    June 5, 2025, 7:27:41 PM
    The element is a mandatory field
    5
    The element is a mandatory field
    May 27, 2025, 10:23:45 AM