Lexical Resource
Published
Invalid
Read-only
TALAR
Bibliographical Metadata
Title
The element is a mandatory field
Multiple entries are permitted
Vector representations of German words and compounds
eng
Bib_creator_person
Optional field, specification not mandatory
Multiple entries are permitted
Person
The element is a mandatory field
Dima; Gina-Corina; Promotion Universität Tübingen, Philosophische Fakultät, Fachbereich Allgemeine und vergleichende Sprachwissenschaft; https://d-nb.info/gnd/1181819423
Comment
Optional field, specification not mandatory
Bib_creator_institution
Optional field, specification not mandatory
Multiple entries are permitted
Administrative information
Md_id
The element is a mandatory field
Multiple entries are permitted
Content is validated according to the data model
https://doi.org/10.57754/FDAT.fx84s-dxe33
Md_timestamp
The element is a mandatory field
2017-03-14
Md_creator_person
Optional field, specification not mandatory
Multiple entries are permitted
Person
The element is a mandatory field
nnsdg01@uni-tuebingen.de
Comment
Optional field, specification not mandatory
Md_creator_institution
Optional field, specification not mandatory
Multiple entries are permitted
Relational metadata
Rel
Optional field, specification not mandatory
Multiple entries are permitted
Lifecycle metadata
Lc_version
The element is a mandatory field
1
Lc_status
The element is a mandatory field
Productive
Legal metadata
Ar_license
Optional field, specification not mandatory
Multiple entries are permitted
restricted use, request required
Ar_license_holder
Optional field, specification not mandatory
Multiple entries are permitted
Lexicological metadata
Type
Type of the resource
The element is a mandatory field
Multiple entries are permitted
Dictionary
Object language
Language of the objects
Optional field, specification not mandatory
Multiple entries are permitted
German (deu)
Description language
Language of the resource description
Optional field, specification not mandatory
Multiple entries are permitted
Lex_entry_type
The element is a mandatory field
Lex_data_type
The element is a mandatory field
Multiple entries are permitted
Lex_modality
The element is a mandatory field
Multiple entries are permitted
Written
Lex_language_region
Optional field, specification not mandatory
Multiple entries are permitted
Lex_language_period
The element is a mandatory field
Lex_dialect
Optional field, specification not mandatory
Lex_diaphrasic
Optional field, specification not mandatory
Lex_diastratic
Optional field, specification not mandatory
Lex_domain
Optional field, specification not mandatory
Lex_size
The element is a mandatory field
Multiple entries are permitted
50 dimensions
100 dimensions
200 dimensions
300 dimensions
Technical metadata
Tech_api_endpoint
Optional field, specification not mandatory
Multiple entries are permitted
Content is validated according to the data model
Fcs_endpoint
Optional field, specification not mandatory
Content is validated according to the data model
Tech_landing_page
Optional field, specification not mandatory
Multiple entries are permitted
Content is validated according to the data model
https://doi.org/10.57754/FDAT.fx84s-dxe33
Tech_data_format
Optional field, specification not mandatory
Multiple entries are permitted
Tech_text_encoding
Optional field, specification not mandatory
Multiple entries are permitted
Tech_text_script
Optional field, specification not mandatory
Multiple entries are permitted
Tech_font_spec
Optional field, specification not mandatory
Multiple entries are permitted
Plain-text description
Description of the resource
Optional field, specification not mandatory
Multiple entries are permitted
Word representations used in Dima (2015) and Dima (2019). The vectors were generated from the decow14ax corpus (https://corporafromtheweb.org/), ~10 billion words of raw text. Corpus pre-processing: words were lowercased, punctuation was removed, and each number was replaced by the string 'NUMBER'. The embeddings were trained using a minimum word frequency of 100, leading to a vocabulary of 1,029,270 words. The vocabulary file 'decow14ax_all_min_100.vocab' contains these word representations and their frequency in the support corpus. 'decow14ax_full.vocab' contains the full vocabulary generated for the corpus (no cut-off). The embeddings were trained with GloVe for 15 iterations, using a 10-word symmetric window of text (10 words on either side, i.e. up to 20 words surrounding a particular word). The files are suffixed with the dimensionality of the vector representations: 50, 100, 200 and 300 dimensions. Training parameters: MAX_ITER=15, WINDOW_SIZE=10, BINARY=0, NUM_THREADS=8, X_MAX=100.
eng
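The pre-processing and the file format mentioned in the description above can be illustrated with a minimal Python sketch. It is not the original pipeline: the function names `preprocess` and `load_glove_text` are hypothetical, the number pattern and the use of ASCII punctuation only are assumptions (the record does not define either precisely), and the loader assumes the usual GloVe text output for BINARY=0, i.e. one word per line followed by its space-separated vector components.

```python
import re
import string

# Assumption: a "number" is a run of digits, optionally with internal
# '.' or ',' separators. The record does not specify the exact rule.
_NUMBER_RE = re.compile(r"\d+(?:[.,]\d+)*")

# Assumption: only ASCII punctuation is stripped; the original
# pipeline may also have removed other punctuation characters.
_PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def preprocess(text):
    """Lowercase, replace numbers by 'NUMBER', remove punctuation, tokenize."""
    text = text.lower()                      # lowercase first, so NUMBER stays uppercase
    text = _NUMBER_RE.sub(" NUMBER ", text)  # replace each number by the placeholder
    text = text.translate(_PUNCT_TABLE)      # drop ASCII punctuation
    return text.split()                      # whitespace tokenization


def load_glove_text(lines, expected_dim=None):
    """Parse GloVe text-format vectors: 'word v1 v2 ... vN' per line."""
    vectors = {}
    for line in lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip empty or malformed lines
        word = parts[0]
        values = [float(x) for x in parts[1:]]
        if expected_dim is not None and len(values) != expected_dim:
            raise ValueError(
                f"{word!r}: expected {expected_dim} dimensions, got {len(values)}"
            )
        vectors[word] = values
    return vectors


# Toy data for illustration only; these are not values from the resource.
tokens = preprocess("Im Jahr 2014 gab es 1.029.270 Wörter!")
toy_vectors = load_glove_text(
    ["haus 0.1 -0.2 0.3\n", "NUMBER 1.0 2.0 3.0\n"], expected_dim=3
)
```

When looking up a word in the vectors, the query would presumably need the same pre-processing as the corpus (lowercasing, 'NUMBER' for numbers); `expected_dim` can be set to 50, 100, 200 or 300 to check a file against its dimensionality suffix.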
Spatial metadata
Dct_covers
Optional field, specification not mandatory
Multiple entries are permitted
Geo_feature
Optional field, specification not mandatory
Multiple entries are permitted
Content is validated according to the data model
Geo_has_geometry
Optional field, specification not mandatory
Multiple entries are permitted
Geo_image
Optional field, specification not mandatory
Geo_epsg
Optional field, specification not mandatory
Registry Metadata
Resource (latest version)
The element is a mandatory field
3b2f7fe4-2081-47af-aeeb-0f822a262770
Displayed version
The element is a mandatory field
6841d38ddfe34a43998c43c2
Version timestamp
The element is a mandatory field
June 5, 2025, 7:27:41 PM
Versions
The element is a mandatory field
5
Resource created
The element is a mandatory field
May 27, 2025, 10:23:45 AM