~ Registry Navigation
Search
Resources and entities
Textual collections
Services
Editions
Lexical Resources
Repositories
Further Entities
Import sources
Institutions
Persons
Works
EN
German
English
Login
Problems, troubleshooting, features
Lexical Resource
Published
Invalid
Read-only
TALAR
Bibliographical Metadata
Titel
The element is a mandatory field
Multiple entries are permitted
German compound splitting dataset
deu
Bib_creator_person
Optional field, specification not mandatory
Multiple entries are permitted
Bib_creator_institution
Optional field, specification not mandatory
Multiple entries are permitted
Verwaltungsinformationen
Md_id
The element is a mandatory field
Multiple entries are permitted
Content is validated according to the data model
https://doi.org/10.57754/FDAT.ydh6f-rgv47
Md_timestamp
The element is a mandatory field
2017-03-14
Md_creator_person
Optional field, specification not mandatory
Multiple entries are permitted
nnsmj01@uni-tuebingen.de
Person
The element is a mandatory field
nnsmj01@uni-tuebingen.de
Comment
Optional field, specification not mandatory
Md_creator_institution
Optional field, specification not mandatory
Multiple entries are permitted
Relationale_metadaten
Rel
Optional field, specification not mandatory
Multiple entries are permitted
Metadaten_zum_lebenszyklus
Lc_version
The element is a mandatory field
1
Lc_status
The element is a mandatory field
development
Rechtliche_metadaten
Ar_license
The element is a mandatory field
Multiple entries are permitted
available for research purpose upon personal contact
Ar_license_holder
Optional field, specification not mandatory
Multiple entries are permitted
Lexikologische_metadaten
Type
Type of the resource
The element is a mandatory field
Multiple entries are permitted
Lexicon
Object language
Language of the objects
Optional field, specification not mandatory
Multiple entries are permitted
Deutsch (deu)
Description language
Language of the resource description
Optional field, specification not mandatory
Multiple entries are permitted
Lex_entry_type
The element is a mandatory field
Lex_data_type
The element is a mandatory field
Multiple entries are permitted
Lex_modality
The element is a mandatory field
Multiple entries are permitted
Lex_language_region
Optional field, specification not mandatory
Multiple entries are permitted
Lex_language_period
The element is a mandatory field
Lex_dialect
Optional field, specification not mandatory
Lex_diaphrasic
Optional field, specification not mandatory
Lex_diastratic
Optional field, specification not mandatory
Lex_domain
Optional field, specification not mandatory
Lex_size
The element is a mandatory field
Multiple entries are permitted
Technische_metadaten
Tech_api_endpoint
Optional field, specification not mandatory
Multiple entries are permitted
Content is validated according to the data model
Tech_landing_page
Optional field, specification not mandatory
Multiple entries are permitted
Content is validated according to the data model
Tech_data_format
Optional field, specification not mandatory
Multiple entries are permitted
Tech_text_encoding
Optional field, specification not mandatory
Multiple entries are permitted
Tech_text_script
Optional field, specification not mandatory
Multiple entries are permitted
Tech_font_spec
Optional field, specification not mandatory
Multiple entries are permitted
Klartextbeschreibung
Description of the resource
Optional field, specification not mandatory
Multiple entries are permitted
The compounds that were used in Ma et al (2016) paper entitled "Letter Sequence Labeling for Compound Splitting". It contains both two-constituent and multi-constituent compounds. As standard evalu...
The compounds that were used in Ma et al (2016) paper entitled "Letter Sequence Labeling for Compound Splitting". It contains both two-constituent and multi-constituent compounds. As standard evaluation also involves non-compounds, the data also include non-compounds that we used. The data are organized into the exact same training/test/development split as in the paper.
eng
Raumbezogene_metadaten
Dct_covers
Optional field, specification not mandatory
Multiple entries are permitted
Geo_feature
Optional field, specification not mandatory
Multiple entries are permitted
Content is validated according to the data model
Geo_has_geometry
Optional field, specification not mandatory
Multiple entries are permitted
Geo_image
Optional field, specification not mandatory
Geo_epsg
Optional field, specification not mandatory
Registry Metadata
Resource (latest version)
The element is a mandatory field
f1a5caec-3529-4b1c-9473-b23f073ee8b6
Displayed version
The element is a mandatory field
66f2cfcb77a79455fa111907
Version timestamp
The element is a mandatory field
February 4, 2025, 11:52:47 AM
Versions
The element is a mandatory field
1
Resource created
The element is a mandatory field
February 4, 2025, 11:52:47 AM