19 of 19 resources
Corpus / text collection
Audio recordings of prompted, read and spontaneous speech data from L1 Catalan speakers from Barcelona. The data is stratified according to three different city districts and three age groups. Speakers' ages vary from approx. 5 to 45 years.
The Phonoprosodic corpus of spoken Catalan (PhonCAT) was compiled between July 2006 and June 2011 in the research project Phonoprosodic development of Catalan in its current bilingual context (H6, director: Conxita Lleó), part of the Collaborative Research Centre "Multilingualism", funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) and hosted by the University of Hamburg. It comprises read and elicited as well as spontaneous speech data from speakers of Catalan in Barcelona. The speakers were divided into nine groups according to their age and to the district in Barcelona in which they live. For most recordings, selected words were transcribed phonetically by one to three transcribers and annotated with their orthographic target form. A smaller set of recordings was transcribed orthographically.
Benet, Ariadna, Susana Cortés, and Conxita Lleó. 2012. "Phonoprosodic Corpus of Spoken Catalan (PhonCAT)." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 215–29. Hamburg Studies in Multilingualism, 14. John Benjamins.
CLARIN Metadata summary for Catalan in a bilingual context (PhonCAT) (CMDI-based)
Title: Catalan in a bilingual context (PhonCAT)
Data owner: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de
Contributors: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de (compiler)
Project: H6 "Phonoprosodic development of Catalan in its current bilingual context", German Research Foundation (DFG)
Keywords: adult bilingualism, bilingual society, cross-sectional data, simultaneous bilingualism, successive bilingualism, child L2 acquisition, L1 data, child bilingualism, language contact, EXMARaLDA
Language: Catalan (cat)
Size: 234 speakers (0 female, 234 male), 225 communications, 205 recordings, 8719 minutes, 875 transcriptions, 187967 words
Genre: discourse
Modality: spoken
Corpus / text collection
Original Data: Audio recordings (semi-spontaneous interviews) with German/Italian and German/French bilingual speakers aged approx. 15-55 years at the recording sessions. The simultaneous bilinguals with German and French/Italian as L1s were recorded twice, i.e. once for each language. The successive bilinguals, with German as L1 and French/Italian as L2 or with French/Italian as L1 and German as L2, all have AOAs between 11 and 38 years and were recorded using their L2.
Corpus: The Hamburg Adult Bilingual LAnguage (HABLA) corpus was compiled between July 2009 and June 2011 in the research project Linguistic Features of First Language Attrition and Second Language Acquisition in Adult Bilinguals (German-French and German-Italian) (E11, principal investigator: Tanja Kupisch), part of the Collaborative Research Centre "Multilingualism", funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) and hosted by the University of Hamburg.
Kupisch, Tanja, Dagmar Barton, Giulia Bianchi, and Ilse Stangen. 2012. "The HABLA-Corpus (German-French and German-Italian)." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 163–79. Hamburg Studies in Multilingualism, 14. John Benjamins.
CLARIN Metadata summary for Hamburg Adult Bilingual LAnguage (HABLA) (CMDI-based)
Title: Hamburg Adult Bilingual LAnguage (HABLA)
Publication date: 2011-06-30
Data owner: Tanja Kupisch, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, tanja.kupisch@uni-hamburg.de
Contributors: Tanja Kupisch, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, tanja.kupisch@uni-hamburg.de (compiler)
Project: E11 "Linguistic Aspects of Language Attrition and Second Language Acquisition in adult bilinguals (German-French and German-Italian)", German Research Foundation (DFG)
Keywords: cross-sectional data, simultaneous bilingualism, successive bilingualism, L2 data, language attrition, adult bilingualism, EXMARaLDA
Languages: German (deu), French (fra), Italian (ita)
Size: 140 speakers (88 female, 49 male, 3 unknown), 169 communications, 169 recordings, 4748 minutes, 169 transcriptions, 737797 words
Temporal Coverage: 2009-07-01/2011-10-01
Spatial Coverage: Hamburg, DE; Lyon, FR; Berlin, DE; Lyon, DE; IT
Genre: discourse
Modality: spoken
Corpus / text collection
Between 2003 and 2006 the project E3 collected the corpus Madrid-PhonBLA (Bilingual Language Acquisition), which contains cross-sectional data from 71 Spanish-German bilingual children, aged 2 to 6, growing up in Madrid (Spain), and the PhonMAS (Monolingual Acquisition of Spanish) corpus, which contains cross-sectional data from 45 monolingual Spanish children, aged 2 to 9. The Madrid-PhonBLA corpus contains 46 hours of bilingual (Spanish and German) spontaneous speech, and the PhonMAS contains 45 hours of monolingual Spanish speech. The recordings were made in Madrid, in various preschools. The purpose of the monolingual data, from a total of 45 monolingual children (between 2 and 6 years of age), was to provide a larger basis for comparison with the bilingual corpus on sound phenomena. Four adults were also recorded by the E3 project in Madrid. The audio files of PhonMAS contain 17 hours of spontaneous speech. Two of the adults' corpora in these files have been orthographically transcribed and transferred to EXMARaLDA. Of the 168 available audio files (from 116 bilinguals and 52 monolinguals), 37 are provided with orthographic and phonetic transcriptions, and 34 files have been transferred to EXMARaLDA. The International Phonetic Alphabet (IPA) was used for the phonetic transcription of those 34 files. The criteria used for transcribing these files were the same as those used for the transcription of the Hamburg-PhonBLA corpus.
Lleó, Conxita. 2012. "Monolingual and Bilingual Phonoprosodic Corpora of Child German and Child Spanish." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 107–22. Hamburg Studies in Multilingualism, 14. John Benjamins.
Corpus / text collection
The TEDH has been created as part of the project "Foreign Language Acquisition in German-Turkish bilinguals". The TEDH Corpus contains interviews in three languages: Turkish, English, German. The corpus contains 74 communications from 25 different speakers. The bulk of the language material to be integrated, glossed and annotated has been collected by several researchers and is available in audio format. The transcription data as well as the metadata of the corpus are processed and stored in EXMARaLDA format.
The TEDH-corpus (Türkisch-Englisch-Deutsch bei Herkunftssprecherinnen) was compiled between July 2010 and June 2012 in the research project "Foreign Language Acquisition in German-Turkish bilinguals" (principal investigator: Tanja Kupisch, coordination: Ilse Stangen). This project was part of the iNet3 at the research initiative "Linguistic Diversity Management in Urban Areas" (LiMA) at Hamburg University. The corpus includes speech data (transcriptions and audio files) from semi-structured interviews in Turkish, German and English of 25 Turkish-German bilinguals with an age range of 14 to 42 years, as well as speech data from L2 speakers of Turkish (with German as L1) and German (with Turkish as L1) and background data of all speakers. Recordings can be made available upon request, provided the user agrees to the terms and conditions.
Kupisch, T., Lloyd-Smith, A. and I. Stangen (accepted). Perceived global accent in Turkish heritage speakers in Germany: The impact of exposure and use for early bilinguals. In: Fatih Bayram and Deniz Tat (eds.) Studies in Turkish as a Heritage Language.
Kupisch, T., Belikova, A., Özçelik, Ö., Stangen, I. and L. White. (2017). On complete acquisition in heritage speakers: The definiteness effect in German-Turkish bilinguals. Linguistic Approaches to Bilingualism, 7, 1-31.
Stangen, I., Kupisch, T., A.L. Proietti Erguen and M. Zielke. (2015). Foreign accent in heritage speakers of Turkish in Germany. In: Hagen Peukert (ed.), Transfer effects in multilingual language development. Amsterdam: Benjamins, 87-108.
Kupisch, T., Snape, N. and I. Stangen (2012). Foreign language acquisition in heritage speakers: The acquisition of articles in L3-English by German-Turkish bilinguals. In: Joana Duarte and Ingrid Gogolin (eds.), Linguistic Superdiversity in Urban Areas. Research Approaches. Amsterdam: Benjamins, 99-121.
CLARIN Metadata summary for Türkisch-Englisch-Deutsch bei Herkunftssprechern (TEDH) (CMDI-based)
Title: Türkisch-Englisch-Deutsch bei Herkunftssprechern (TEDH)
Data owner: Tanja Kupisch, Fachbereich Linguistik / Universitätsstraße 10 / D-78464 Konstanz, Tanja.Kupisch@uni-konstanz.de
Contributors: Tanja Kupisch, Fachbereich Linguistik / Universitätsstraße 10 / D-78464 Konstanz, Tanja.Kupisch@uni-konstanz.de (depositor), Tanja Kupisch (compiler), Ilse Stangen (compiler), Marina Zielke (compiler), Tanja Kupisch (researcher), Ilse Stangen (researcher), Marina Zielke (researcher)
Keywords: annotated, bilingual society, code-switching, information status, language contact, EXMARaLDA
Languages: German (deu), Turkish (tur), English (eng)
Size: 44 speakers (29 female, 9 male, 6 unknown), 74 communications, 25.27 hours, 1516 minutes, 74 recordings, 74 transcriptions, 0 words
Spatial Coverage: Hamburg, DE
Genre: discourse
Modality: spoken
Corpus / text collection
Audio recordings of evocative field experiments (picture story, retelling, spontaneous discourse etc.) with Turkish/German bilingual children and monolingual Turkish / monolingual German children as control data.
ENDFAS and SKOBI are two corpora of Turkish-German bilingual children, created for the research projects ENDFAS ("Die Entwicklung narrativer Diskursfähigkeiten im Deutschen und Türkischen in Familie und Schule") and SKOBI ("Sprachliche Konnektivität bei bilingual türkisch-deutsch aufwachsenden Kindern und Jugendlichen") at the University of Hamburg. The corpus creators are Jochen Rehbein, Annette Herkenrath and Birsel Karakoç. Post-editing and publication of the data were carried out by project Z2 at the Research Center on Multilingualism, University of Hamburg.
Herkenrath, Annette, and Jochen Rehbein. 2012. "Pragmatic Corpus Analysis, Exemplified by Turkish-German Bilingual and Monolingual Data." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 123–52. Hamburg Studies in Multilingualism, 14. John Benjamins.
CLARIN Metadata summary for Rehbein-ENDFAS/Rehbein-SKOBI-Korpus (CMDI-based)
Title: Rehbein-ENDFAS/Rehbein-SKOBI-Korpus
Publication date: 2009-03-04
Data owner: Jochen Rehbein, Institut für Germanistik I / Von Melle Park 6 / D-20146 Hamburg, rehbein@uni-hamburg.de
Contributors: Jochen Rehbein, Institut für Germanistik I / Von Melle Park 6 / D-20146 Hamburg, rehbein@uni-hamburg.de (compiler)
Project: E5 "Linguistic connectivity in bilingual Turkish-German children", German Research Foundation (DFG)
Keywords: child language acquisition, child bilingualism, monolingual data, L1 data, L2 data, contact variety, EXMARaLDA
Languages: German (deu), Turkish (tur)
Size: 523 speakers (284 female, 200 male, 36 unknown), 1017 communications, 371 recordings, 14033 minutes, 836 transcriptions, 748074 words
Temporal Coverage: 1987-10-10/2009-09-19
Spatial Coverage: Bozyazı, TR; Anamur, TR; Istanbul, TR; Hamburg, DE; Ankara, TR; TR; Mersin, TR; istanbul, TR; Hamburg, TR; Anakara, TR; Icel, Bozyazi, TR; Icel, Anamur, TR; Icel, Anamar, TR; Icel (Anamur), TR; Icel, (Anamur), TR; Icel, (Bozyazi), TR; Icel (Bozyazi), TR; Eskisehir, TR; Tokat, TR; Izmir, TR; Istan, TR; Istanbul, TR
Genre: discourse
Modality: spoken
Corpus / text collection
Audio recordings of four German/Spanish simultaneous bilingual children starting at approx. 1 year and ending between the ages 2;4 and 3 years. There are 144 recording sessions (interviewer/child interaction), half of them conducted in a German and half in a Spanish speaking environment.
Phonologie-Erwerb Deutsch-Spanisch als Erste Sprachen (PEDSES) is a phonetically and orthographically transcribed corpus of German and Spanish, finalized in the project Prosodic constraints on phonological and morphological development in bilingual first language acquisition at the Research Center on Multilingualism, University of Hamburg.
Lleó, Conxita. 2012. "Monolingual and Bilingual Phonoprosodic Corpora of Child German and Child Spanish." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 107–22. Hamburg Studies in Multilingualism, 14. John Benjamins.
CLARIN Metadata summary for Phonologie-Erwerb Deutsch-Spanisch als Erste Sprachen (PEDSES) (CMDI-based)
Title: Phonologie-Erwerb Deutsch-Spanisch als Erste Sprachen (PEDSES)
Data owner: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de
Contributors: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de (depositor, compiler)
Project: E3 "Prosodic Constraints on Phonological and Morphological development in Bilingual First Language Acquisition", German Research Foundation (DFG)
Keywords: child language acquisition, child bilingualism, longitudinal data, simultaneous bilingualism, L2 data, L1 data, EXMARaLDA
Languages: German (deu), Spanish (spa)
Size: 22 speakers (15 female, 7 male), 146 communications, 137.79 hours, 4550 minutes, 144 recordings, 127 transcriptions, 101292 words
Genre: discourse
Modality: spoken
Corpus / text collection
Audio and video recordings of five German/Spanish bilingual children starting at approx. 1 year and 6 months and ending at age 6-7 years with about 419 recordings (interviewer/child interaction), 203 in a German and 216 in a Spanish speaking environment.
PhonBLA Longitudinalstudie Hamburg is a phonetically and orthographically transcribed corpus of German and Spanish, finalized in the project Prosodic constraints on phonological and morphological development in bilingual first language acquisition at the Research Center on Multilingualism, University of Hamburg.
Lleó, Conxita. 2012. "Monolingual and Bilingual Phonoprosodic Corpora of Child German and Child Spanish." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 107–22. Hamburg Studies in Multilingualism, 14. John Benjamins.
CLARIN Metadata summary for PhonBLA Longitudinalstudie Hamburg (CMDI-based)
Title: PhonBLA Longitudinalstudie Hamburg
Data owner: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de
Contributors: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de (depositor, compiler, researcher)
Project: E3 "Prosodic Constraints on Phonological and Morphological development in Bilingual First Language Acquisition", German Research Foundation (DFG)
Keywords: child language acquisition, child bilingualism, longitudinal data, simultaneous bilingualism, L2 data, L1 data, EXMARaLDA
Languages: German (deu), Spanish (spa)
Size: 62 speakers (43 female, 18 male, 1 unknown), 422 communications, 225.96 hours, 13558 minutes, 419 recordings, 413 transcriptions, 303792 words
Genre: discourse
Modality: spoken
Corpus / text collection
Video recordings of seven German/French simultaneous bilingual children, starting at approx. 1 year and 6 months. One or two recordings each month until approx. 5 years and 6 months (and some later recordings for some subjects). In each recording session (interviewer/child interaction) the child is addressed in both languages in one French and one German part. For privacy protection reasons, only the transcripts can be made available. Video recordings cannot be published. Meisel, Jürgen (ed.) (1994): Bilingual First Language Acquisition: French and German Grammatical Development (Language Acquisition & Language Disorders). Amsterdam: John Benjamins.
Corpus / text collection
Audio recordings of five German and six Spanish speaking monolingual children. For the German children there are 176 recordings (interviewer/child interaction), on average starting at 9 months and ending at 3 years; for the Spanish children there are 79 recordings, on average starting at 9 months and ending at 2;6 years.
PAIDUS is a phonetically and orthographically transcribed corpus of German and Spanish child language, finalized in the project Prosodic constraints on phonological and morphological development in bilingual first language acquisition at the Research Center on Multilingualism, University of Hamburg.
Lleó, Conxita. 2012. "Monolingual and Bilingual Phonoprosodic Corpora of Child German and Child Spanish." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 107–22. Hamburg Studies in Multilingualism, 14. John Benjamins.
CLARIN Metadata summary for Parameterfixierung im Deutschen und Spanischen (PAIDUS) (CMDI-based)
Title: Parameterfixierung im Deutschen und Spanischen (PAIDUS)
Publication date: 2010-09-30
Data owner: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de
Contributors: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de (depositor, compiler)
Project: E3 "Prosodic Constraints on Phonological and Morphological development in Bilingual First Language Acquisition", German Research Foundation (DFG)
Keywords: child language acquisition, longitudinal data, monolingual data, EXMARaLDA
Languages: German (deu), Spanish (spa)
Size: 67 speakers (44 female, 23 male), 255 communications, 138.47 hours, 8308 minutes, 255 recordings, 253 transcriptions, 166976 words
Genre: discourse
Modality: spoken
Corpus / text collection
Audio recordings in Spanish with 23 German/Spanish simultaneous bilingual children living in Germany and attending the Spanish complementary school at the first level. There are 1-6 recordings with each child; for 11 children, recordings were also made before they attended the Spanish complementary school. All recordings feature elicited speech: a picture naming task, a story telling task, a morphosyntactic test, a lexical test, and the HAVAS 5. Rich metadata on language use and attitudes in the family was submitted by the parents.
ALCEBLA is a phonetically and orthographically transcribed corpus of German and Spanish, finalized in the project Research based support of the complementary Spanish school in Germany (FUSED) at the Research Center on Multilingualism, University of Hamburg.
Ulloa Saceda, Marta; Lleó, Conxita and García Sánchez, Izarbe (2012): Corpora of spoken Spanish by simultaneous and successive German-Spanish bilingual and Spanish monolingual children. In: Schmidt, Thomas and Wörner, Kai (eds.): Multilingual Corpora and Multilingual Corpus Analysis. Hamburg Studies in Multilingualism (14), 97-106. Amsterdam: John Benjamins.
CLARIN Metadata summary for ALCEBLA (CMDI-based)
Title: ALCEBLA
Publication date: 2020
Data owner: Conxita Lleó, lleo@uni-hamburg.de
Contributors: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de (compiler)
Project: T4 "Research based support of the complementary Spanish school in Germany (FUSED)", German Research Foundation (DFG)
Keywords: child language acquisition, child bilingualism, simultaneous bilingualism, EXMARaLDA
Languages: German (deu), Spanish (spa)
Size: 23 speakers (14 female, 9 male), 86 communications, 84 recordings, 2434 minutes, 66 transcriptions, 36717 words
Spatial Coverage: DE
Genre: discourse
Modality: spoken
Corpus / text collection
Audio recordings of eight German participants in Spain, with Spanish as L2, children first exposed to their L2 at or before the age of three, and children exposed to their L2 at or after the age of five. Recording sessions in Spanish based on picture naming and story telling etc. Rich metadata on language use and attitudes in the family was submitted by the parents.
Phon-CL2 is a phonetically and orthographically transcribed corpus of German and Spanish, finalized in the project Prosodic constraints on phonological and morphological development in bilingual first language acquisition at the Research Center on Multilingualism, University of Hamburg.
Ulloa, Marta Saceda, Conxita Lleó, and Izarbe Garcia Sanchez. 2012. "Corpora of Spoken Spanish by Simultaneous and Successive German-Spanish Bilingual and Spanish Monolingual Children." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 97–106. Hamburg Studies in Multilingualism, 14. John Benjamins.
CLARIN Metadata summary for Phon-CL2 (CMDI-based)
Title: Phon-CL2
Data owner: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de
Contributors: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de (compiler)
Project: E3 "Prosodic Constraints on Phonological and Morphological development in Bilingual First Language Acquisition", German Research Foundation (DFG)
Keywords: child language acquisition, child bilingualism, successive bilingualism, child L2 acquisition, L2 data, adult bilingualism, EXMARaLDA
Languages: German (deu), Spanish (spa)
Size: 22 speakers (8 female, 14 male), 26 communications, 0 recordings, 0 minutes, 26 transcriptions, 17412 words
Genre: discourse
Modality: spoken
Corpus / text collection
Video recordings and transcriptions of three German/Portuguese simultaneous bilingual children, starting at approx. 1 year and 6 months. One or two recordings each month until approx. 5 years and 6 months. In each recording session (interviewer/child interaction) the child is addressed in both languages in one Portuguese and one German part. For privacy protection reasons, only the transcripts can be made available. Video recordings cannot be published.
Corpus / text collection
Subcorpus 'Brazilian Portuguese' (two children) from the BIPODE project. Video recordings and transcriptions of two German/Portuguese simultaneous bilingual children, starting at approx. 1 year and 6 months. One or two recordings each month until approx. 5 years and 6 months. In each recording session (interviewer/child interaction) the child is addressed in both languages in one Portuguese and one German part. For privacy protection reasons, only the transcripts can be made available. Video recordings cannot be published.
Corpus / text collection
Subcorpus 'Iberian Portuguese' (one child) from the BIPODE project. Video recordings of one German/Portuguese simultaneous bilingual child, starting at approx. 1 year and 6 months. One or two recordings each month until approx. 5 years and 6 months. In each recording session (interviewer/child interaction) the child is addressed in both languages in one Portuguese and one German part. For privacy protection reasons, only the transcripts can be made available. Video recordings cannot be published.
Corpus / text collection
Video recordings in German of eight bilingual children with L1 Turkish and L2 German with AOA of 3-4 years. Several recordings of spontaneous speech (play) during 7-28 months at ages approx. 3-6.5 years, and of elicited language with focus on article usage. Comparable data for the TÜ_DE-L1-Korpus.
The "Turkish-German Successive-Bilinguals Corpus" (TÜ_DE_cL2 Hamburg) is a corpus of spoken child language that was collected in the project Specific Language Impairment and Early Successive Language Acquisition (Project E4) at the Research Center on Multilingualism, University of Hamburg. The corpus consists of 94 longitudinal spontaneous speech samples (20-50 minutes) and of data from 18 elicitation tasks in German from eight typically-developing successive-bilingual children with Turkish as their first language (L1). The corpus was recorded between 2003 and 2008.
The children were born in Germany and lived in families with Turkish as the home language. The L1 of both parents was Turkish, and they were not very proficient in German, which was their second language (L2). None of these children had older siblings attending school who could have brought German into family life. Furthermore, the families lived in parts of Hamburg with a mainly Turkish-speaking population. The children started to acquire German as an early L2 when they began attending a German Kindergarten (KiTa) at the age of three to four. They stayed at least four hours a day in the KiTa. Thus, the age of onset (AO) of German is between a few months before the age of three (AO3) and a few months after the age of four (AO4). The children have no hearing loss or language impairment (in German and Turkish). None of the children had a cognitive impairment (non-verbal IQ test). The recordings of Gül and Faruk start after two and three months of exposure to German (ME), the recordings of the other six children after ME 8 and 9.
The spontaneous speech data were recorded in a play session with the child and a monolingual German interviewer. There are between six and twenty-three spontaneous speech samples per child over a period of between seven and 28 months. The elicitation task targets the use of articles in nominative and accusative constructions. The sessions were videotaped and then transcribed according to the conventions of the project. The transcriptions were checked by a second person. The transcriptions contain the (orthographic) verbal utterances of the child and the interviewer, comments on deviant pronunciation or specific characteristics of the child's speech and, in part, the child's non-verbal behaviour and comments on the situation.
CLARIN Metadata summary for TÜ_DE-cL2-Korpus (CMDI-based)
Title: TÜ_DE-cL2-Korpus
Data owner: Monika Rothweiler, rothweiler@uni-bremen.de
Contributors: Monika Rothweiler, rothweiler@uni-bremen.de (compiler)
Project: E4 "Specific language impairment and early second language acquisition: Differentiating deviations in morphosyntactic acquisition", German Research Foundation (DFG)
Keywords: child language acquisition, child bilingualism, longitudinal data, child L2 acquisition, successive bilingualism, L2 data, EXMARaLDA
Language: German (deu)
Size: 19 speakers (19 female, 0 male), 112 communications, 112 transcriptions, 348292 words
Genre: discourse
Modality: spoken
Corpus / Text collection
Original Data: Audio recordings of German/Polish bilingual and Polish monolingual adults (16-46 years). Recordings of semi-spontaneous data (3 topics) and renarration of a picture story. Corpus: The Hamburg Corpus of Polish in Germany (HamCoPoliG) was compiled between July 2008 and June 2011 in the research project "Current Polish-German Bilingualism in Germany" (H8, principal investigator: Bernhard Brehmer), part of the Collaborative Research Centre "Multilingualism", funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) and hosted by the University of Hamburg. Czachor, Agnieszka (2012): Polish-German bilingual corpus: collecting and analysing written and spoken data for investigating contact-induced change. Submitted to: Schmidt, Thomas & Wörner, Kai (eds.): Multilingual Corpora and Multilingual Corpus Analysis. Hamburg Studies in Multilingualism (14). Amsterdam: Benjamins. CLARIN Metadata summary for Hamburg Corpus of Polish in Germany (HamCoPoliG) (CMDI-based) Title: Hamburg Corpus of Polish in Germany (HamCoPoliG) Description: Audio recordings of German/Polish bilingual and Polish monolingual adults (16-46 years). Recordings of semi-spontaneous data (3 topics) and renarration of a picture story.
Publication date: 2020-09-30 Data owner: Bernhard Brehmer, Institut für Slavistik / Von-Melle-Park 6 / D-20146 Hamburg, bernhard.brehmer@uni-hamburg.de Contributors: Bernhard Brehmer, Institut für Slavistik / Von-Melle-Park 6 / D-20146 Hamburg, bernhard.brehmer@uni-hamburg.de (depositor), Bernhard Brehmer, Institut für Slavistik / Von-Melle-Park 6 / D-20146 Hamburg, bernhard.brehmer@uni-hamburg.de (compiler), Magdalena Kulesz (data_inputter), Maja Hajduk (data_inputter), Timm Lehmberg, timm.lehmberg@uni-hamburg.de (developer), Bernhard Brehmer, Institut für Slavistik / Von-Melle-Park 6 / D-20146 Hamburg, bernhard.brehmer@uni-hamburg.de (researcher), Agnieszka Czachór (researcher), Deutsche Forschungsgemeinschaft (DFG) (sponsor) Project: H8 "Current Polish-German Bilingualism in Germany", German Research Foundation (DFG) Keywords: language attrition, adult bilingualism, cross-sectional data, simultaneous bilingualism, successive bilingualism, child L2 acquisition, adult L2 acquisition, L1 data, EXMARaLDA Language: Polish (pol) Size: 93 speakers (64 female, 29 male), 360 communications, 37.73 hours, 2264 minutes, 358 recordings, 358 transcriptions, 294663 words Temporal Coverage: 2009-04-17/2010-12-17 Spatial Coverage: Max-Brauer-Allee 60, 22765 Hamburg, DE; Warszawa, PL Genre: discourse Modality: spoken
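The "Size:" field of these metadata summaries follows a regular "number unit" pattern, so it can be read programmatically. The following is a minimal sketch, assuming the field has already been copied out of the summary as a plain string; the function name `parse_size` and the key `speakers_by_gender` are hypothetical names chosen for illustration and are not part of CMDI or EXMARaLDA.

```python
import re

def parse_size(size_field: str) -> dict:
    """Hypothetical helper: split a 'Size:' value into {unit: number}.

    The parenthesised speaker breakdown is stored separately under
    'speakers_by_gender' (an illustrative key, not a CMDI element).
    """
    result = {}
    # Extract the parenthesised breakdown first so its commas do not
    # interfere with the remaining "number unit" pairs.
    breakdown = re.search(r"\(([^)]*)\)", size_field)
    if breakdown:
        result["speakers_by_gender"] = {
            label: int(count)
            for count, label in re.findall(r"(\d+)\s+(\w+)", breakdown.group(1))
        }
        size_field = size_field[:breakdown.start()] + size_field[breakdown.end():]
    # Remaining pairs such as "360 communications" or "37.73 hours".
    for number, unit in re.findall(r"(\d+(?:\.\d+)?)\s+(\w+)", size_field):
        result[unit] = float(number) if "." in number else int(number)
    return result

# Size value taken from the HamCoPoliG summary above.
size = parse_size(
    "93 speakers (64 female, 29 male), 360 communications, 37.73 hours, "
    "2264 minutes, 358 recordings, 358 transcriptions, 294663 words"
)
```

With the parsed values, simple consistency checks become possible, for example that the female and male counts add up to the speaker total and that the minutes figure agrees with the hours figure after rounding.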
Corpus / Text collection
Transcription of audio recordings of various kinds of doctor-patient communication in hospitals. There are both monolingual conversations in German, Portuguese and Turkish, recorded in the respective country, and interpreted conversations recorded in Germany (i.e. in German-Turkish, German-Portuguese, and German-Portuguese/Spanish), about 15-20 recordings of each kind. The persons interpreting are bilingual hospital employees or relatives of the patients, who are all adults living in Germany but with varying knowledge of German. The corpus Dolmetschen im Krankenhaus (DiK - Interpreting in Hospitals) was compiled between July 1999 and June 2005 in the research project "Interpreting in Hospitals" (K2, principal investigator: Kristin Bührig), part of the Collaborative Research Centre "Multilingualism", funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) and hosted by the University of Hamburg. The corpus is based on doctor-patient-communications between German doctors or nursing staff and patients with Turkish or Portuguese as their mother tongue. These conversations were translated by laypersons (nursing staff, relatives). Furthermore, the corpus comprises monolingual doctor-patient-communications from Germany, Portugal and Turkey as comparative data. Bührig, Kristin; Kliche, Ortrun; Meyer, Bernd and Pawlack, Birte. 2012. "The Corpus 'Interpreting in Hospitals'. Possible Applications for Research and Communication Training." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 305–15. Hamburg Studies in Multilingualism. John Benjamins. CLARIN Metadata summary for Dolmetschen im Krankenhaus (DiK) (CMDI-based) Title: Dolmetschen im Krankenhaus (DiK) Description: Audio recordings of various kinds of doctor-patient communication in hospitals. 
There are both monolingual conversations in German, Portuguese and Turkish, recorded in the respective country, and interpreted conversations recorded in Germany (i.e. in German-Turkish, German-Portuguese, and German-Portuguese/Spanish), about 15-20 recordings of each kind. The persons interpreting are bilingual hospital employees or relatives of the patients, who are all adults living in Germany but with varying knowledge of German. Publication date: 2020-09-30 Data owner: Kristin Bührig, Institut für Germanistik I / Von-Melle-Park 6 / D-20146 Hamburg, kristin.buehrig@uni-hamburg.de; Bernd Meyer, Arbeitsbereich Interkulturelle Kommunikation / Fachbereich 06: Translations-, Sprach- und Kulturwissenschaft / Johannes Gutenberg-Universität Mainz / An der Hochschule 2 / D-76726 Germersheim, meyerb@uni-mainz.de Contributors: Kristin Bührig, Institut für Germanistik I / Von-Melle-Park 6 / D-20146 Hamburg, kristin.buehrig@uni-hamburg.de (depositor), Bernd Meyer, Arbeitsbereich Interkulturelle Kommunikation / Fachbereich 06: Translations-, Sprach- und Kulturwissenschaft / Johannes Gutenberg-Universität Mainz / An der Hochschule 2 / D-76726 Germersheim, meyerb@uni-mainz.de (depositor), Kristin Bührig, Institut für Germanistik I / Von-Melle-Park 6 / D-20146 Hamburg, kristin.buehrig@uni-hamburg.de (compiler), Bernd Meyer, Arbeitsbereich Interkulturelle Kommunikation / Fachbereich 06: Translations-, Sprach- und Kulturwissenschaft / Johannes Gutenberg-Universität Mainz / An der Hochschule 2 / D-76726 Germersheim, meyerb@uni-mainz.de (compiler), Kristin Bührig, Institut für Germanistik I / Von-Melle-Park 6 / D-20146 Hamburg, kristin.buehrig@uni-hamburg.de (compiler), Bernd Meyer, Arbeitsbereich Interkulturelle Kommunikation / Fachbereich 06: Translations-, Sprach- und Kulturwissenschaft / Johannes Gutenberg-Universität Mainz / An der Hochschule 2 / D-76726 Germersheim, meyerb@uni-mainz.de (compiler), Alper Ayzin (data_inputter), Anna (data_inputter), Bernd (data_inputter), Birte 
Pawlack (data_inputter), Çakmak (data_inputter), Carla (data_inputter), Demet (data_inputter), Erkan (data_inputter), Isabel (data_inputter), João (data_inputter), Latif (data_inputter), Maria (data_inputter), Najhla (data_inputter), Süreyya (data_inputter), Thomas Schmidt, thomas.schmidt@uni-hamburg.de (developer), Kristin Bührig, Institut für Germanistik I / Von-Melle-Park 6 / D-20146 Hamburg, kristin.buehrig@uni-hamburg.de (researcher), Bernd Meyer, Arbeitsbereich Interkulturelle Kommunikation / Fachbereich 06: Translations-, Sprach- und Kulturwissenschaft / Johannes Gutenberg-Universität Mainz / An der Hochschule 2 / D-76726 Germersheim, meyerb@uni-mainz.de (researcher), Deutsche Forschungsgemeinschaft (DFG) (sponsor) Project: K2 "Interpreting in Hospitals", German Research Foundation (DFG) Keywords: community interpreting, consecutive interpreting, interpreted communication, doctor-patient communication, communication in institutions, EXMARaLDA Languages: German (deu), Portuguese (por), Spanish (spa), Turkish (tur) Size: 187 speakers (98 female, 89 male), 91 communications, 0.0 hours, 0 minutes, 0 recordings, 92 transcriptions, 170925 words Annotation types: transcription (manual): HIAT, akz: accentuation/stress, k: free comment, sup: suprasegmental information, de: German translation, en: English translation, mt: morphological transliteration Temporal Coverage: 1997-07-11/2005-01-12 Spatial Coverage: Hamburg, DE; Vila do Conde, PT; Viana do Castelo, PT; Ankara, TR Genre: discourse Modality: spoken References: Bührig, Kristin; Kliche, Ortrun; Meyer, Bernd; Pawlack, Birte (2012) The corpus "Interpreting in Hospitals": Possible applications for research and communication training. In: Schmidt, Thomas and Wörner, Kai (eds.): Multilingual Corpora and Multilingual Corpus Analysis. Hamburg Studies in Multilingualism (14), 305-315. Amsterdam: John Benjamins.
Corpus / Text collection
Audio and video recordings of various types of community interpreted discourse (doctor-patient communication, simulated doctor-patient communication, courtroom communication) in German (simulated and authentic doctor-patient communication) and US (courtroom communication) institutions with varying community languages. Video recordings only exist for the simulated communication. For the authentic interpreted doctor-patient communication, no audio files will be made available. The ComInDat pilot corpus contains sample data from three different projects: the DiK corpus of Portuguese/German and Turkish/German interpreted doctor-patient communication in hospitals (Bührig & Meyer 2004), the IiSCC corpus, a corpus of interpreted court proceedings in different language constellations (Spanish/English, Russian/English, Haitian Creole/English and Polish/English) (Angermeyer 2006), and a corpus of simulated interpreted doctor-patient interactions in different language constellations (Russian/German, Polish/German and Romanian/German) from a training seminar for bilingual nursing staff ("SimDiK", Bührig, Kliche, Meyer & Pawlack 2012). More information about the background of the corpus and the details of its design can be found in Angermeyer, Meyer & Schmidt (2012). For more information about the project, please contact Philipp Angermeyer. Angermeyer, P., Meyer, B. and Schmidt, T. (2012). Sharing Community Interpreting Corpora: A pilot study. In: Schmidt, T. and Wörner, K. (eds.) Multilingual Corpora and Multilingual Corpus Analysis. Amsterdam: Benjamins, 275-294.
CLARIN Metadata summary for Community Interpreting Database Pilot Corpus (ComInDat) (CMDI-based) Title: Community Interpreting Database Pilot Corpus (ComInDat) Description: Audio and video recordings of various types of community interpreted discourse (doctor-patient communication, simulated doctor-patient communication, courtroom communication) in German (simulated and authentic doctor-patient communication) and US (courtroom communication) institutions with varying community languages. Video recordings only exist for the simulated communication. For the authentic interpreted doctor-patient communication, no audio files will be made available. Publication date: 2013-06-10 Data owner: Philipp Angermeyer, Department of Languages, Literatures and Linguistics / York University / 4700 Keele Street / Canada M3J 1P3, pangerme@yorku.ca, Kristin Bührig, Institut für Germanistik I / Von-Melle-Park 6 / D-20146 Hamburg, kristin.buehrig@uni-hamburg.de, Bernd Meyer, Arbeitsbereich Interkulturelle Kommunikation / Fachbereich 06: Translations-, Sprach- und Kulturwissenschaft / Johannes Gutenberg-Universität Mainz / An der Hochschule 2 / D-76726 Germersheim, meyerb@uni-mainz.de Contributors: Philipp Angermeyer, Department of Languages, Literatures and Linguistics / York University / 4700 Keele Street / Canada M3J 1P3, pangerme@yorku.ca (compiler), Kristin Bührig, Institut für Germanistik I / Von-Melle-Park 6 / D-20146 Hamburg, kristin.buehrig@uni-hamburg.de (compiler), Bernd Meyer, Arbeitsbereich Interkulturelle Kommunikation / Fachbereich 06: Translations-, Sprach- und Kulturwissenschaft / Johannes Gutenberg-Universität Mainz / An der Hochschule 2 / D-76726 Germersheim, meyerb@uni-mainz.de (compiler) Project: The Integration of Text, Sound, and Image into the Corpus-Based Analysis of Interpreter-Mediated Interaction Keywords: community interpreting, doctor-patient communication, courtroom communication, EXMARaLDA Languages: German (deu), English (eng), Spanish (spa), Turkish 
(tur), Polish (pol), Portuguese (por), Romanian (ron), Russian (rus), Haitian (hat) Size: 54 speakers (35 female, 16 male, 3 unknown), 14 communications, 12 recordings, 83 minutes, 17 transcriptions, 35051 words Annotation types: transcription (manual): HIAT/CHAT, deu: German translation, eng: English translation, k: free comment, lang: utterance language, sup: suprasegmental information, trans: utterance translation status, akz: accentuation/stress, pol: Polish translation Temporal Coverage: 1999-07-01/2010-03-09 Spatial Coverage: Hamburg, DE; New York, US; Neumünster, DE Genre: discourse Modality: spoken References: Angermeyer, P., Meyer, B. and Schmidt, T. (2012). Sharing Community Interpreting Corpora: A pilot study. In: Schmidt, T. and Wörner, K. (eds.) Multilingual Corpora and Multilingual Corpus Analysis. Amsterdam: Benjamins, 275-294.
Corpus / Text collection
Audio and video recordings of experimental/read and spontaneous speech from adult speakers of Porteño Spanish in Argentina. Speakers are 18-69 years old and from two geographic areas. For the intonational experiments, there are audio recordings only, whereas some of the free interviews and map tasks feature video recordings. The material used as stimuli in the experiments is available with references encoded in the transcriptions. The Hamburg Corpus of Argentinean Spanish (HaCASpa) was compiled in December 2008 and November/December 2009 within the context of the research project The intonation of Spanish in Argentina (H9, director: Christoph Gabriel), part of the Collaborative Research Centre "Multilingualism", funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) and hosted by the University of Hamburg. It comprises data from two varieties of Argentinean Spanish, i.e. a) the dialect spoken in the capital, Buenos Aires (also called Porteño, derived from puerto 'harbor') and b) the variety of the Neuquén/Comahue area (Northern Patagonia). The seven parts of HaCASpa correspond to the seven tasks described below in more detail: Five experiments were carried out in order to elicit specific data for research in prosody, with a main focus on intonation (Tasks 1–5); in addition, several speakers took part in a free interview (Task 6) and a map task experiment (Task 7). The Task is encoded as a metadata attribute for each communication. HaCASpa comprises three different types of spoken data, depending on the Task, i.e. spontaneous, semi-spontaneous, and scripted speech. This information corresponds to the metadata attribute Speech type. The regional dimension of the corpus is represented through the attribute Area (i.e. Buenos Aires or Neuquén/Comahue), its diachronic dimension through the attribute Age group (i.e. Under 25/Over 25). The subjects are 60 native speakers of the relevant variety of Argentinean Spanish, i.e.
Buenos Aires (Porteño) or Neuquén/Comahue Spanish. For each speaker, the following information is available: Age, Education, Occupation, Year of school enrollment, Year of school graduation and Parents' mother tongue. The current version 0.2 contains mainly orthographic transcriptions of verbal behaviour (141,000 transcribed words) and codes that relate utterances to the materials used for the experimental tasks. Experimental design: Task (1) consists of two subparts: reading a story (1a) and retelling it (1b). For (1a), the subjects were asked to read the short story "The North Wind and the Sun", which was presented on a computer screen, two times. The fable is well known for its use in phonetic descriptions of different languages (see Handbook of the International Phonetic Association, International Phonetic Association. Cambridge: Cambridge University Press, 2005); the Latin American version we used in our data stems from the Dialectoteca del español (coordination: C.-E. Piñeros). For (1b), the speakers were instructed to retell the story in their own words without being able to consult the text. With the help of these two parts, data of scripted (part 1a) as well as of semi-spontaneous speech (part 1b) could be collected. Task (2) was designed to collect data of semi-spontaneous speech by asking the subjects to answer questions pertaining to a given picture story. In a first step, the speakers were familiarized with the story, which was presented as two pictures displayed on a computer screen. In a second step, they were asked to answer specific questions about the story. The questions were also presented on the computer screen and varied in their design in order to elicit answers with different information-structural readings (such as broad vs. narrow focus or different focus types). In general, the speakers were free to answer as they wished. However, in order to avoid single word answers, they were asked to utter complete sentences.
Task (3) consisted of reading question-answer pairs, the content of which was based on the picture stories already familiar from task (2). The answers were given together with the questions on the computer screen (i.e. one question / one answer) and the speakers simply had to read both the question and the answer. Task (4) was a reading task in which the subjects were asked to utter 10 simple subject-verb-object (SVO) sentences, presented on a computer screen. The speakers were instructed to read them at both normal and fast speech rates. Along the lines proposed in D'Imperio et al. 2005 ("Intonational Phrasing in Romance: The Role of Syntactic and Prosodic Structure", in: Prosodies: With Special Reference to Iberian Languages, ed. by Frota, S. et al., Berlin: Mouton de Gruyter, 59-97), the subject and object constituents differed in their syntactic and prosodic complexity (e.g. determiner plus noun or determiner plus noun plus adjective and one or three prosodic words, respectively). The participants were instructed to read the sentences as if they contained new information. The complete experiment design is described in Gabriel, C. et al. 2011 ("Prosodic phrasing in Porteño Spanish", in: Intonational Phrasing in Romance and Germanic: Cross-Linguistic and Bilingual Studies, ed. by Gabriel, C. & Lleó, C., Amsterdam: Benjamins, 153-182). Task (5), the so-called intonation survey, consisted of 48 situations designed to elicit various intonational contours with specific pragmatic meanings. In this inductive method, the researcher confronts the speaker with a series of hypothetical situations to which he or she is supposed to react verbally. In the Argentinean version of the questionnaire, the hypothetical situations were illustrated by appropriate pictures. The experimental design is described in more detail in Prieto, P. & Roseano, P. 2010 (eds). Transcription of Intonation of the Spanish Language.
Munich: Lincom; see also the Interactive atlas of Spanish intonation (coordination: P. Prieto & P. Roseano). Task (6) collected spontaneous speech data through free interviews. In this task, the subjects were asked to tell the interviewer something about a past experience, be it a vacation or memories of Argentina as it was decades ago. Even though the interviewer was still part of the conversation, it was mainly the subjects who spoke during the recordings. Task (7) consists of Map Task dialogs. Map Task is a technique employed to collect data of spontaneous speech in which two subjects cooperate to complete a specified task. It is designed to lead the subjects to produce particular interrogative patterns. Each of the two subjects receives a map of an imaginary town marked with buildings and other specific elements. A route is marked on the map of one of the two participants, who assumes the role of the instruction-giver. The version of the same map given to the other participant, who assumes the role of the instruction-follower, differs from that of the instruction-giver in that it does not show the route to be followed. The instruction-follower therefore must ask the instruction-giver questions in order to be able to reproduce the same route on his or her own map (see also the Interactive atlas of Spanish intonation). CLARIN Metadata summary for Hamburg Corpus of Argentinean Spanish (HaCASpa) (CMDI-based) Title: Hamburg Corpus of Argentinean Spanish (HaCASpa) Description: Audio and video recordings of experimental/read and spontaneous speech from adult speakers of Porteño Spanish in Argentina. Speakers are 18-69 years old and from two geographic areas. For the intonational experiments, there are audio recordings only, whereas some of the free interviews and map tasks feature video recordings. The material used as stimuli in the experiments is available with references encoded in the transcriptions.
Publication date: 2011-06-30 Data owner: Christoph Gabriel, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, christoph.gabriel@uni-hamburg.de Contributors: Christoph Gabriel, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, christoph.gabriel@uni-hamburg.de (compiler) Project: H9 "The intonation of Spanish in Argentina", German Research Foundation (DFG) Keywords: contact variety, cross-sectional data, regional variety, language contact, EXMARaLDA Language: Spanish (spa) Size: 63 speakers (39 female, 24 male), 259 communications, 261 recordings, 1119 minutes, 261 transcriptions, 141321 words Annotation types: transcription (manual): mainly orthographic, project-specific conventions, code: reference to underlying prompts Temporal Coverage: 2008-11-01/2009-12-01 Spatial Coverage: Buenos Aires, AR; Neuquén/Comahue, AR Genre: discourse Modality: spoken
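The "Temporal Coverage:" values in these summaries are written as two calendar dates separated by a slash. The following is a minimal sketch of how such a value could be read, assuming both halves are always full YYYY-MM-DD dates as in the records shown here; the function name `parse_coverage` is a hypothetical name used only for illustration.

```python
from datetime import date
from typing import Tuple

def parse_coverage(value: str) -> Tuple[date, date]:
    """Hypothetical helper: split 'start/end' into two date objects.

    Assumes both parts are complete YYYY-MM-DD dates.
    """
    start_text, end_text = value.split("/")
    return date.fromisoformat(start_text), date.fromisoformat(end_text)

# Temporal Coverage value taken from the HaCASpa summary above.
start, end = parse_coverage("2008-11-01/2009-12-01")

# Number of days between the first and last covered date.
span_days = (end - start).days
```

Converting the two halves to `date` objects makes it straightforward to sort records by start date or to compare the coverage of different corpora.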