Multilingual corpora and multilingual corpus analysis
Main author: | |
---|---|
Format: | Electronic E-Book |
Language: | English |
Published: | [Place of publication not identified] : John Benjamins, 2012. |
Series: | Hamburg studies on multilingualism ; v. 14 |
Subjects: | |
Online access: | Full text |
Summary: | This paper presents the metadata model of the EXMARaLDA system and its implementations. It first looks at existing metadata schemes for transcriptions of spoken language as well as written texts and highlights their advantages and disadvantages. The paper then justifies the decisions against existing models that led to a new data model that does not prescribe many metadata items and relies on XML files. It concludes with a brief outlook on ongoing efforts to standardize metadata. |
Description: | 1 online resource (pages). |
Bibliography: | Includes bibliographical references and index. |
ISBN: | 9789027273444 9027273448 |
Internal format
MARC
LEADER | 00000cam a2200000M 4500 | ||
---|---|---|---|
001 | ZDB-4-EBA-on1264964260 | ||
003 | OCoLC | ||
005 | 20241004212047.0 | ||
006 | m o d | ||
007 | cr |n||||||||| | ||
008 | 121108s2012 xx ob 001 0 eng d | ||
040 | |a SNK |b eng |e pn |c SNK |d YDXCP |d EBLCP |d MHW |d E7B |d AUW |d N$T |d OCLCF |d DEBSZ |d LOA |d PIFAG |d MERUC |d ZCU |d U3W |d UUM |d STF |d WRM |d VTS |d ICG |d INT |d TKN |d DKC |d M8D |d UKAHL |d AJS |d OCLCO |d OCLCQ |d OCLCO |d OCLCL |d TMA |d OCLCQ |d SXB | ||
019 | |a 816562968 | ||
020 | |a 9789027273444 |q (electronic bk.) | ||
020 | |a 9027273448 |q (electronic bk.) | ||
020 | |z 9027219346 | ||
020 | |z 9789027219343 | ||
035 | |a (OCoLC)1264964260 |z (OCoLC)816562968 | ||
043 | |a e-gx--- | ||
050 | 4 | |a P115.5.G3 M85 2012 | |
072 | 7 | |a EDU |x 005000 |2 bisacsh | |
082 | 7 | |a 306.44/60943 |a 306.4460943 |a 404.2 | |
049 | |a MAIN | ||
100 | 1 | |a Velupillai, Viveka. | |
245 | 1 | 0 | |a Multilingual corpora and multilingual corpus analysis. |
260 | |a [Place of publication not identified] : |b John Benjamins, |c 2012. | ||
300 | |a 1 online resource (pages). | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
490 | 0 | |a Hamburg studies on multilingualism ; |v v. 14 | |
520 | |a This paper presents the metadata model of the EXMARaLDA system and its implementations. It first looks at existing metadata schemes for transcriptions of spoken language as well as written texts and highlights their advantages and disadvantages. The paper then justifies the decisions against existing models that led to a new data model that does not prescribe many metadata items and relies on XML files. It concludes with a brief outlook on ongoing efforts to standardize metadata. | ||
504 | |a Includes bibliographical references and index. | ||
505 | 0 | |a Multilingual Corpora and Multilingual Corpus Analysis -- Editorial page -- Title page -- LCC page -- Dedication page -- Table of contents -- Introduction -- Section 1. Learner and attrition corpora -- The LeaP corpus: A multilingual corpus of spoken learner German and learner English -- 1. Introduction -- 2. LeaP corpus: Primary data -- 3. Corpus annotation -- 4. Corpus data format -- 5. Corpus search -- 6. Exploring fluency in second language learner speech with the LeaP corpus -- 7. Conclusion -- References -- Technological and methodological challenges in creating, annotating and sharing a learner corpus of -- 1. Introduction -- 2. The Hamburg Map Task Corpus -- 3. Manual interpretative annotation -- 4. Conclusion -- References -- Creation and analysis of a reading comprehension exercise corpus: Towards evaluating meaning in cont -- 1. Introduction -- 2. The Corpus of Reading Comprehension Exercises in German (CREG) -- 3. Corpus collection and the WELCOME tool -- 4. Inter-annotator agreement analysis for meaning assessment -- 5. Meaning assessment results -- 6. Avenues for future research -- 7. Summary -- Acknowledgments -- References -- The ALeSKo learner corpus: Design -- annotation -- quantitative analyses -- 1. Introduction -- 2. Design of the corpus -- 3. Annotation layers -- 4. Quantitative descriptive analyses -- 5. Applications for the corpus -- Acknowledgements -- References -- Corpora of spoken Spanish by simultaneous and successive German-Spanish bilingual and Spanish monoli -- 1. Introduction -- 2. Description of the corpora -- 3. Further research -- Acknowledgements -- References -- Monolingual and bilingual phonoprosodic corpora of child German and child Spanish -- 1. Introduction -- 2. The PAIDUS corpus -- 3. The corpus PhonBLA -- 4. Concluding remarks -- References. | |
505 | 8 | |a Pragmatic corpus analysis, exemplified by Turkish-German bilingual and monolingual data -- 1. An introductory note on methodology -- 2. The data: Corpus and constellation -- 3. Research questions and aspects of frequency -- 4. Procedures of quantitative analysis -- 5. Classification of search results -- 6. Contextual interpretation of the items -- 7. Extending the analysis: Interpretative procedures -- 8. Consequences and further research -- Abbreviations and conventions -- References -- Corpus of Polish spoken in Germany: Collecting and analysing written & spoken data for investigating -- 1. Introduction -- 2. Participants of the study -- 3. Corpus design -- 4. Data acquisition and storage -- 5. Transcription -- 6. Corpus publication and reuse -- References -- The HABLA-Corpus (German-French and German-Italian) -- 1. Introduction -- 2. Research on simultaneous bilingualism and the weaker (heritage) language -- 3. Corpus design -- 4. Transcription -- 5. Availability -- References -- Appendix -- Section 2. Language contact corpora -- The Hamburg Corpus of Argentinean Spanish (HaCASpa) -- 1. Introduction -- 2. Argentinean Porteño Spanish as a contact variety: The role of multilingualism and Second Language -- 3. Corpus design -- 4. Main findings -- 5. Remaining issues -- References -- Ad hoc contact phenomena or established features of a contact variety? Evidence from corpus analysis -- 1. Introduction -- 2. The language situation on the Faroe Islands: Sociopolitical and linguistic factors -- 3. Written and spoken language in language contact -- 4. Corpus-based analyses of contact-induced transfer -- 5. The corpora -- 6. Case study: The use of subjunctions in conditional clauses in Faroe-Danish -- 7. Conclusion -- References -- Phonoprosodic corpus of spoken Catalan (PhonCAT) -- 1. Introduction: PhonCAT -- 2. Data collection. | |
505 | 8 | |a 3. Data segmentation and coding -- 4. Collected data -- 5. Data analysis -- 6. Conclusions -- References -- Researching the intelligibility of a (German) dialect -- 1. Why passive knowledge of a dialect? -- 2. Focussing on language variation and its intelligibility in health care institutions -- 3. The corpus design -- 4. The annotation system -- 5. Evaluating the (results of) the annotating system -- References -- Annotating ambiguity: Insights from a corpus-based study on syntactic change in Old Swedish -- 1. Specific problems in historical corpora -- 2. The HaCOSSA corpus -- 3. Digital representation and linguistic annotation -- 4. Syntactic ambiguity in Old Swedish -- 5. Concluding remarks -- References -- Section 3. Interpreting corpora -- Sharing community interpreting corpora: A pilot study -- 1. Introduction -- 2. Data for the pilot study -- 3. Technical heterogeneity of the data -- 4. Common platform for sharing the data: Integration of sound, text, and images -- 5. Common approaches to annotating the data -- 6. Conclusion and outlook -- References -- CoSi -- A Corpus of Consecutive and Simultaneous Interpreting -- 1. Introduction -- 2. Corpus design -- 3. Corpus creation and editing -- 4. Corpus use -- 5. Getting access to the corpus -- References -- The corpus "Interpreting in Hospitals": Possible applications for research and communication trainin -- 1. The corpus "Interpreting in Hospitals": Design and background -- 2. The corpus "Interpreting in Hospitals" as a source for research -- 3. Using the corpus in communication trainings -- 4. Conclusions -- References -- Section 4. Comparable and parallel corpora -- The GeWiss corpus: Comparing spoken academic German, English and Polish -- 1. Putting GeWiss into context: Motivation, aims and applications -- 2. The design of the GeWiss corpus -- 3. Data acquisition -- 4. Metadata. | |
505 | 8 | |a 5. Transcription -- 6. Annotation -- 7. Perspectives -- References -- Corpora -- Appendix 1 -- Appendix 2 -- Korpus C4: A distributed corpus of German varieties -- 1. A German variety corpus -- 2. Design of the Korpus C4 -- 3. Corpus format and metadata -- 4. Access to the Korpus C4 -- 5. Conclusion -- References -- Treebanks in translation studies: The CroCo Dependency Treebank -- 1. Introduction -- 2. The CroCo Dependency Bank -- 3. Treebanks in translation studies -- 4. Conclusion and outlook -- References -- Section 5. Corpus tools -- Multilingual phonological corpus analysis: The tools behind the PhonBank Project -- 1. Introduction -- 2. PhonBank -- 3. Phon -- 4. A practical illustration -- 5. Outlook -- References -- Finding the balance between strict defaults and total openness: Collecting and managing metadata for -- 1. What is metadata? -- 2. Why metadata? -- 3. Metadata standards -- 4. ISLE Meta Data Initiative (IMDI) -- 5. Institut für Deutsche Sprache (IDS) -- 6. EXMARaLDA metadata -- 7. Using EXMARaLDA metadata -- 8. Possible enhancements to the toolset regarding metadata -- 9. Outlook -- References -- General index -- Corpora index -- Language index. | |
650 | 0 | |a Multilingualism |z Germany. | |
650 | 0 | |a Linguistic minorities |z Germany. | |
650 | 0 | |a Contrastive linguistics. |0 http://id.loc.gov/authorities/subjects/sh85031652 | |
650 | 0 | |a Corpora (Linguistics) |0 http://id.loc.gov/authorities/subjects/sh2006006393 | |
650 | 6 | |a Multilinguisme |z Allemagne. | |
650 | 6 | |a Minorités linguistiques |z Allemagne. | |
650 | 6 | |a Linguistique contrastive. | |
650 | 6 | |a Corpus (Linguistique) | |
650 | 7 | |a EDUCATION |x Bilingual Education. |2 bisacsh | |
650 | 7 | |a Contrastive linguistics |2 fast | |
650 | 7 | |a Corpora (Linguistics) |2 fast | |
650 | 7 | |a Linguistic minorities |2 fast | |
650 | 7 | |a Multilingualism |2 fast | |
651 | 7 | |a Germany |2 fast |1 https://id.oclc.org/worldcat/entity/E39PBJtCD3rcKcPDx6FHmjvrbd | |
655 | 4 | |a Electronic book. | |
758 | |i has work: |a Multilingual corpora and multilingual corpus analysis (Text) |1 https://id.oclc.org/worldcat/entity/E39PCGBHWKPXQcHFKj9ppWcRTd |4 https://id.oclc.org/worldcat/ontology/hasWork | ||
776 | 0 | 8 | |i Print version: |z 9027219346 |z 9789027219343 |w (OCoLC)795009278 |
856 | 4 | 0 | |l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=502098 |3 Volltext |
938 | |a Askews and Holts Library Services |b ASKH |n AH28329640 | ||
938 | |a ProQuest Ebook Central |b EBLB |n EBL1049747 | ||
938 | |a ebrary |b EBRY |n ebr10617478 | ||
938 | |a EBSCOhost |b EBSC |n 502098 | ||
938 | |a YBP Library Services |b YANK |n 9675074 | ||
994 | |a 92 |b GEBAY | ||
912 | |a ZDB-4-EBA | ||
049 | |a DE-863 |
Record in search index
DE-BY-FWS_katkey | ZDB-4-EBA-on1264964260 |
---|---|
_version_ | 1816882548958035968 |
adam_text | |
any_adam_object | |
author | Velupillai, Viveka |
author_facet | Velupillai, Viveka |
author_role | |
author_sort | Velupillai, Viveka |
author_variant | v v vv |
building | Verbundindex |
bvnumber | localFWS |
callnumber-first | P - Language and Literature |
callnumber-label | P115 |
callnumber-raw | P115.5.G3 M85 2012 |
callnumber-search | P115.5.G3 M85 2012 |
callnumber-sort | P 3115.5 G3 M85 42012 |
callnumber-subject | P - Philology and Linguistics |
collection | ZDB-4-EBA |
ctrlnum | (OCoLC)1264964260 |
dewey-full | 306.44/60943 306.4460943 404.2 |
dewey-hundreds | 300 - Social sciences 400 - Language |
dewey-ones | 306 - Culture and institutions 404 - Special topics of language |
dewey-raw | 306.44/60943 306.4460943 404.2 |
dewey-search | 306.44/60943 306.4460943 404.2 |
dewey-sort | 3306.44 560943 |
dewey-tens | 300 - Social sciences 400 - Language |
discipline | Sociology |
format | Electronic eBook |
genre | Electronic book. |
genre_facet | Electronic book. |
geographic | Germany fast https://id.oclc.org/worldcat/entity/E39PBJtCD3rcKcPDx6FHmjvrbd |
geographic_facet | Germany |
id | ZDB-4-EBA-on1264964260 |
illustrated | Not Illustrated |
indexdate | 2024-11-27T13:30:22Z |
institution | BVB |
isbn | 9789027273444 9027273448 |
language | English |
oclc_num | 1264964260 |
open_access_boolean | |
owner | MAIN DE-863 DE-BY-FWS |
owner_facet | MAIN DE-863 DE-BY-FWS |
physical | 1 online resource (pages). |
psigel | ZDB-4-EBA |
publishDate | 2012 |
publishDateSearch | 2012 |
publishDateSort | 2012 |
publisher | John Benjamins, |
record_format | marc |
series2 | Hamburg studies on multilingualism ; |
spelling | Velupillai, Viveka. Multilingual corpora and multilingual corpus analysis. [Place of publication not identified] : John Benjamins, 2012. 1 online resource (pages). text txt rdacontent computer c rdamedia online resource cr rdacarrier Hamburg studies on multilingualism ; v. 14 This paper presents the metadata model of the EXMARaLDA system and its implementations. It will first take a look on existing metadata schemes for transcriptions of spoken language as well as written texts and emphasize on their advantages and disadvantages. The paper will justify the decisions against existing models that led to a new data model that does not prescribe many metadata items and relies on XML files. It will conclude with a brief outlook on ongoing efforts to standardize metadata. Includes bibliographical references and index. Multilingual Corpora and Multilingual Corpus Analysis -- Editorial page -- Title page -- LCC page -- Dedication page -- Table of contents -- Introduction -- Section 1. Learner and attrition corpora -- The LeaP corpus: A multilingual corpus of spoken learner German and learner English -- 1. Introduction -- 2. LeaP corpus: Primary data -- 3. Corpus annotation -- 4. Corpus data format -- 5. Corpus search -- 6. Exploring fluency in second language learner speech with the LeaP corpus -- 7. Conclusion -- References -- Technological and methodological challenges in creating, annotating and sharing a learner corpus of -- 1. Introduction -- 2. The Hamburg Map Task Corpus -- 3. Manual interpretative annotation -- 4. Conclusion -- References -- Creation and analysis of a reading comprehension exercise corpus: Towards evaluating meaning in cont -- 1. Introduction -- 2. The Corpus of Reading Comprehension Exercises in German (CREG) -- 3. Corpus collection and the WELCOME tool -- 4. Inter-annotator agreement analysis for meaning assessment -- 5. Meaning assessment results -- 6. Avenues for future research -- 7. 
Summary -- Acknowledgments -- References -- The ALeSKo learner corpus: Design -- annotation -- quantitative analyses -- 1. Introduction -- 2. Design of the corpus -- 3. Annotation layers -- 4. Quantitative descriptive analyses -- 5. Applications for the corpus -- Acknowledgements -- References -- Corpora of spoken Spanish by simultaneous and successive German-Spanish bilingual and Spanish monoli -- 1. Introduction -- 2. Description of the corpora -- 3. Further research -- Acknowledgements -- References -- Monolingual and bilingual phonoprosodic corpora of child German and child Spanish -- 1. Introduction -- 2. The PAIDUS corpus -- 3. The corpus PhonBLA -- 4. Concluding remarks -- References. Pragmatic corpus analysis, exemplified by Turkish-German bilingual and monolingual data -- 1. An introductory note on methodology -- 2. The data: Corpus and constellation -- 3. Research questions and aspects of frequency -- 4. Procedures of quantitative analysis -- 5. Classification of search results -- 6. Contextual interpretation of the items -- 7. Extending the analysis: Interpretative procedures -- 8. Consequences and further research -- Abbreviations and conventions -- References -- Corpus of Polish spoken in Germany: Collecting and analysing written & spoken data for investigating -- 1. Introduction -- 2. Participants of the study -- 3. Corpus design -- 4. Data acquisition and storage -- 5. Transcription -- 6. Corpus publication and reuse -- References -- The HABLA-Corpus (German-French and German-Italian) -- 1. Introduction -- 2. Research on simultaneous bilingualism and the weaker (heritage) language -- 3. Corpus design -- 4. Transcription -- 5. Availability -- References -- Appendix -- Section 2. Language contact corpora -- The Hamburg Corpus of Argentinean Spanish (HaCASpa) -- 1. Introduction -- 2. Argentinean Porteño Spanish as a contact variety: The role of multilingualism and Second Language -- 3. Corpus design -- 4. Main findings -- 5. 
Remaining issues -- References -- Ad hoc contact phenomena or established features of a contact variety? Evidence from corpus analysis -- 1. Introduction -- 2. The language situation on the Faroe Islands: Sociopolitical and linguistic factors -- 3. Written and spoken language in language contact -- 4. Corpus-based analyses of contact-induced transfer -- 5. The corpora -- 6. Case study: The use of subjunctions in conditional clauses in Faroe-Danish -- 7. Conclusion -- References -- Phonoprosodic corpus of spoken Catalan (PhonCAT) -- 1. Introduction: PhonCAT -- 2. Data collection. 3. Data segmentation and coding -- 4. Collected data -- 5. Data analysis -- 6. Conclusions -- References -- Researching the intelligibility of a (German) dialect -- 1. Why passive knowledge of a dialect? -- 2. Focussing on language variation and its intelligibility in health care institutions -- 3. The corpus design -- 4. The annotation system -- 5. Evaluating the (results of) the annotating system -- References -- Annotating ambiguity: Insights from a corpus-based study on syntactic change in Old Swedish -- 1. Specific problems in historical corpora -- 2. The HaCOSSA corpus -- 3. Digital representation and linguistic annotation -- 4. Syntactic ambiguity in Old Swedish -- 5. Concluding remarks -- References -- Section 3. Interpreting corpora -- Sharing community interpreting corpora: A pilot study -- 1. Introduction -- 2. Data for the pilot study -- 3. Technical heterogeneity of the data -- 4. Common platform for sharing the data: Integration of sound, text, and images -- 5. Common approaches to annotating the data -- 6. Conclusion and outlook -- References -- CoSi -- A Corpus of Consecutive and Simultaneous Interpreting -- 1. Introduction -- 2. Corpus design -- 3. Corpus creation and editing -- 4. Corpus use -- 5. Getting access to the corpus -- References -- The corpus "Interpreting in Hospitals": Possible applications for research and communication trainin -- 1. 
The corpus "Interpreting in Hospitals": Design and background -- 2. The corpus "Interpreting in Hospitals" as a source for research -- 3. Using the corpus in communication trainings -- 4. Conclusions -- References -- Section 4. Comparable and parallel corpora -- The GeWiss corpus: Comparing spoken academic German, English and Polish -- 1. Putting GeWiss into context: Motivation, aims and applications -- 2. The design of the GeWiss corpus -- 3. Data acquisition -- 4. Metadata. 5. Transcription -- 6. Annotation -- 7. Perspectives -- References -- Corpora -- Appendix 1 -- Appendix 2 -- Korpus C4: A distributed corpus of German varieties -- 1. A German variety corpus -- 2. Design of the Korpus C4 -- 3. Corpus format and metadata -- 4. Access to the Korpus C4 -- 5. Conclusion -- References -- Treebanks in translation studies: The CroCo Dependency Treebank -- 1. Introduction -- 2. The CroCo Dependency Bank -- 3. Treebanks in translation studies -- 4. Conclusion and outlook -- References -- Section 5. Corpus tools -- Multilingual phonological corpus analysis: The tools behind the PhonBank Project -- 1. Introduction -- 2. PhonBank -- 3. Phon -- 4. A practical illustration -- 5. Outlook -- References -- Finding the balance between strict defaults and total openness: Collecting and managing metadata for -- 1. What is metadata? -- 2. Why metadata? -- 3. Metadata standards -- 4. ISLE Meta Data Initiative (IMDI) -- 5. Institut für Deutsche Sprache (IDS) -- 6. EXMARaLDA metadata -- 7. Using EXMARaLDA metadata -- 8. Possible enhancements to the toolset regarding metadata -- 9. Outlook -- References -- General index -- Corpora index -- Language index. Multilingualism Germany. Linguistic minorities Germany. Contrastive linguistics. http://id.loc.gov/authorities/subjects/sh85031652 Corpora (Linguistics) http://id.loc.gov/authorities/subjects/sh2006006393 Multilinguisme Allemagne. Minorités linguistiques Allemagne. Linguistique contrastive. 
Corpus (Linguistique) EDUCATION Bilingual Education. bisacsh Contrastive linguistics fast Corpora (Linguistics) fast Linguistic minorities fast Multilingualism fast Germany fast https://id.oclc.org/worldcat/entity/E39PBJtCD3rcKcPDx6FHmjvrbd Electronic book. has work: Multilingual corpora and multilingual corpus analysis (Text) https://id.oclc.org/worldcat/entity/E39PCGBHWKPXQcHFKj9ppWcRTd https://id.oclc.org/worldcat/ontology/hasWork Print version: 9027219346 9789027219343 (OCoLC)795009278 FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=502098 Volltext |
subject_GND | http://id.loc.gov/authorities/subjects/sh85031652 http://id.loc.gov/authorities/subjects/sh2006006393 |
title | Multilingual corpora and multilingual corpus analysis. |
title_sort | multilingual corpora and multilingual corpus analysis |
topic | Multilingualism Germany. Linguistic minorities Germany. Contrastive linguistics. http://id.loc.gov/authorities/subjects/sh85031652 Corpora (Linguistics) http://id.loc.gov/authorities/subjects/sh2006006393 Multilinguisme Allemagne. Minorités linguistiques Allemagne. Linguistique contrastive. Corpus (Linguistique) EDUCATION Bilingual Education. bisacsh Contrastive linguistics fast Corpora (Linguistics) fast Linguistic minorities fast Multilingualism fast |
topic_facet | Multilingualism Germany. Linguistic minorities Germany. Contrastive linguistics. Corpora (Linguistics) Multilinguisme Allemagne. Minorités linguistiques Allemagne. Linguistique contrastive. Corpus (Linguistique) EDUCATION Bilingual Education. Contrastive linguistics Linguistic minorities Multilingualism Germany Electronic book. |
url | https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=502098 |
work_keys_str_mv | AT velupillaiviveka multilingualcorporaandmultilingualcorpusanalysis |