Multilingual speech recognition:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Berlin
Logos-Verl.
2000
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Zugl.: Erlangen-Nürnberg, Univ., Diss., 2000 |
Beschreibung: | VIII, 186 S. Ill., graph. Darst. : 21 cm |
ISBN: | 3897225026 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV013366932 | ||
003 | DE-604 | ||
005 | 20010111 | ||
007 | t | ||
008 | 000926s2000 gw ad|| m||| 00||| eng d | ||
016 | 7 | |a 959786767 |2 DE-101 | |
020 | |a 3897225026 |c kart. : DM 79.00, sfr 71.90, S 576.60 |9 3-89722-502-6 | ||
035 | |a (OCoLC)50063136 | ||
035 | |a (DE-599)BVBBV013366932 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
044 | |a gw |c DE | ||
049 | |a DE-29 |a DE-29T |a DE-11 | ||
082 | 0 | |a 006.454 |b U22 | |
084 | |a ST 278 |0 (DE-625)143644: |2 rvk | ||
084 | |a ST 306 |0 (DE-625)143654: |2 rvk | ||
100 | 1 | |a Uebler, Ulla |e Verfasser |4 aut | |
245 | 1 | 0 | |a Multilingual speech recognition |c from Ulla Uebler |
264 | 1 | |a Berlin |b Logos-Verl. |c 2000 | |
300 | |a VIII, 186 S. |b Ill., graph. Darst. : 21 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
500 | |a Zugl.: Erlangen-Nürnberg, Univ., Diss., 2000 | ||
650 | 4 | |a Automatic speech recognition | |
650 | 4 | |a Speech perception | |
650 | 0 | 7 | |a Mehrsprachigkeit |0 (DE-588)4038403-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |D s |
689 | 0 | 1 | |a Mehrsprachigkeit |0 (DE-588)4038403-2 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009117880&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-009117880 |
Datensatz im Suchindex
_version_ | 1804128145622695936 |
---|---|
adam_text | Contents
1 Introduction 1
1.1 Automatic Speech Recognition 1
1.2 Multilingual Speech Recognition 5
1.3 Contribution to the Progress in Science 6
1.4 Overview 7
2 State of the Art 9
2.1 Multilingual Porting 10
2.2 Multilingual Cross-language Recognition 12
2.3 Recognition of Dialects and Non-native Speech 15
2.4 Simultaneous Multilingual Recognition 18
3 Speech Recognition Technology 23
3.1 Hidden Markov Models 23
3.2 Acoustic Phonetic Modeling 27
3.3 Language Modeling 29
3.4 Speaker Adaptation 31
4 Languages Used in this Work 35
4.1 Definitions 35
4.2 Phonetics 37
4.2.1 Production of Sounds 37
4.2.2 Representation of Sounds 40
4.3 Languages in Detail 42
4.3.1 German 43
4.3.2 English 44
4.3.3 Italian 45
4.3.4 Slovenian 46
4.3.5 Slovak 46
4.3.6 Czech 47
4.3.7 Japanese 47
4.4 Phone Inventory of the Used Languages 48
5 Overview of Approaches to Multilingual Speech Recognition in this Work 55
5.1 System Architecture 55
5.2 Challenges of Multilingual Recognition 58
5.2.1 Approaches for Acoustic Units 59
5.2.2 Approaches for Language Modeling 62
5.3 Fields of Application 63
6 Bilingual Recognition in SpeeDaTA 65
6.1 Description of the Task 67
6.1.1 Domain 67
6.1.2 Data 69
6.2 User Friendliness and Evaluation Criteria 71
6.3 Non-natives and Dialect Speakers 73
6.4 Bilingual Acoustic Units 77
6.5 Speaker Adaptation and Retraining 82
6.6 Language Model Design . 85
6.6.1 Language Models in Use 85
6.6.2 Combining Language Models 90
6.6.3 Morphological Language Modeling 91
6.7 Conclusion 99
7 Recognition of Non-native Speech 103
7.1 The Speech of Non-natives 104
7.2 Strange Corpus 107
7.3 Czech German Ill
7.4 Conclusion 113
8 Multilingual Recognition in Seven Languages 115
8.1 Adding Languages 115
8.1.1 Data 116
8.1.2 Developments in Size 118
8.2 Phoneme Substitution 120
8.2.1 Na(t)ive Substitution 121
8.2.2 Phonetic Substitution 122
8.2.3 Data-driven Substitution 124
8.3 Monolingual performance 128
8.4 Cross-lingual performance 129
, 8.5 Simultaneous multilingual performance 133
8.5.1 Acoustic modeling 134
8.5.2 Language modeling 138
8.6 Conclusion 143
9 Discussion and Outlook 147
9.1 New Languages: Modeling a Recognizer 149
9.2 Similarities among Languages: Cross-lingual Recognition ........... 150
9.3 Non-native Speakers: Recognition with Multilingual Recognizers 152
9.4 Simultaneous Recognition: Combination of Languages 153
10 Summary 157
Bibliography 161
A Phone set of 7 languages 171
B Coverage of trilingual phone sets 177
C Phoneme replacements for seven languages 179
Index 185
|
any_adam_object | 1 |
author | Uebler, Ulla |
author_facet | Uebler, Ulla |
author_role | aut |
author_sort | Uebler, Ulla |
author_variant | u u uu |
building | Verbundindex |
bvnumber | BV013366932 |
classification_rvk | ST 278 ST 306 |
ctrlnum | (OCoLC)50063136 (DE-599)BVBBV013366932 |
dewey-full | 006.454 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.454 |
dewey-search | 006.454 |
dewey-sort | 16.454 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01683nam a2200433 c 4500</leader><controlfield tag="001">BV013366932</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20010111 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">000926s2000 gw ad|| m||| 00||| eng d</controlfield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">959786767</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">3897225026</subfield><subfield code="c">kart. : DM 79.00, sfr 71.90, S 576.60</subfield><subfield code="9">3-89722-502-6</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)50063136</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV013366932</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">DE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29</subfield><subfield code="a">DE-29T</subfield><subfield code="a">DE-11</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.454</subfield><subfield code="b">U22</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 278</subfield><subfield code="0">(DE-625)143644:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 306</subfield><subfield code="0">(DE-625)143654:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Uebler, Ulla</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Multilingual speech recognition</subfield><subfield code="c">from Ulla Uebler</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin</subfield><subfield code="b">Logos-Verl.</subfield><subfield code="c">2000</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">VIII, 186 S.</subfield><subfield code="b">Ill., graph. Darst. : 21 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Zugl.: Erlangen-Nürnberg, Univ., Diss., 2000</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Automatic speech recognition</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Speech perception</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Mehrsprachigkeit</subfield><subfield code="0">(DE-588)4038403-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Mehrsprachigkeit</subfield><subfield code="0">(DE-588)4038403-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009117880&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-009117880</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV013366932 |
illustrated | Illustrated |
indexdate | 2024-07-09T18:44:35Z |
institution | BVB |
isbn | 3897225026 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-009117880 |
oclc_num | 50063136 |
open_access_boolean | |
owner | DE-29 DE-29T DE-11 |
owner_facet | DE-29 DE-29T DE-11 |
physical | VIII, 186 S. Ill., graph. Darst. : 21 cm |
publishDate | 2000 |
publishDateSearch | 2000 |
publishDateSort | 2000 |
publisher | Logos-Verl. |
record_format | marc |
spelling | Uebler, Ulla Verfasser aut Multilingual speech recognition from Ulla Uebler Berlin Logos-Verl. 2000 VIII, 186 S. Ill., graph. Darst. : 21 cm txt rdacontent n rdamedia nc rdacarrier Zugl.: Erlangen-Nürnberg, Univ., Diss., 2000 Automatic speech recognition Speech perception Mehrsprachigkeit (DE-588)4038403-2 gnd rswk-swf Automatische Spracherkennung (DE-588)4003961-4 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Automatische Spracherkennung (DE-588)4003961-4 s Mehrsprachigkeit (DE-588)4038403-2 s DE-604 HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009117880&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Uebler, Ulla Multilingual speech recognition Automatic speech recognition Speech perception Mehrsprachigkeit (DE-588)4038403-2 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd |
subject_GND | (DE-588)4038403-2 (DE-588)4003961-4 (DE-588)4113937-9 |
title | Multilingual speech recognition |
title_auth | Multilingual speech recognition |
title_exact_search | Multilingual speech recognition |
title_full | Multilingual speech recognition from Ulla Uebler |
title_fullStr | Multilingual speech recognition from Ulla Uebler |
title_full_unstemmed | Multilingual speech recognition from Ulla Uebler |
title_short | Multilingual speech recognition |
title_sort | multilingual speech recognition |
topic | Automatic speech recognition Speech perception Mehrsprachigkeit (DE-588)4038403-2 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd |
topic_facet | Automatic speech recognition Speech perception Mehrsprachigkeit Automatische Spracherkennung Hochschulschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009117880&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT ueblerulla multilingualspeechrecognition |