Real world approaches for multilingual and non-native speech recognition
Main Author: | Raab, Martin |
---|---|
Format: | Thesis Book |
Language: | English |
Published: | Berlin: Logos-Verl., 2010 |
Series: | Studien zur Mustererkennung, 32 |
Subjects: | |
Online Access: | Table of contents |
Description: | XII, 156 pp., diagrams, 21 cm |
ISBN: | 9783832524463 |
Internal format
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV036513109 | ||
003 | DE-604 | ||
005 | 20100628 | ||
007 | t | ||
008 | 100621s2010 d||| m||| 00||| eng d | ||
015 | |a 10,A20 |2 dnb | ||
016 | 7 | |a 1002021049 |2 DE-101 | |
020 | |a 9783832524463 |c kart. |9 978-3-8325-2446-3 | ||
035 | |a (OCoLC)624471811 | ||
035 | |a (DE-599)DNB1002021049 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
049 | |a DE-29 |a DE-29T | ||
082 | 0 | |a 006.454 |2 22/ger | |
084 | |a ST 306 |0 (DE-625)143654: |2 rvk | ||
084 | |a 004 |2 sdnb | ||
084 | |a 400 |2 sdnb | ||
100 | 1 | |a Raab, Martin |e Verfasser |0 (DE-588)133990303 |4 aut | |
245 | 1 | 0 | |a Real world approaches for multilingual and non-native speech recognition |c von Martin Raab |
264 | 1 | |a Berlin |b Logos-Verl. |c 2010 | |
300 | |a XII, 156 S. |b graph. Darst. |c 21 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Studien zur Mustererkennung |v 32 | |
502 | |a Zugl.: Erlangen-Nürnberg, Univ., Diss., 2010 | ||
650 | 0 | 7 | |a Codebuch |0 (DE-588)4697624-3 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Mehrsprachigkeit |0 (DE-588)4038403-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Eingebettetes System |0 (DE-588)4396978-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Non-native speaker |0 (DE-588)4438849-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachdaten |0 (DE-588)4312652-2 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Eingebettetes System |0 (DE-588)4396978-1 |D s |
689 | 0 | 1 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |D s |
689 | 0 | 2 | |a Mehrsprachigkeit |0 (DE-588)4038403-2 |D s |
689 | 0 | 3 | |a Codebuch |0 (DE-588)4697624-3 |D s |
689 | 0 | 4 | |a Non-native speaker |0 (DE-588)4438849-4 |D s |
689 | 0 | 5 | |a Sprachdaten |0 (DE-588)4312652-2 |D s |
689 | 0 | |5 DE-604 | |
830 | 0 | |a Studien zur Mustererkennung |v 32 |w (DE-604)BV013645858 |9 32 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020435274&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-020435274 |
Record in the search index
_version_ | 1804143082215571456 |
---|---|
adam_text | CONTENTS
ABBREVIATIONS
1 INTRODUCTION
1.1 MULTILINGUAL SPEECH RECOGNITION
1.1.1 PROLOG
1.1.2 NON-NATIVE SPEECH RECOGNITION
1.1.3 COMBINATONE PROBLEM
1.2 MOTIVATION AND AIMS
1.3 CONTRIBUTIONS
1.4 OUTLINE
2 AUTOMATIC SPEECH RECOGNITION
2.1 STATISTICAL FRAMEWORK
2.2 FEATURE EXTRACTION
2.3 ACOUSTIC MODEL
2.3.1 HIDDEN MARKOV MODELS
2.3.2 SEMI-CONTINUOUS ACOUSTIC MODEL
2.3.3 CODEBOOK GENERATION
2.3.4 TRAINING HIDDEN MARKOV MODELS
2.3.5 DECODING WITH HIDDEN MARKOV MODELS
2.4 LANGUAGE MODEL
2.5 DICTIONARY
2.6 EVALUATION MEASURES
3 RELATED WORK
3.1 LINGUISTICS OF ACCENTS
3.2 SECOND LANGUAGE ACQUISITION
3.2.1 CONTRASTIVE HYPOTHESIS
3.2.2 IDENTITY HYPOTHESIS
3.2.3 THEORY OF LEARNERS ERRORS
3.2.4 INTERLANGUAGE HYPOTHESIS
3.3 MULTILINGUAL SPEECH RECOGNITION
3.3.1 DESIGN QUESTIONS
3.3.2 PHONEME DISTANCES
3.3.3 LINGUISTICALLY MOTIVATED PARAMETER REDUCTION
3.3.4 DATA DRIVEN PARAMETER REDUCTION
3.3.5 COMPARISONS OF PARAMETER REDUCTION TECHNIQUES
3.4 NON-NATIVE SPEECH RECOGNITION
3.4.1 DESIGN QUESTIONS
3.4.2 RECOGNITION WITHOUT NON-NATIVE DATA
3.4.3 ADAPTATION WITH PHONETIC KNOWLEDGE AND HUMAN EXPERTISE
3.4.4 SUPERVISED ADAPTATION WITH NON-NATIVE DATA
3.4.5 TRAINING WITH NON-NATIVE DATA
3.5 SUMMARY
4 ALGORITHM DESCRIPTION
4.1 BENCHMARK SYSTEM
4.2 BASELINE SYSTEM
4.3 MULTILINGUAL WEIGHTED CODEBOOK
4.3.1 MOTIVATION
4.3.2 MWC ALGORITHM
4.3.3 DISTANCE MEASURES
4.3.4 SIMPLIFICATIONS
4.4 ON-THE-FLY GENERATION OF MULTILINGUAL HMMS
4.4.1 MOTIVATION
4.4.2 OPTIMAL PROJECTIONS
4.4.3 APPROXIMATED PROJECTIONS
4.4.4 OVERVIEW OF PROJECTIONS
4.5 SCALABLE ARCHITECTURE
4.6 ADAPTATION
4.6.1 ACCENT DETECTION AND LANGUAGE IDENTIFICATION THROUGH CODEBOOK SHARE RATES
4.6.2 DURATIONAL MODELING
4.6.3 FREQUENCY BAND WEIGHT ADAPTATION
4.6.4 MODEL MERGING
4.6.5 HMM ADAPTATION
4.7 SUMMARY
5 RESOURCES
5.1 NON-NATIVE SPEECH DATABASES
5.1.1 PROBLEM
5.1.2 OVERVIEW OF NON-NATIVE DATABASES
5.1.3 DETAILS FOR SELECTED DATABASES
5.1.4 CATEGORIES OF DATABASES
5.1.5 SUMMARY
5.2 NAMING SCHEME
5.3 TRAINING DATA
5.4 DEVELOPMENT DATA
5.5 TEST DATA
5.6 SUMMARY
6 EXPERIMENTS
6.1 MONOPHONES VS. TRIPHONES
6.2 MULTILINGUAL WEIGHTED CODEBOOKS
6.2.1 DISTANCE METRICS
6.2.2 CODEBOOK SIZE
6.2.3 SIMPLIFICATIONS
6.2.4 FIVE-LINGUAL SYSTEM
6.2.5 PERFORMANCE ON ACCENTED ENGLISH
6.2.6 SUMMARY
6.3 ON-THE-FLY MULTILINGUAL HMMS
6.3.1 COMBINATION WEIGHT FOR PROJECTION 7
6.3.2 COMPARING PROJECTIONS
6.3.3 OTFMHMM ON NATIVE SPEECH
6.3.4 OTFMHMM ON NON-NATIVE SPEECH
6.3.5 SUMMARY
6.4 SCALABLE ARCHITECTURE
6.4.1 PERFORMANCE ON NATIVE SPEECH
6.4.2 PERFORMANCE ON NON-NATIVE SPEECH
6.4.3 RUNTIME
6.4.4 SUMMARY
6.5 ADAPTATION
6.5.1 ACCENT DETECTION AND LANGUAGE IDENTIFICATION THROUGH CODEBOOK SHARE RATES
6.5.2 DURATIONAL MODELING
6.5.3 FREQUENCY BAND WEIGHT ADAPTATION
6.5.4 MODEL MERGING
6.5.5 HMM ADAPTATION
6.5.6 SUMMARY
7 OUTLOOK
8 SUMMARY
A OWN PUBLICATIONS
B SIGNIFICANCE TEST
C MEL BANDS
LIST OF FIGURES
LIST OF TABLES
BIBLIOGRAPHY
|
any_adam_object | 1 |
author | Raab, Martin |
author_GND | (DE-588)133990303 |
author_facet | Raab, Martin |
author_role | aut |
author_sort | Raab, Martin |
author_variant | m r mr |
building | Verbundindex |
bvnumber | BV036513109 |
classification_rvk | ST 306 |
ctrlnum | (OCoLC)624471811 (DE-599)DNB1002021049 |
dewey-full | 006.454 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.454 |
dewey-search | 006.454 |
dewey-sort | 16.454 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik Sprachwissenschaft |
format | Thesis Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02234nam a2200541 cb4500</leader><controlfield tag="001">BV036513109</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20100628 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">100621s2010 d||| m||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">10,A20</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">1002021049</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783832524463</subfield><subfield code="c">kart.</subfield><subfield code="9">978-3-8325-2446-3</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)624471811</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB1002021049</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29</subfield><subfield code="a">DE-29T</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.454</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 306</subfield><subfield code="0">(DE-625)143654:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">004</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">400</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Raab, 
Martin</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)133990303</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Real world approaches for multilingual and non-native speech recognition</subfield><subfield code="c">von Martin Raab</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin</subfield><subfield code="b">Logos-Verl.</subfield><subfield code="c">2010</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XII, 156 S.</subfield><subfield code="b">graph. Darst.</subfield><subfield code="c">21 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Studien zur Mustererkennung</subfield><subfield code="v">32</subfield></datafield><datafield tag="502" ind1=" " ind2=" "><subfield code="a">Zugl.: Erlangen-Nürnberg, Univ., Diss., 2010</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Codebuch</subfield><subfield code="0">(DE-588)4697624-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Mehrsprachigkeit</subfield><subfield code="0">(DE-588)4038403-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Eingebettetes System</subfield><subfield code="0">(DE-588)4396978-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield 
code="a">Non-native speaker</subfield><subfield code="0">(DE-588)4438849-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachdaten</subfield><subfield code="0">(DE-588)4312652-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Eingebettetes System</subfield><subfield code="0">(DE-588)4396978-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Mehrsprachigkeit</subfield><subfield code="0">(DE-588)4038403-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="3"><subfield code="a">Codebuch</subfield><subfield code="0">(DE-588)4697624-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="4"><subfield code="a">Non-native speaker</subfield><subfield code="0">(DE-588)4438849-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="5"><subfield code="a">Sprachdaten</subfield><subfield code="0">(DE-588)4312652-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Studien 
zur Mustererkennung</subfield><subfield code="v">32</subfield><subfield code="w">(DE-604)BV013645858</subfield><subfield code="9">32</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020435274&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-020435274</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV036513109 |
illustrated | Illustrated |
indexdate | 2024-07-09T22:42:00Z |
institution | BVB |
isbn | 9783832524463 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-020435274 |
oclc_num | 624471811 |
open_access_boolean | |
owner | DE-29 DE-29T |
owner_facet | DE-29 DE-29T |
physical | XII, 156 S. graph. Darst. 21 cm |
publishDate | 2010 |
publishDateSearch | 2010 |
publishDateSort | 2010 |
publisher | Logos-Verl. |
record_format | marc |
series | Studien zur Mustererkennung |
series2 | Studien zur Mustererkennung |
spelling | Raab, Martin Verfasser (DE-588)133990303 aut Real world approaches for multilingual and non-native speech recognition von Martin Raab Berlin Logos-Verl. 2010 XII, 156 S. graph. Darst. 21 cm txt rdacontent n rdamedia nc rdacarrier Studien zur Mustererkennung 32 Zugl.: Erlangen-Nürnberg, Univ., Diss., 2010 Codebuch (DE-588)4697624-3 gnd rswk-swf Mehrsprachigkeit (DE-588)4038403-2 gnd rswk-swf Eingebettetes System (DE-588)4396978-1 gnd rswk-swf Non-native speaker (DE-588)4438849-4 gnd rswk-swf Automatische Spracherkennung (DE-588)4003961-4 gnd rswk-swf Sprachdaten (DE-588)4312652-2 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Eingebettetes System (DE-588)4396978-1 s Automatische Spracherkennung (DE-588)4003961-4 s Mehrsprachigkeit (DE-588)4038403-2 s Codebuch (DE-588)4697624-3 s Non-native speaker (DE-588)4438849-4 s Sprachdaten (DE-588)4312652-2 s DE-604 Studien zur Mustererkennung 32 (DE-604)BV013645858 32 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020435274&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Raab, Martin Real world approaches for multilingual and non-native speech recognition Studien zur Mustererkennung Codebuch (DE-588)4697624-3 gnd Mehrsprachigkeit (DE-588)4038403-2 gnd Eingebettetes System (DE-588)4396978-1 gnd Non-native speaker (DE-588)4438849-4 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd Sprachdaten (DE-588)4312652-2 gnd |
subject_GND | (DE-588)4697624-3 (DE-588)4038403-2 (DE-588)4396978-1 (DE-588)4438849-4 (DE-588)4003961-4 (DE-588)4312652-2 (DE-588)4113937-9 |
title | Real world approaches for multilingual and non-native speech recognition |
title_auth | Real world approaches for multilingual and non-native speech recognition |
title_exact_search | Real world approaches for multilingual and non-native speech recognition |
title_full | Real world approaches for multilingual and non-native speech recognition von Martin Raab |
title_fullStr | Real world approaches for multilingual and non-native speech recognition von Martin Raab |
title_full_unstemmed | Real world approaches for multilingual and non-native speech recognition von Martin Raab |
title_short | Real world approaches for multilingual and non-native speech recognition |
title_sort | real world approaches for multilingual and non native speech recognition |
topic | Codebuch (DE-588)4697624-3 gnd Mehrsprachigkeit (DE-588)4038403-2 gnd Eingebettetes System (DE-588)4396978-1 gnd Non-native speaker (DE-588)4438849-4 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd Sprachdaten (DE-588)4312652-2 gnd |
topic_facet | Codebuch Mehrsprachigkeit Eingebettetes System Non-native speaker Automatische Spracherkennung Sprachdaten Hochschulschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020435274&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV013645858 |
work_keys_str_mv | AT raabmartin realworldapproachesformultilingualandnonnativespeechrecognition |