Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Abschlussarbeit Buch |
Sprache: | English |
Veröffentlicht: |
Oldenburg
BIS-Verlag der Carl von Ossietzky Universität Oldenburg
2016
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | ix, 176 Seiten Illustrationen, Diagramme |
ISBN: | 9783814223339 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV043671554 | ||
003 | DE-604 | ||
005 | 20170407 | ||
007 | t | ||
008 | 160714s2016 a||| m||| 00||| eng d | ||
020 | |a 9783814223339 |9 978-3-8142-2333-9 | ||
035 | |a (OCoLC)953680066 | ||
035 | |a (DE-599)GBV860033082 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-19 |a DE-83 |a DE-12 | ||
084 | |a ZN 6060 |0 (DE-625)157500: |2 rvk | ||
084 | |a ZN 6070 |0 (DE-625)157501: |2 rvk | ||
100 | 1 | |a Schädler, Marc René |e Verfasser |0 (DE-588)1106414667 |4 aut | |
245 | 1 | 0 | |a Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features |c Marc René Schädler |
264 | 1 | |a Oldenburg |b BIS-Verlag der Carl von Ossietzky Universität Oldenburg |c 2016 | |
300 | |a ix, 176 Seiten |b Illustrationen, Diagramme | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
502 | |b Dissertation |c Carl von Ossietzky Universität Oldenburg | ||
650 | 0 | 7 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachsignal |0 (DE-588)4056494-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |D s |
689 | 0 | 1 | |a Sprachsignal |0 (DE-588)4056494-0 |D s |
689 | 0 | 2 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029084674&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-029084674 |
Datensatz im Suchindex
_version_ | 1804176432660742144 |
---|---|
adam_text | CONTENTS
1 GENERAL INTRODUCTION 1
2 GABOR FILTER BANK FEATURES FOR ROBUST ASR 9
2.1 INTRODUCTION 10
2.2 GABOR FILTER BANK FEATURES 14
2.2.1 CALCULATION OF THE GBFB FEATURES 14
2.2.2 EXPERIMENTS 22
2.2.3 RESULTS AND DISCUSSION 26
2.3 ROBUSTNESS OF THE GABOR FILTER BANK FEA
TURES 32
2.3.1 BASELINE FEATURES 33
2.3.2 EXPERIMENTS 33
2.3.3 RESULTS AND DISCUSSION 36
2.4 SUMMARY AND FURTHER DISCUSSION 49
2.4.1 ROBUSTNESS OF GBFB FEATURES AGAINST EXTRINSIC VARI
ABILITY 49
2.4.2 COMPLEMENTARY INFORMATION 49
2.4.3 FUTURE WORK 50
2.5 CONCLUSIONS 51
3 NORMALIZATION OF GBFB FEATURES FOR IMPROVED ROBUST ASR 53
3.1 INTRODUCTION 54
3.2 METHODS 55
3.2.1 GABOR FILTER BANK FEATURES 55
3.2.2 NORMALIZATION OF FEATURE VALUE STATISTICS 57
3.2.3 RECOGNITION EXPERIMENT AND BASELINE 58
3.2.4 SPECTRAL AND TEMPORAL CONTRIBUTION 58
3.3 RESULTS AND DISCUSSION 59
3.3.1 NORMALIZED GBFB FEATURES 59
3.3.2 SPECTRAL VS. TEMPORAL NORMALIZATION 60
3.4 CONCLUSIONS 62
4 GBFB FEATURES FOR ROBUST MEDIUM-SIZE VOCABULARY ASR 63
4.1 INTRODUCTION 63
4.2 METHODS 65
4.2.1 GABOR FILTER BANK FEATURES 65
4.2.2 RECOGNITION EXPERIMENT AND BASELINE 67
4.2.3 PARAMETER SEARCH 68
4.3 RESULTS AND DISCUSSION 69
4.4 CONCLUSIONS 72
5 SEPARABLE, LESS COMPLEX GBFB FEATURES FOR ROBUST ASR 73
5.1 INTRODUCTION 74
5.2 METHODS 79
5.2.1 SPECTRO-TEMPORAL REPRESENTATION 79
5.2.2 GABOR FILTER BANK FEATURES 80
5.2.3 SEPARATE GABOR FILTER BANK FEATURES 83
5.2.4 FEATURE NORMALIZATION 88
5.2.5 RECOGNITION EXPERIMENT 89
5.2.6 ROBUSTNESS MEASURE 91
5.2.7 REFERENCE SYSTEMS 91
5.2.8 MAN-MACHINE GAP 93
5.2.9 REFERENCE IMPLEMENTATIONS 94
5.3 RESULTS 94
5.3.1 PERFORMANCE OF REFERENCE SYSTEM AND DATA REPRE
SENTATION 94
5.3.2 SINGLE SGBFB FEATURES 96
5.3.3 DUAL SGBFB FEATURES 97
5.3.4 COMPLETE SGBFB FEATURES 98
5.3.5 QUANTITY OF TRAINING DATA 99
5.3.6 REMAINING MAN-MACHINE GAP 99
5.4 DISCUSSION 100
5.4.1 MODULATION PHASES 100
5.4.2 ID VS 2D GABOR FILTER COMPLEXITY 102
5.4.3 REMAINING MAN-MACHINE GAP 103
5.5 CONCLUSIONS 104
6 SPEECH INTELLIGIBILITY PREDICTION WITH ASR 105
6.1 INTRODUCTION 106
6.2 METHODS 110
6.2.1 SPEECH INTELLIGIBILITY MEASUREMENTS 110
6.2.2 AUTOMATIC SPEECH RECOGNIZER 111
6.2.3 PREDICTING SRTS WITH THE AUTOMATIC SPEECH RECOGNIZER 114
6.2.4 SPEECH INTELLIGIBILITY INDEX 115
6.3 RESULTS 115
6.3.1 EMPIRICAL DATA 115
6.3.2 ASR-BASED PREDICTIONS 116
6.4 DISCUSSION 120
6.5 CONCLUSIONS 124
7 MODELING AUDITORY DISCRIMINATION EXPERIMENTS WITH ASR 125
7.1 INTRODUCTION 126
7.2 METHODS 129
7.2.1 EXPERIMENTS 129
7.2.2 SIGNAL REPRESENTATIONS 131
7.2.3 SIMULATION FRAMEWORK FOR AUDITORY DISCRIMINATION
EXPERIMENTS 137
7.3 RESULTS 142
7.3.1 SIMULTANEOUS MASKING 142
7.3.2 SPECTRAL MASKING 144
7.3.3 GERMAN MATRIX SENTENCE TEST 146
7.3.4 EFFECT OF BACK-END PARAMETER VARIATIONS 150
7.3.5 MAN-MACHINE GAP 152
7.3.6 EFFECT OF FEATURE VECTOR NORMALIZATION 153
7.4 DISCUSSION 153
7.4.1 INTERPRETATION OF SIMULATED THRESHOLDS 154
7.4.2 SIGNAL PROCESSING DEPENDENCE OF SIMULATED THRESHOLDS 155
7.4.3 REQUIRED ASSUMPTIONS FOR ADE SIMULATIONS 156
7.4.4 GENERALIZATION OF THE FADE APPROACH 158
7.4.5 ACROSS-FREQUENCY PROCESSING AND RELATION TO TEMPO
RAL PROCESSING 158
7.5 CONCLUSIONS 160
8 GENERAL CONCLUSIONS 161
BIBLIOGRAPHY 165
|
any_adam_object | 1 |
author | Schädler, Marc René |
author_GND | (DE-588)1106414667 |
author_facet | Schädler, Marc René |
author_role | aut |
author_sort | Schädler, Marc René |
author_variant | m r s mr mrs |
building | Verbundindex |
bvnumber | BV043671554 |
classification_rvk | ZN 6060 ZN 6070 |
ctrlnum | (OCoLC)953680066 (DE-599)GBV860033082 |
discipline | Elektrotechnik / Elektronik / Nachrichtentechnik |
format | Thesis Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01788nam a2200397 c 4500</leader><controlfield tag="001">BV043671554</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20170407 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">160714s2016 a||| m||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783814223339</subfield><subfield code="9">978-3-8142-2333-9</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)953680066</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)GBV860033082</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-19</subfield><subfield code="a">DE-83</subfield><subfield code="a">DE-12</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ZN 6060</subfield><subfield code="0">(DE-625)157500:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ZN 6070</subfield><subfield code="0">(DE-625)157501:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Schädler, Marc René</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1106414667</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features</subfield><subfield code="c">Marc René Schädler</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Oldenburg</subfield><subfield code="b">BIS-Verlag der Carl von Ossietzky Universität Oldenburg</subfield><subfield code="c">2016</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">ix, 176 Seiten</subfield><subfield code="b">Illustrationen, Diagramme</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="502" ind1=" " ind2=" "><subfield code="b">Dissertation</subfield><subfield code="c">Carl von Ossietzky Universität Oldenburg</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachsignal</subfield><subfield code="0">(DE-588)4056494-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Sprachsignal</subfield><subfield code="0">(DE-588)4056494-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029084674&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-029084674</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV043671554 |
illustrated | Illustrated |
indexdate | 2024-07-10T07:32:06Z |
institution | BVB |
isbn | 9783814223339 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-029084674 |
oclc_num | 953680066 |
open_access_boolean | |
owner | DE-19 DE-BY-UBM DE-83 DE-12 |
owner_facet | DE-19 DE-BY-UBM DE-83 DE-12 |
physical | ix, 176 Seiten Illustrationen, Diagramme |
publishDate | 2016 |
publishDateSearch | 2016 |
publishDateSort | 2016 |
publisher | BIS-Verlag der Carl von Ossietzky Universität Oldenburg |
record_format | marc |
spelling | Schädler, Marc René Verfasser (DE-588)1106414667 aut Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features Marc René Schädler Oldenburg BIS-Verlag der Carl von Ossietzky Universität Oldenburg 2016 ix, 176 Seiten Illustrationen, Diagramme txt rdacontent n rdamedia nc rdacarrier Dissertation Carl von Ossietzky Universität Oldenburg Sprachverarbeitung (DE-588)4116579-2 gnd rswk-swf Sprachsignal (DE-588)4056494-0 gnd rswk-swf Automatische Spracherkennung (DE-588)4003961-4 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Automatische Spracherkennung (DE-588)4003961-4 s Sprachsignal (DE-588)4056494-0 s Sprachverarbeitung (DE-588)4116579-2 s DE-604 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029084674&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Schädler, Marc René Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features Sprachverarbeitung (DE-588)4116579-2 gnd Sprachsignal (DE-588)4056494-0 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd |
subject_GND | (DE-588)4116579-2 (DE-588)4056494-0 (DE-588)4003961-4 (DE-588)4113937-9 |
title | Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features |
title_auth | Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features |
title_exact_search | Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features |
title_full | Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features Marc René Schädler |
title_fullStr | Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features Marc René Schädler |
title_full_unstemmed | Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features Marc René Schädler |
title_short | Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features |
title_sort | robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro temporal features |
topic | Sprachverarbeitung (DE-588)4116579-2 gnd Sprachsignal (DE-588)4056494-0 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd |
topic_facet | Sprachverarbeitung Sprachsignal Automatische Spracherkennung Hochschulschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029084674&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT schadlermarcrene robustautomaticspeechrecognitionandmodelingofauditorydiscriminationexperimentswithauditoryspectrotemporalfeatures |