Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Abschlussarbeit Buch |
Sprache: | English |
Veröffentlicht: |
Düren
Shaker Verlag
2022
|
Ausgabe: | 1. Auflage |
Schriftenreihe: | Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig
70 |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | ix, 139 Seiten Illustrationen, Diagramme 21 cm x 14.8 cm, 228 g |
ISBN: | 9783844087796 3844087796 |
Internformat
MARC
LEADER | 00000nam a22000008cb4500 | ||
---|---|---|---|
001 | BV048934208 | ||
003 | DE-604 | ||
005 | 20240117 | ||
007 | t | ||
008 | 230509s2022 gw a||| m||| 00||| eng d | ||
015 | |a 22,N37 |2 dnb | ||
016 | 7 | |a 1267500433 |2 DE-101 | |
020 | |a 9783844087796 |c : EUR 48.80 (DE), EUR 48.80 (AT) |9 978-3-8440-8779-6 | ||
020 | |a 3844087796 |9 3-8440-8779-6 | ||
024 | 3 | |a 9783844087796 | |
035 | |a (OCoLC)1381344258 | ||
035 | |a (DE-599)DNB1267500433 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
044 | |a gw |c XA-DE-NW | ||
049 | |a DE-83 | ||
084 | |a ZN 6070 |0 (DE-625)157501: |2 rvk | ||
084 | |8 1\p |a 621.3 |2 23sdnb | ||
100 | 1 | |a Zhao, Ziyue |e Verfasser |0 (DE-588)1282277634 |4 aut | |
245 | 1 | 0 | |a Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions |c Ziyue Zhao |
250 | |a 1. Auflage | ||
264 | 1 | |a Düren |b Shaker Verlag |c 2022 | |
300 | |a ix, 139 Seiten |b Illustrationen, Diagramme |c 21 cm x 14.8 cm, 228 g | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig |v 70 | |
502 | |b Dissertation |c Technische Universität Carolo-Wilhelmina zu Braunschweig |d 2022 | ||
650 | 0 | 7 | |a Sprachcodierung |0 (DE-588)4182502-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Neuronales Netz |0 (DE-588)4226127-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachqualität |0 (DE-588)4536190-3 |2 gnd |9 rswk-swf |
653 | |a Speech Enhancement | ||
653 | |a Speech Coding | ||
653 | |a Neural Networks | ||
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |D s |
689 | 0 | 1 | |a Sprachcodierung |0 (DE-588)4182502-0 |D s |
689 | 0 | 2 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |D s |
689 | 0 | 3 | |a Neuronales Netz |0 (DE-588)4226127-2 |D s |
689 | 0 | 4 | |a Sprachqualität |0 (DE-588)4536190-3 |D s |
689 | 0 | |5 DE-604 | |
710 | 2 | |a Shaker Verlag |0 (DE-588)1064118135 |4 pbl | |
830 | 0 | |a Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig |v 70 |w (DE-604)BV035238624 |9 70 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=034198094&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-034198094 | ||
883 | 1 | |8 1\p |a vlb |d 20220907 |q DE-101 |u https://d-nb.info/provenance/plan#vlb |
Datensatz im Suchindex
_version_ | 1804185128843345920 |
---|---|
adam_text | CONTENTS
1
INTRODUCTION
1
1.1
NEURAL
NETWORK-BASED
SPEECH
PROCESSING
.........................................................
1
1.2
IMPROVED
SPEECH
DECODING
.................................................................................
5
1.3
OUTLINE
OF
THE
THESIS
...........................................................................................
6
2
NONLINEAR
PREDICTION
OF
SPEECH
9
2.1
INTRODUCTION
...........................................................................................................
9
2.1.1
FRAME
AND
SAMPLE-BASED
LINEAR
PREDICTION
.......................................
9
2.1.2
STATE-OF-THE-ART
NONLINEAR
PREDICTION
APPROACHES
..............................
10
2.2
BASELINE
SPEECH
PREDICTION
METHODS
..................................................................
12
2.2.1
LINEAR
PREDICTION
OF
SPEECH
....................................................................
12
2.2.2
NONLINEAR
PREDICTION
OF
SPEECH
BY
NEURAL
NETWORKS
..........................
15
2.3
NEW
SPEECH
PREDICTION
BY
ESNS
........................................................................
20
2.3.1
ESN
TOPOLOGY
.............................................................................................
20
2.3.2
ESN
WEIGHT
ADAPTATION
..........................................................................
21
2.4
SIMULATION
SETUP
.................................................................................................
22
2.4.1
DATABASE
...................................................................................................
22
2.4.2
PREDICTION
FRAMEWORKS
AND
EVALUATION
METRICS
.................................
22
2.4.3
TRAINING
SETTINGS
OF
NEURAL
NETWORKS
....................................................
23
2.5
SIMULATION
RESULTS
..............................................................................................
24
2.5.1
PRELIMINARY
EXPERIMENTS
ON
PREDICTOR
PARAMETERS
.............................
24
2.5.2
MAJOR
EXPERIMENTS
ON
PREDICTION
PERFORMANCE
....................................
26
2.6
SUMMARY
.............................................................................................................
28
3
PERCEPTUAL
LOSS
FUNCTIONS
FOR
NEURAL
NETWORK-BASED
SPEECH
ENHANCE
MENT
29
3.1
INTRODUCTION
..........................................................................................................
30
3.1.1
PERCEPTUAL
PROCESSING
IN
SPEECH
CODING
............................................
30
3.1.2
STATE-OF-THE-ART
LOSS
FUNCTIONS
IN
SPEECH
ENHANCEMENT
...................
30
3.2
BASELINE
LOSS
FUNCTIONS
....................................................................................
33
3.2.1
MSE
LOSS
FUNCTIONS
...............................................................................
34
3.2.2
A
PESQ-BASED
PERCEPTUAL
LOSS
FUNCTION
.............................................
35
3.3
REVISITING
THE
PWF
IN
CELP
SPEECH
CODING
...................................................
35
3.4
NEW
PWF
LOSS
FUNCTIONS
...................................................................................
37
3.4.1
PWF
LOSS
FUNCTION
WITH
SPECTRAL
AMPLITUDES
................................
37
3.4.2
COMPLEX
PWF
LOSS
FUNCTION
................................................................
39
3.5
SIMULATION
SETUP
...................................................................................................
40
3.5.1
DATABASES
.....................................................................................................
40
3.5.2
FRAMEWORK
STRUCTURES
..............................................................................
40
3.5.3
NEURAL
NETWORK
TOPOLOGIES
AND
TRAINING
SETTINGS
.............................
41
3.5.4
EVALUATION
METRICS
....................................................................................
44
3.6
SIMULATION
RESULTS
................................................................................................
46
3.6.1
PRELIMINARY
EXPERIMENTS
ON
LOSS
FUNCTIONS
........................................
46
3.6.2
MAJOR
EXPERIMENTS
ON
LOSS
FUNCTIONS
WITH
FCNNS
............................
48
3.6.3
MAJOR
EXPERIMENTS
ON
LOSS
FUNCTIONS
WITH
CNNS
............................
52
3.7
SUMMARY
...............................................................................................................
53
4
NEURAL
NETWORK-BASED
POSTPROCESSORS
FOR
CODED
SPEECH
55
4.1
INTRODUCTION
............................................................................................................
55
4.1.1
SOME
IMPORTANT
SPEECH
CODECS
..............................................................
55
4.1.2
STATE-OF-THE-ART
POSTPROCESSORS
.................................................................
57
4.2
BASELINE
G.711
POSTFILTER
......................................................................................
59
4.3
NEW
CNN-BASED
POSTPROCESSORS
.........................................................................
62
4.3.1
PROCESSING
FRAMEWORK
..............................................................................
62
4.3.2
CNN
TOPOLOGY
...........................................................................................
65
4.4
NEW
FCRN-BASED
POSTPROCESSORS
......................................................................
67
4.4.1
PROCESSING
FRAMEWORK
.............................................................................
67
4.4.2
FCRN
TOPOLOGY
........................................................................................
69
4.4.3
FCRN
WITH
COMPLEX
PWF
LOSS
FUNCTION
..........................................
70
4.5
SIMULATION
SETUP
...................................................................................................
71
4.5.1
DATABASES
....................................................................................................
71
4.5.2
PROCESSING
PLANS
........................................................................................
72
4.5.3
TRAINING
SETTINGS
.......................................................................................
75
4.5.4
EVALUATION
METRICS
....................................................................................
76
4.6
SIMULATION
RESULTS
...............................................................................................
77
4.6.1
EXPERIMENTS
ON
THE
NEW
CNN-BASED
POSTPROCESSORS
........................
77
4.6.2
EXPERIMENTS
ON
THE
NEW
FCRN-BASED
POSTPROCESSOR
........................
91
4.7
SUMMARY
..................................................................................................................
104
5
CONCLUSIONS
AND
OUTLOOK
105
5.1
CONCLUSIONS
...............................................................................................................
105
5.2
FUTURE
CHALLENGES
.....................................................................................................
106
LIST
OF
SYMBOLS
109
LIST
OF
ABBREVIATIONS
113
BIBLIOGRAPHY
117
OWN
PUBLICATIONS
139
|
adam_txt |
CONTENTS
1
INTRODUCTION
1
1.1
NEURAL
NETWORK-BASED
SPEECH
PROCESSING
.
1
1.2
IMPROVED
SPEECH
DECODING
.
5
1.3
OUTLINE
OF
THE
THESIS
.
6
2
NONLINEAR
PREDICTION
OF
SPEECH
9
2.1
INTRODUCTION
.
9
2.1.1
FRAME
AND
SAMPLE-BASED
LINEAR
PREDICTION
.
9
2.1.2
STATE-OF-THE-ART
NONLINEAR
PREDICTION
APPROACHES
.
10
2.2
BASELINE
SPEECH
PREDICTION
METHODS
.
12
2.2.1
LINEAR
PREDICTION
OF
SPEECH
.
12
2.2.2
NONLINEAR
PREDICTION
OF
SPEECH
BY
NEURAL
NETWORKS
.
15
2.3
NEW
SPEECH
PREDICTION
BY
ESNS
.
20
2.3.1
ESN
TOPOLOGY
.
20
2.3.2
ESN
WEIGHT
ADAPTATION
.
21
2.4
SIMULATION
SETUP
.
22
2.4.1
DATABASE
.
22
2.4.2
PREDICTION
FRAMEWORKS
AND
EVALUATION
METRICS
.
22
2.4.3
TRAINING
SETTINGS
OF
NEURAL
NETWORKS
.
23
2.5
SIMULATION
RESULTS
.
24
2.5.1
PRELIMINARY
EXPERIMENTS
ON
PREDICTOR
PARAMETERS
.
24
2.5.2
MAJOR
EXPERIMENTS
ON
PREDICTION
PERFORMANCE
.
26
2.6
SUMMARY
.
28
3
PERCEPTUAL
LOSS
FUNCTIONS
FOR
NEURAL
NETWORK-BASED
SPEECH
ENHANCE
MENT
29
3.1
INTRODUCTION
.
30
3.1.1
PERCEPTUAL
PROCESSING
IN
SPEECH
CODING
.
30
3.1.2
STATE-OF-THE-ART
LOSS
FUNCTIONS
IN
SPEECH
ENHANCEMENT
.
30
3.2
BASELINE
LOSS
FUNCTIONS
.
33
3.2.1
MSE
LOSS
FUNCTIONS
.
34
3.2.2
A
PESQ-BASED
PERCEPTUAL
LOSS
FUNCTION
.
35
3.3
REVISITING
THE
PWF
IN
CELP
SPEECH
CODING
.
35
3.4
NEW
PWF
LOSS
FUNCTIONS
.
37
3.4.1
PWF
LOSS
FUNCTION
WITH
SPECTRAL
AMPLITUDES
.
37
3.4.2
COMPLEX
PWF
LOSS
FUNCTION
.
39
3.5
SIMULATION
SETUP
.
40
3.5.1
DATABASES
.
40
3.5.2
FRAMEWORK
STRUCTURES
.
40
3.5.3
NEURAL
NETWORK
TOPOLOGIES
AND
TRAINING
SETTINGS
.
41
3.5.4
EVALUATION
METRICS
.
44
3.6
SIMULATION
RESULTS
.
46
3.6.1
PRELIMINARY
EXPERIMENTS
ON
LOSS
FUNCTIONS
.
46
3.6.2
MAJOR
EXPERIMENTS
ON
LOSS
FUNCTIONS
WITH
FCNNS
.
48
3.6.3
MAJOR
EXPERIMENTS
ON
LOSS
FUNCTIONS
WITH
CNNS
.
52
3.7
SUMMARY
.
53
4
NEURAL
NETWORK-BASED
POSTPROCESSORS
FOR
CODED
SPEECH
55
4.1
INTRODUCTION
.
55
4.1.1
SOME
IMPORTANT
SPEECH
CODECS
.
55
4.1.2
STATE-OF-THE-ART
POSTPROCESSORS
.
57
4.2
BASELINE
G.711
POSTFILTER
.
59
4.3
NEW
CNN-BASED
POSTPROCESSORS
.
62
4.3.1
PROCESSING
FRAMEWORK
.
62
4.3.2
CNN
TOPOLOGY
.
65
4.4
NEW
FCRN-BASED
POSTPROCESSORS
.
67
4.4.1
PROCESSING
FRAMEWORK
.
67
4.4.2
FCRN
TOPOLOGY
.
69
4.4.3
FCRN
WITH
COMPLEX
PWF
LOSS
FUNCTION
.
70
4.5
SIMULATION
SETUP
.
71
4.5.1
DATABASES
.
71
4.5.2
PROCESSING
PLANS
.
72
4.5.3
TRAINING
SETTINGS
.
75
4.5.4
EVALUATION
METRICS
.
76
4.6
SIMULATION
RESULTS
.
77
4.6.1
EXPERIMENTS
ON
THE
NEW
CNN-BASED
POSTPROCESSORS
.
77
4.6.2
EXPERIMENTS
ON
THE
NEW
FCRN-BASED
POSTPROCESSOR
.
91
4.7
SUMMARY
.
104
5
CONCLUSIONS
AND
OUTLOOK
105
5.1
CONCLUSIONS
.
105
5.2
FUTURE
CHALLENGES
.
106
LIST
OF
SYMBOLS
109
LIST
OF
ABBREVIATIONS
113
BIBLIOGRAPHY
117
OWN
PUBLICATIONS
139 |
any_adam_object | 1 |
any_adam_object_boolean | 1 |
author | Zhao, Ziyue |
author_GND | (DE-588)1282277634 |
author_facet | Zhao, Ziyue |
author_role | aut |
author_sort | Zhao, Ziyue |
author_variant | z z zz |
building | Verbundindex |
bvnumber | BV048934208 |
classification_rvk | ZN 6070 |
ctrlnum | (OCoLC)1381344258 (DE-599)DNB1267500433 |
discipline | Elektrotechnik / Elektronik / Nachrichtentechnik |
discipline_str_mv | Elektrotechnik / Elektronik / Nachrichtentechnik |
edition | 1. Auflage |
format | Thesis Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02719nam a22006018cb4500</leader><controlfield tag="001">BV048934208</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20240117 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">230509s2022 gw a||| m||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">22,N37</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">1267500433</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783844087796</subfield><subfield code="c">: EUR 48.80 (DE), EUR 48.80 (AT)</subfield><subfield code="9">978-3-8440-8779-6</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">3844087796</subfield><subfield code="9">3-8440-8779-6</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9783844087796</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1381344258</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB1267500433</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">XA-DE-NW</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-83</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ZN 6070</subfield><subfield code="0">(DE-625)157501:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="8">1\p</subfield><subfield code="a">621.3</subfield><subfield code="2">23sdnb</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Zhao, Ziyue</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1282277634</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions</subfield><subfield code="c">Ziyue Zhao</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. Auflage</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Düren</subfield><subfield code="b">Shaker Verlag</subfield><subfield code="c">2022</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">ix, 139 Seiten</subfield><subfield code="b">Illustrationen, Diagramme</subfield><subfield code="c">21 cm x 14.8 cm, 228 g</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig</subfield><subfield code="v">70</subfield></datafield><datafield tag="502" ind1=" " ind2=" "><subfield code="b">Dissertation</subfield><subfield code="c">Technische Universität Carolo-Wilhelmina zu Braunschweig</subfield><subfield code="d">2022</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachcodierung</subfield><subfield code="0">(DE-588)4182502-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Neuronales Netz</subfield><subfield code="0">(DE-588)4226127-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachqualität</subfield><subfield code="0">(DE-588)4536190-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Speech Enhancement</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Speech Coding</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Neural Networks</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Sprachcodierung</subfield><subfield code="0">(DE-588)4182502-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="3"><subfield code="a">Neuronales Netz</subfield><subfield code="0">(DE-588)4226127-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="4"><subfield code="a">Sprachqualität</subfield><subfield code="0">(DE-588)4536190-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="710" ind1="2" ind2=" "><subfield code="a">Shaker Verlag</subfield><subfield code="0">(DE-588)1064118135</subfield><subfield code="4">pbl</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig</subfield><subfield code="v">70</subfield><subfield code="w">(DE-604)BV035238624</subfield><subfield code="9">70</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=034198094&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-034198094</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="a">vlb</subfield><subfield code="d">20220907</subfield><subfield code="q">DE-101</subfield><subfield code="u">https://d-nb.info/provenance/plan#vlb</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV048934208 |
illustrated | Illustrated |
index_date | 2024-07-03T21:57:55Z |
indexdate | 2024-07-10T09:50:19Z |
institution | BVB |
institution_GND | (DE-588)1064118135 |
isbn | 9783844087796 3844087796 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-034198094 |
oclc_num | 1381344258 |
open_access_boolean | |
owner | DE-83 |
owner_facet | DE-83 |
physical | ix, 139 Seiten Illustrationen, Diagramme 21 cm x 14.8 cm, 228 g |
publishDate | 2022 |
publishDateSearch | 2022 |
publishDateSort | 2022 |
publisher | Shaker Verlag |
record_format | marc |
series | Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig |
series2 | Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig |
spelling | Zhao, Ziyue Verfasser (DE-588)1282277634 aut Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions Ziyue Zhao 1. Auflage Düren Shaker Verlag 2022 ix, 139 Seiten Illustrationen, Diagramme 21 cm x 14.8 cm, 228 g txt rdacontent n rdamedia nc rdacarrier Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig 70 Dissertation Technische Universität Carolo-Wilhelmina zu Braunschweig 2022 Sprachcodierung (DE-588)4182502-0 gnd rswk-swf Automatische Spracherkennung (DE-588)4003961-4 gnd rswk-swf Neuronales Netz (DE-588)4226127-2 gnd rswk-swf Sprachverarbeitung (DE-588)4116579-2 gnd rswk-swf Sprachqualität (DE-588)4536190-3 gnd rswk-swf Speech Enhancement Speech Coding Neural Networks (DE-588)4113937-9 Hochschulschrift gnd-content Sprachverarbeitung (DE-588)4116579-2 s Sprachcodierung (DE-588)4182502-0 s Automatische Spracherkennung (DE-588)4003961-4 s Neuronales Netz (DE-588)4226127-2 s Sprachqualität (DE-588)4536190-3 s DE-604 Shaker Verlag (DE-588)1064118135 pbl Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig 70 (DE-604)BV035238624 70 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=034198094&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis 1\p vlb 20220907 DE-101 https://d-nb.info/provenance/plan#vlb |
spellingShingle | Zhao, Ziyue Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig Sprachcodierung (DE-588)4182502-0 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd Neuronales Netz (DE-588)4226127-2 gnd Sprachverarbeitung (DE-588)4116579-2 gnd Sprachqualität (DE-588)4536190-3 gnd |
subject_GND | (DE-588)4182502-0 (DE-588)4003961-4 (DE-588)4226127-2 (DE-588)4116579-2 (DE-588)4536190-3 (DE-588)4113937-9 |
title | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions |
title_auth | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions |
title_exact_search | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions |
title_exact_search_txtP | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions |
title_full | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions Ziyue Zhao |
title_fullStr | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions Ziyue Zhao |
title_full_unstemmed | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions Ziyue Zhao |
title_short | Contributions to Neural Network-Based Speech Processing: Nonlinear Speech Prediction, Decoder Postprocessing, and Perceptual Loss Functions |
title_sort | contributions to neural network based speech processing nonlinear speech prediction decoder postprocessing and perceptual loss functions |
topic | Sprachcodierung (DE-588)4182502-0 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd Neuronales Netz (DE-588)4226127-2 gnd Sprachverarbeitung (DE-588)4116579-2 gnd Sprachqualität (DE-588)4536190-3 gnd |
topic_facet | Sprachcodierung Automatische Spracherkennung Neuronales Netz Sprachverarbeitung Sprachqualität Hochschulschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=034198094&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV035238624 |
work_keys_str_mv | AT zhaoziyue contributionstoneuralnetworkbasedspeechprocessingnonlinearspeechpredictiondecoderpostprocessingandperceptuallossfunctions AT shakerverlag contributionstoneuralnetworkbasedspeechprocessingnonlinearspeechpredictiondecoderpostprocessingandperceptuallossfunctions |