Automatic speaker identification by artificial neural networks:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | German |
Veröffentlicht: |
Aachen
Shaker
1997
|
Ausgabe: | Als Ms. gedr. |
Schriftenreihe: | Berichte aus der Informatik
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Zugl.: Ulm, Univ., Diss., 1997 |
Beschreibung: | VIII, 193 S. graph. Darst. |
ISBN: | 382653073X |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV011604679 | ||
003 | DE-604 | ||
005 | 19980407 | ||
007 | t | ||
008 | 971027s1997 gw d||| m||| 00||| ger d | ||
016 | 7 | |a 951728385 |2 DE-101 | |
020 | |a 382653073X |c kart. : DM 98.00, sfr 99.00, S 689.00 |9 3-8265-3073-X | ||
035 | |a (OCoLC)75808114 | ||
035 | |a (DE-599)BVBBV011604679 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a ger | |
044 | |a gw |c DE | ||
049 | |a DE-29T | ||
100 | 1 | |a He, Jialong |e Verfasser |0 (DE-588)115771654 |4 aut | |
245 | 1 | 0 | |a Automatic speaker identification by artificial neural networks |c Jialong He |
250 | |a Als Ms. gedr. | ||
264 | 1 | |a Aachen |b Shaker |c 1997 | |
300 | |a VIII, 193 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Berichte aus der Informatik | |
500 | |a Zugl.: Ulm, Univ., Diss., 1997 | ||
650 | 0 | 7 | |a Neuronales Netz |0 (DE-588)4226127-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Automatische Sprechererkennung |0 (DE-588)4143704-4 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Automatische Sprechererkennung |0 (DE-588)4143704-4 |D s |
689 | 0 | 1 | |a Neuronales Netz |0 (DE-588)4226127-2 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007817822&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-007817822 |
Datensatz im Suchindex
_version_ | 1807323525857411072 |
---|---|
adam_text |
TABLE
OF
CONTENTS
1
INTRODUCTION
.
1
1.1
BASIC
CONCEPTS
.
1
1.2
HISTORY
REVIEW
.
6
1.3
TYPICAL
SPEAKER
RECOGNITION
METHODS
.
9
1.3.1
VQ
BASED
METHOD
.
9
1.3.2
HMM
BASED
METHOD
.
13
1.3.3
NEURAL
NETWORK
METHOD
.
15
1.3.4
PARAMETRIC
MODEL
AND
OTHERS
.
17
2
DIGITAL
MODEL
OF
SPEECH
SIGNALS
.
25
2.1
ACOUSTIC
THEORY
.25
2.2
THE
LOSSLESS
TUBE
MODEL
.
28
2.3
DIGITAL
MODEL
.
31
2.4
LINEAR
PREDICTIVE
ANALYSIS
.33
2.5
HOMOMORPHIC
PROCESSING
.37
2.6
PITCH
DETECTION
.
42
2.7
SPEECH
DATABASE
.
49
2.7.1
THE
TIMIT
DATABASE
.
49
2.7.2
THE
YOHO
DATABASE
.
50
3
ARTIFICIAL
NEURAL
NETWORKS
.
52
3.1
INTRODUCTION
.
53
3.2
LEARNING
VECTOR
QUANTIZATION
NETWORK
.
54
3.2.1
THE
LBG
ALGORITHM
.
54
3.2.2
THE
LVQ
ALGORITHM
.
58
3.3
MULTI-LAYER
PERCEPTRON
NETWORK
.
61
3.3.1
BACK-PROPAGATION
ALGORITHM
.
62
3.3.2
GENERALIZED
OPTIMIZATION
STRATEGY
.
65
3.3.3
STEEPEST
GRADIENT
DESCENT
.
66
3.3.4
CONJUGATE
GRADIENT
WITH
LINE
SEARCH
.
66
3.3.5
SCALED
CONJUGATE
GRADIENT
.
69
3.3.6
QUICK
PROPAGATION
.
70
IV
TABLE
OF
CONTENTS
3.3.7
CLASSIFICATION
BORDERS
FORMED
BY
A
MLP
NETWORK
.
71
3.4
RADIAL
BASIS
FUNCTION
NETWORK
.
72
3.4.1
INTRODUCTION
.
72
3.4.2
RADIAL
BASIS
FUNCTIONS
.
73
3.4.3
DETERMINING
THE
WEIGHT
MATRIX
.
74
3.5
GAUSSIAN
MIXTURE
MODEL
.
76
3.5.1
MULTIVARIATE
GAUSSIAN
FUNCTION
.
76
3.5.2
GAUSSIAN
MIXTURE
MODEL
.
78
3.5.3
ESTIMATION
OF
PARAMETERS
IN
THE
GMM
.
78
3.5.4
MAXIMUM
LIKELIHOOD
CLASSIFIER
.
82
3.5.5
ANALYSIS
FOR
A
TWO-CLASS
PROBLEM
.
83
4
DIMENSIONALITY
REDUCTION
.
88
4.1
INTRODUCTION
.
88
4.2
OPTIMIZATION
CRITERIA
.
90
4.3
FEATURE
EXTRACTION
BY
LINEAR
MAPPING
.
93
4.4
FEATURE
SELECTION
ALGORITHM
.
96
4.4.1
FEATURE
SELECTION
BASED
ON
THE
INDIVIDUAL
BEST
FEATURE
.
96
4.4.2
THE
BRANCH
AND
BOUND
ALGORITHM
.
97
4.4.3
SEQUENTIAL
FORWARD
SEARCH
.
97
4.4.4
SEQUENTIAL
BACKWARD
SEARCH
.
98
5
RESIDUAL
CEPSTRUM
.
100
5.1
INTRODUCTION
.
100
5.2
CALCULATION
OF
THE
RESIDUAL
CEPSTRUM
.
101
5.3
EVALUATION
RESULTS
.
105
6
SELECT
EFFECTIVE
FEATURES
.
109
6.1
INTRODUCTION
.
109
6.2
FEATURE
SELECTION
CRITERIA
.
110
6.3
EXPERIMENT
RESULTS
.
112
7
VQ-BASED
SPEAKER
IDENTIFICATION
.
122
7.1
INTRODUCTION
.
123
7.2
SENTENCE
LEVEL
DECISION
RULES
.
124
TABLE
OF
CONTENTS
V
7.3
PERFORMANCE
OF
LBG
AND
LVQ
CODEBOOKS
.
125
7.4
GROUP
VECTOR
QUANTIZATION
ALGORITHM
.
130
7.5
ANALYSIS
FOR
A
TWO-CLASS
PROBLEM
.
133
7.5.1
DISTRIBUTION
OF
THE
DISTANCE
DIFFERENCE
.
134
7.5.2
SPEECH
VECTOR
SEQUENCE
.
137
7.5.3
RELATION
TO
THE
MAP
AND
MMI
CRITERIA
.
139
7.6
PERFORMANCE
OF
GVQ
CODEBOOK
.
140
8
SPEAKER
IDENTIFICATION
BY
MLP
AND
LVQ-SLP
NETWORKSL46
8.1
INTRODUCTION
.
146
8.2
SPEAKER
IDENTIFICATION
BY
MLP
NETWORKS
.
147
8.3
LVQ-SLP
HYBRID
NETWORK
ARCHITECTURE
.
150
8.4
TRAINING
PROCEDURE
.
152
8.5
EVALUATION
RESULTS
.
154
9
LEARNING
GAUSSIAN
MIXTURE
MODEL
.
158
9.1
INTRODUCTION
.
158
9.2
A
DISCRIMINATIVE
TRAINING
ALGORITHM
.
159
9.3
ANALYSIS
FOR
A
TWO-CLASS
PROBLEM
.
163
9.3.1
LIKELIHOOD
RATIO
.
163
9.3.2
RELATION
TO
THE
MAP
AND
MMI
CRITERIA
.
167
9.3.3
CLASSIFICATION
FOR
VECTOR
SEQUENCE
.
167
9.4
EXPERIMENTAL
RESULTS
.
169
9.4.1
RESULTS
WITH
THE
TIMIT
DATABASE
.
171
9.4.2
RESULTS
WITH
THE
YOHO
DATABASE
.
173
10
SUMMARY
AND
CONCLUSION
.
177
10.1
SUMMARY
OF
EXPERIMENTAL
RESULTS
.
177
10.2
CONTRIBUTIONS
OF
THE
THESIS
.
182
10.3
FUTURE
RESEARCH
DIRECTIONS
.
184
11
BIBLIOGRAPHY
.
186 |
any_adam_object | 1 |
author | He, Jialong |
author_GND | (DE-588)115771654 |
author_facet | He, Jialong |
author_role | aut |
author_sort | He, Jialong |
author_variant | j h jh |
building | Verbundindex |
bvnumber | BV011604679 |
ctrlnum | (OCoLC)75808114 (DE-599)BVBBV011604679 |
edition | Als Ms. gedr. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 c 4500</leader><controlfield tag="001">BV011604679</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">19980407</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">971027s1997 gw d||| m||| 00||| ger d</controlfield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">951728385</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">382653073X</subfield><subfield code="c">kart. : DM 98.00, sfr 99.00, S 689.00</subfield><subfield code="9">3-8265-3073-X</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)75808114</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV011604679</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">ger</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">DE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">He, Jialong</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)115771654</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Automatic speaker identification by artificial neural networks</subfield><subfield code="c">Jialong He</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">Als Ms. gedr.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Aachen</subfield><subfield code="b">Shaker</subfield><subfield code="c">1997</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">VIII, 193 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Berichte aus der Informatik</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Zugl.: Ulm, Univ., Diss., 1997</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Neuronales Netz</subfield><subfield code="0">(DE-588)4226127-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Sprechererkennung</subfield><subfield code="0">(DE-588)4143704-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Automatische Sprechererkennung</subfield><subfield code="0">(DE-588)4143704-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Neuronales Netz</subfield><subfield code="0">(DE-588)4226127-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007817822&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-007817822</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV011604679 |
illustrated | Illustrated |
indexdate | 2024-08-14T01:13:46Z |
institution | BVB |
isbn | 382653073X |
language | German |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-007817822 |
oclc_num | 75808114 |
open_access_boolean | |
owner | DE-29T |
owner_facet | DE-29T |
physical | VIII, 193 S. graph. Darst. |
publishDate | 1997 |
publishDateSearch | 1997 |
publishDateSort | 1997 |
publisher | Shaker |
record_format | marc |
series2 | Berichte aus der Informatik |
spelling | He, Jialong Verfasser (DE-588)115771654 aut Automatic speaker identification by artificial neural networks Jialong He Als Ms. gedr. Aachen Shaker 1997 VIII, 193 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Berichte aus der Informatik Zugl.: Ulm, Univ., Diss., 1997 Neuronales Netz (DE-588)4226127-2 gnd rswk-swf Automatische Sprechererkennung (DE-588)4143704-4 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Automatische Sprechererkennung (DE-588)4143704-4 s Neuronales Netz (DE-588)4226127-2 s DE-604 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007817822&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | He, Jialong Automatic speaker identification by artificial neural networks Neuronales Netz (DE-588)4226127-2 gnd Automatische Sprechererkennung (DE-588)4143704-4 gnd |
subject_GND | (DE-588)4226127-2 (DE-588)4143704-4 (DE-588)4113937-9 |
title | Automatic speaker identification by artificial neural networks |
title_auth | Automatic speaker identification by artificial neural networks |
title_exact_search | Automatic speaker identification by artificial neural networks |
title_full | Automatic speaker identification by artificial neural networks Jialong He |
title_fullStr | Automatic speaker identification by artificial neural networks Jialong He |
title_full_unstemmed | Automatic speaker identification by artificial neural networks Jialong He |
title_short | Automatic speaker identification by artificial neural networks |
title_sort | automatic speaker identification by artificial neural networks |
topic | Neuronales Netz (DE-588)4226127-2 gnd Automatische Sprechererkennung (DE-588)4143704-4 gnd |
topic_facet | Neuronales Netz Automatische Sprechererkennung Hochschulschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007817822&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT hejialong automaticspeakeridentificationbyartificialneuralnetworks |