Voice recognition by computer:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Marburg
Tectum-Verl.
2003
|
Ausgabe: | 1. Aufl. |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Zugl.: Wiesbaden, Fachhochsch., Diss., 2003 |
Beschreibung: | 110 S. zahlr. graph. Darst. : 21 cm |
ISBN: | 382888492X |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV020854834 | ||
003 | DE-604 | ||
007 | t| | ||
008 | 051103s2003 gw d||| m||| 00||| eng d | ||
016 | 7 | |a 967216621 |2 DE-101 | |
020 | |a 382888492X |c kart. : EUR 25.90, sfr 51.00 |9 3-8288-8492-X | ||
028 | 5 | 2 | |a 8492 |
035 | |a (OCoLC)55062054 | ||
035 | |a (DE-599)BVBBV020854834 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
044 | |a gw |c DE | ||
049 | |a DE-1046 | ||
084 | |a ST 306 |0 (DE-625)143654: |2 rvk | ||
100 | 1 | |a Sigmund, Milan |e Verfasser |4 aut | |
245 | 1 | 0 | |a Voice recognition by computer |c von Milan Sigmund |
250 | |a 1. Aufl. | ||
264 | 1 | |a Marburg |b Tectum-Verl. |c 2003 | |
300 | |a 110 S. |b zahlr. graph. Darst. : 21 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
500 | |a Zugl.: Wiesbaden, Fachhochsch., Diss., 2003 | ||
650 | 0 | 7 | |a Automatische Sprechererkennung |0 (DE-588)4143704-4 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Automatische Sprechererkennung |0 (DE-588)4143704-4 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=014176489&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-014176489 |
Datensatz im Suchindex
_version_ | 1826204857186385920 |
---|---|
adam_text |
CONTENTS
FOREWORD
1
INTRODUCTION
7
1.1
PROCESSING
OF
SPEECH
SIGNAL
.
7
1.2
SOME
HISTORY
AND
PRESENT
STATE
OF
SPEECH
PROCESSING
.
8
1.3
PUBLIC
SPEECH
DATABASES
.
10
1.4
TASK
ORIENTED
APPLICATIONS
OF
AUTOMATIC
VOICE
RECOGNITION
.
12
1.5
SPEECH
PROCESSING
AND
RECOGNITION
BY
HUMANS
.
14
1.6
PHONEMIC
NOTATION
OF
INDIVIDUAL
LANGUAGES
.
16
2
SIGNAL
PROCESSING
TECHNIQUES
IN
SPEECH
RECOGNITION
19
2.1
INTRODUCTION
.
19
2.2
PRE-PROCESSING
OF
THE
SPEECH
SIGNAL
.
20
2.2.1
DIGITALIZATION
.
20
2.2.2
PREEMPHASIS
.
21
2.2.3
FRAME
BLOCKING
.
21
2.2.4
WINDOWING
.
22
2.3
TIME
DOMAIN
METHODS
.
24
2.4
SPECTRAL
DOMAIN
METHODS
.
26
2.4.1
FREQUENCY-WARPED
FILTER
BANK
.
26
2.4.2
CEPSTRAL
ANALYSIS
.
28
2.5
LINEAR
PREDICTION
AND
DERIVED
METHODS
.
29
2.5.1
LINEAR
PREDICTIVE
ANALYSIS
.
29
2.5.2
RELATIONS
BETWEEN
VARIOUS
LP-DERIVED
SPEECH
PARAMETERS
.
31
2.6
FUNDAMENTAL
FREQUENCY
ESTIMATION
.
34
2.6.1
AMDF
ALGORITHM
.
35
2.6.2
CENTER-CLIPPING
ALGORITHM
.
36
2.7
PARAMETER
TRANSFORM
BY
DIFFERENTIATION
.
38
2.8
ENDPOINT
DETECTION
.
39
2.9
SPEECH
ENHANCEMENT
.
41
3
METHODS
OF
SPEAKER
RECOGNITION
43
3.1
SPEAKER
RECOGNITION
BY
HUMANS
.
43
3.2
VOICEPRINT
ANALYSIS
.
44
3.3
IDEAL
VOICE
RECOGNITION
.
46
3.4
PRINCIPLES
OF
SPEAKER
RECOGNITION
TECHNOLOGY
.
48
3.5
FEATURE
PARAMETERS
.
52
3.5.1
EVALUATION
OF
PARAMETERS
.
53
3.5.2
NORMALIZATION
TECHNIQUES
.
55
3.6
TEXT-DEPENDENT
SPEAKER
RECOGNITION
METHODS
.
55
3.6.1
DTW-BASED
METHODS
.
55
3.6.2
HMM-BASED
METHODS
.
58
3.7
TEXT-INDEPENDENT
SPEAKER
RECOGNITION
METHODS
.
58
3.7.1
LONG-TERM-STATISTICS-BASED
METHODS
.
58
3.7.2
VQ-BASED
METHODS
.
59
3.7.3
ERGODIC-HMM-BASED
METHODS
.
60
3.7.4
GAUSSIAN
MIXTURE
SPEAKER
MODELS
.
61
4
EXPERIMENTS
AND
RESULTS
63
4.1
SPEECH
LABELING
FOR
SPEAKER
RECOGNITION
.
63
4.1.1
INTRODUCTION
.
63
4.1.2
FONLABEL
PROGRAM
.
64
4.1.3
EFFECTIVENESS
OF
VARIOUS
PHONEMES
FOR
SPEAKER
RECOGNITION
.
66
4.1.4
CONCLUSIONS
.
68
4.2
ESTIMATION
OF
VOCAL
TRACT
LONG-TIME
SPECTRUM
.
69
4.2.1
INTRODUCTION
.
69
4.2.2
LONG-TIME
LPC
SPECTRUM
.
70
4.2.3
SPEECH
DATA
.
70
4.2.4
EXPERIMENTAL
RESULTS
.
70
4.2.5
SPEAKER
NORMALIZATION
BY
LONG-TIME
SPECTRUM
.
74
4.2.6
CONCLUSIONS
.
75
4.3
ANALYSIS
OFLMITATOR
'
S
VOICE
AND
SEX
IDENTIFICATION
.
76
4.3.1
INTRODUCTION
.
76
4.3.2
EFFECTIVENESS
OF
FEATURES
IN
TEXT-DEPENDENT
SPEAKER
RECOGNITION
.
76
4.3.3
UNCOVERING
AN
IMITATOR
.
78
4.3.4
AUTOMATIC
GENDER
DISTINCTION
BY
VOICE
.
84
4.3.5
CONCLUSIONS
.
88
4.4
EFFECTS
OF
EMOTIONAL
STRESS
AND
ALCOHOL
ON
SPEECH
.
89
4.4.1
INTRODUCTION
.
89
4.4.2
SPEECH
DATA
.
90
4.4.3
EFFECTS
OF
STRESS
ON
SPEECH
.
91
4.4.4
EFFECTS
OF
ALCOHOL
ON
SPEECH
.
94
4.4.5
CONCLUSIONS
.
96
5
VOICE
FOR
BIOMETRIC
APPLICATIONS
97
5.1
SECURITY
BIOMETRICAL
VERIFICATORS
.
97
5.2
FORENSIC
SPEAKER
RECOGNITION
.
98
5.2.1
LISTENER
METHOD
.
99
5.2.2
SPECTROGRAPHIC
METHOD
.
99
5.2.3
SEMI-AUTOMATIC
METHOD
.
100
6
SUMMARY
AND
FUTURE
OF
VOICE
RECOGNITION
101
7
REFERENCES
103
APPENDIX:
PROFESSIONAL
ASSOCIATIONS
(ISCA,
ELRA,
IAFP)
108
ABOUT
THE
AUTHOR
109 |
adam_txt | |
any_adam_object | 1 |
any_adam_object_boolean | |
author | Sigmund, Milan |
author_facet | Sigmund, Milan |
author_role | aut |
author_sort | Sigmund, Milan |
author_variant | m s ms |
building | Verbundindex |
bvnumber | BV020854834 |
classification_rvk | ST 306 |
ctrlnum | (OCoLC)55062054 (DE-599)BVBBV020854834 |
discipline | Informatik |
discipline_str_mv | Informatik |
edition | 1. Aufl. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 c 4500</leader><controlfield tag="001">BV020854834</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">051103s2003 gw d||| m||| 00||| eng d</controlfield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">967216621</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">382888492X</subfield><subfield code="c">kart. : EUR 25.90, sfr 51.00</subfield><subfield code="9">3-8288-8492-X</subfield></datafield><datafield tag="028" ind1="5" ind2="2"><subfield code="a">8492</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)55062054</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV020854834</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">DE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-1046</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 306</subfield><subfield code="0">(DE-625)143654:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Sigmund, Milan</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Voice recognition by computer</subfield><subfield code="c">von Milan Sigmund</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. Aufl.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Marburg</subfield><subfield code="b">Tectum-Verl.</subfield><subfield code="c">2003</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">110 S.</subfield><subfield code="b">zahlr. graph. Darst. : 21 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Zugl.: Wiesbaden, Fachhochsch., Diss., 2003</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Sprechererkennung</subfield><subfield code="0">(DE-588)4143704-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Automatische Sprechererkennung</subfield><subfield code="0">(DE-588)4143704-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=014176489&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-014176489</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV020854834 |
illustrated | Illustrated |
index_date | 2024-07-02T13:20:57Z |
indexdate | 2025-03-10T11:04:28Z |
institution | BVB |
isbn | 382888492X |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-014176489 |
oclc_num | 55062054 |
open_access_boolean | |
owner | DE-1046 |
owner_facet | DE-1046 |
physical | 110 S. zahlr. graph. Darst. : 21 cm |
publishDate | 2003 |
publishDateSearch | 2003 |
publishDateSort | 2003 |
publisher | Tectum-Verl. |
record_format | marc |
spelling | Sigmund, Milan Verfasser aut Voice recognition by computer von Milan Sigmund 1. Aufl. Marburg Tectum-Verl. 2003 110 S. zahlr. graph. Darst. : 21 cm txt rdacontent n rdamedia nc rdacarrier Zugl.: Wiesbaden, Fachhochsch., Diss., 2003 Automatische Sprechererkennung (DE-588)4143704-4 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Automatische Sprechererkennung (DE-588)4143704-4 s DE-604 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=014176489&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Sigmund, Milan Voice recognition by computer Automatische Sprechererkennung (DE-588)4143704-4 gnd |
subject_GND | (DE-588)4143704-4 (DE-588)4113937-9 |
title | Voice recognition by computer |
title_auth | Voice recognition by computer |
title_exact_search | Voice recognition by computer |
title_exact_search_txtP | Voice recognition by computer |
title_full | Voice recognition by computer von Milan Sigmund |
title_fullStr | Voice recognition by computer von Milan Sigmund |
title_full_unstemmed | Voice recognition by computer von Milan Sigmund |
title_short | Voice recognition by computer |
title_sort | voice recognition by computer |
topic | Automatische Sprechererkennung (DE-588)4143704-4 gnd |
topic_facet | Automatische Sprechererkennung Hochschulschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=014176489&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT sigmundmilan voicerecognitionbycomputer |