Theory and applications of digital speech processing: international version
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Boston [u.a.]
Pearson
2011
|
Ausgabe: | 1. internat. ed. |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | 1056 S. Ill., graph. Darst. |
ISBN: | 9780137050857 |
Internformat
MARC
LEADER | 00000nam a2200000zc 4500 | ||
---|---|---|---|
001 | BV036551911 | ||
003 | DE-604 | ||
005 | 20120326 | ||
007 | t | ||
008 | 100707s2011 xxuad|| |||| 00||| eng d | ||
020 | |a 9780137050857 |c alk. paper |9 978-0-13-705085-7 | ||
035 | |a (OCoLC)705649504 | ||
035 | |a (DE-599)BVBBV036551911 | ||
040 | |a DE-604 |b ger |e aacr | ||
041 | 0 | |a eng | |
044 | |a xxu |c US | ||
049 | |a DE-859 |a DE-862 |a DE-1043 |a DE-355 | ||
050 | 0 | |a TK7882.S65 | |
082 | 0 | |a 006.4/54 | |
084 | |a ES 945 |0 (DE-625)27935: |2 rvk | ||
084 | |a ZN 6060 |0 (DE-625)157500: |2 rvk | ||
100 | 1 | |a Rabiner, Lawrence R. |d 1943- |e Verfasser |0 (DE-588)138833737 |4 aut | |
245 | 1 | 0 | |a Theory and applications of digital speech processing |b international version |c Lawrence R. Rabiner ; Ronald W. Schafer |
250 | |a 1. internat. ed. | ||
264 | 1 | |a Boston [u.a.] |b Pearson |c 2011 | |
300 | |a 1056 S. |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
650 | 4 | |a Speech processing systems | |
650 | 0 | 7 | |a Sprachsignal |0 (DE-588)4056494-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Digitale Sprachverarbeitung |0 (DE-588)4233857-8 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Datenverarbeitung |0 (DE-588)4011152-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Sprachsignal |0 (DE-588)4056494-0 |D s |
689 | 0 | 1 | |a Datenverarbeitung |0 (DE-588)4011152-0 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a Digitale Sprachverarbeitung |0 (DE-588)4233857-8 |D s |
689 | 1 | |5 DE-604 | |
689 | 2 | 0 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |D s |
689 | 2 | |5 DE-604 | |
700 | 1 | |a Schafer, Ronald W. |d 1938- |e Sonstige |0 (DE-588)137466692 |4 oth | |
856 | 4 | 2 | |m Digitalisierung UB Regensburg |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020473434&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-020473434 |
Datensatz im Suchindex
DE-BY-862_location | 2000 |
---|---|
DE-BY-FWS_call_number | 2000/ZN 6060 R116 |
DE-BY-FWS_katkey | 385168 |
DE-BY-FWS_media_number | 083000502679 |
_version_ | 1806174475693064192 |
adam_text | Contents
Preface
9
CHAPTER
1
Introduction to Digital Speech Processing
15
1.1
The Speech Signal
17
1.2
The Speech Stack
22
1.3
Applications of Digital Speech Processing
24
1.4
Comment on the References
29
1.5
Summary
31
CHAPTER
2
Review of Fundamentals of Digital Signal Processing
32
2.1
Introduction
32
2.2
Discrete-Time Signals and Systems
32
2.3
Transform Representation of Signals and Systems
36
2.4
Fundamentals of Digital Filters
47
2.5
Sampling
58
2.6
Summary
70
Problems
70
CHAPTER
3
Fundamentals of Human Speech Production
81
3.1
Introduction
81
3.2
The Process of Speech Production
82
3.3
Short-Time Fourier Representation of Speech
95
3.4
Acoustic Phonetics
100
3.5
Distinctive Features of the Phonemes of American English
122
3.6
Summary
124
Problems
124
CHAPTER
4
Hearing, Auditory Models, and Speech Perception
138
4.1
Introduction
138
4.2
The Speech Chain
139
4.3
Anatomy and Function of the Ear
141
4.4
The Perception of Sound
147
4.5
Auditory Models
164
4.6
Human Speech Perception Experiments
172
4.7
Measurement of Speech Quality and Intelligibility
176
4.8
Summary
180
Problems
181
6 Contents
CHAPTER
5
Sound Propagation in the Human Vocal Tract
184
5.1
The Acoustic Theory of Speech Production
184
5.2
Lossless Tube Models
214
5.3
Digital Models for Sampled Speech Signals
233
5.4
Summary
242
Problems
242
CHAPTER
6
Time-Domain Methods for Speech Processing
253
6.1
Introduction
253
6.2
Short-Time Analysis of Speech
256
6.3
Short-Time Energy and Short-Time Magnitude
262
6.4
Short-Time Zero-Crossing Rate
271
6.5
The Short-Time Autocorrelation Function
279
6.6
The Modified Short-Time Autocorrelation Function
287
6.7
The Short-Time Average Magnitude Difference Function
289
6.8
Summary
291
Problems
292
CHAPTER
7
Frequency-Domain Representations
301
7.1
Introduction
301
7.2
Discrete-Time Fourier Analysis
303
7.3
Short-Time Fourier Analysis
306
7.4
Spectrographic Displays
326
7.5
Overlap Addition Method of Synthesis
333
7.6
Filter Bank Summation Method of Synthesis
345
7.7
Time-Decimated Filter Banks
354
7.8
Two-Channel Filter Banks
362
7.9
Implementation of the FBS Method Using the FFT
372
7.10
OLA Revisited
379
7.11
Modifications of the STFT
381
7.12
Summary
393
Problems
394
CHAPTER
8
The Cepstrum and Homomorphic Speech Processing
413
8.1
Introduction
413
8.2
Homomorphic Systems for Convolution
415
8.3
Homomorphic Analysis of the Speech Model
431
8.4
Computing the Short-Time Cepstrum and Complex Cepstrum
of Speech
443
8.5
Homomorphic Filtering of Natural Speech
454
8.6
Cepstrum Analysis of All-Pole Models
470
8.7
Cepstrum Distance Measures
473
8.8
Summary
480
Problems
480
Contents 7
CHAPTER
9 Linear
Predictive Analysis of Speech Signals
487
9.1
Introduction
487
9.2
Basic Principles of Linear Predictive Analysis
488
9.3
Computation of the Gain for the Model
500
9.4
Frequency Domain Interpretations of Linear Predictive
Analysis
504
9.5
Solution of the LPC Equations
519
9.6
The Prediction Error Signal
541
9.7
Some Properties of the LPC Polynomial A(z)
552
9.8
Relation of Linear Predictive Analysis to Lossless Tube Models
560
9.9
Alternative Representations of the LP Parameters
565
9.10
Summary
574
Problems
574
CHAPTER
10
Algorithms for Estimating Speech Parameters
592
10.1
Introduction
592
10.2
Median Smoothing and Speech Processing
594
10.3
Speech-Background/Silence Discrimination
600
10.4
A Bayesian Approach to Voiced/Unvoiced/Silence Detection
609
10.5
Pitch Period Estimation (Pitch Detection)
617
10.6
Formant
Estimation
649
10.7
Summary
659
Problems
659
CHAPTER
11
Digital Coding of Speech Signals
677
11.1
Introduction
677
11.2
Sampling Speech Signals
681
11.3
A Statistical Model for Speech
683
11.4
Instantaneous Quantization
690
11.5
Adaptive Quantization
720
11.6
Quantizing of Speech Model Parameters
732
11.7
General Theory of Differential Quantization
746
11.8
Delta Modulation
757
11.9
Differential PCM (DPCM)
773
11.10
Enhancements for ADPCM Coders
782
11.11
Analysis-by-Synthesis Speech Coders
797
11.12
Open-Loop Speech Coders
820
11.13
Applications of Speech Coders
828
11.14
Summary
833
Problems
834
CHAPTER
12
Frequency-Domain Coding of Speech and Audio
856
12.1
Introduction
856
12.2
Historical Perspective
858
8 Contents
12.3 Subband
Coding
864
12.4
Adaptive Transform Coding
875
12.5
A Perception Model for Audio Coding
880
12.6
MPEG-1 Audio Coding Standard
895
12.7
Other Audio Coding Standards
908
12.8
Summary
908
Problems
909
CHAPTER
13
Text-to-Speech Synthesis Methods
921
13.1
Introduction
921
13.2
Text Analysis
922
13.3
Evolution of Speech Synthesis Methods
928
13.4
Early Speech Synthesis Approaches
930
13.5
Unit Selection Methods
940
13.6
TTS Future Needs
956
13.7
Visual TTS
957
13.8
Summary
961
Problems
961
CHAPTER
14
Automatic Speech Recognition and Natural
Language Understanding
964
14.1
Introduction
964
14.2
Basic ASR Formulation
966
14.3
Overall Speech Recognition Process
967
14.4
Building a Speech Recognition System
968
14.5
The Decision Processes in ASR
971
14.6
Step
3:
The Search Problem
985
14.7
Simple ASR System: Isolated Digit Recognition
986
14.8
Performance Evaluation of Speech Recognizers
988
14.9
Spoken Language Understanding
991
14.10
Dialog Management and Spoken Language Generation
994
14.11
User Interfaces
997
14.12 Multimodal User
Interfaces
998
14.13
Summary
998
Problems
999
Appendices
A Speech and Audio Processing Demonstrations
1007
В
Solution of Frequency-Domain Differential Equations
1019
Bibliography
1022
Index
1045
|
any_adam_object | 1 |
author | Rabiner, Lawrence R. 1943- |
author_GND | (DE-588)138833737 (DE-588)137466692 |
author_facet | Rabiner, Lawrence R. 1943- |
author_role | aut |
author_sort | Rabiner, Lawrence R. 1943- |
author_variant | l r r lr lrr |
building | Verbundindex |
bvnumber | BV036551911 |
callnumber-first | T - Technology |
callnumber-label | TK7882 |
callnumber-raw | TK7882.S65 |
callnumber-search | TK7882.S65 |
callnumber-sort | TK 47882 S65 |
callnumber-subject | TK - Electrical and Nuclear Engineering |
classification_rvk | ES 945 ZN 6060 |
ctrlnum | (OCoLC)705649504 (DE-599)BVBBV036551911 |
dewey-full | 006.4/54 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.4/54 |
dewey-search | 006.4/54 |
dewey-sort | 16.4 254 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik Sprachwissenschaft Elektrotechnik / Elektronik / Nachrichtentechnik Literaturwissenschaft |
edition | 1. internat. ed. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02016nam a2200493zc 4500</leader><controlfield tag="001">BV036551911</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20120326 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">100707s2011 xxuad|| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780137050857</subfield><subfield code="c">alk. paper</subfield><subfield code="9">978-0-13-705085-7</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)705649504</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV036551911</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">aacr</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">xxu</subfield><subfield code="c">US</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-859</subfield><subfield code="a">DE-862</subfield><subfield code="a">DE-1043</subfield><subfield code="a">DE-355</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">TK7882.S65</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.4/54</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ES 945</subfield><subfield code="0">(DE-625)27935:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ZN 6060</subfield><subfield code="0">(DE-625)157500:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Rabiner, Lawrence R.</subfield><subfield code="d">1943-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)138833737</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Theory and applications of digital speech processing</subfield><subfield code="b">international version</subfield><subfield code="c">Lawrence R. Rabiner ; Ronald W. Schafer</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. internat. ed.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Boston [u.a.]</subfield><subfield code="b">Pearson</subfield><subfield code="c">2011</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1056 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Speech processing systems</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachsignal</subfield><subfield code="0">(DE-588)4056494-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Digitale Sprachverarbeitung</subfield><subfield code="0">(DE-588)4233857-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenverarbeitung</subfield><subfield code="0">(DE-588)4011152-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Sprachsignal</subfield><subfield code="0">(DE-588)4056494-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Datenverarbeitung</subfield><subfield code="0">(DE-588)4011152-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Digitale Sprachverarbeitung</subfield><subfield code="0">(DE-588)4233857-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="2" ind2="0"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="2" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Schafer, Ronald W.</subfield><subfield code="d">1938-</subfield><subfield code="e">Sonstige</subfield><subfield code="0">(DE-588)137466692</subfield><subfield code="4">oth</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Regensburg</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020473434&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-020473434</subfield></datafield></record></collection> |
id | DE-604.BV036551911 |
illustrated | Illustrated |
indexdate | 2024-08-01T10:50:08Z |
institution | BVB |
isbn | 9780137050857 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-020473434 |
oclc_num | 705649504 |
open_access_boolean | |
owner | DE-859 DE-862 DE-BY-FWS DE-1043 DE-355 DE-BY-UBR |
owner_facet | DE-859 DE-862 DE-BY-FWS DE-1043 DE-355 DE-BY-UBR |
physical | 1056 S. Ill., graph. Darst. |
publishDate | 2011 |
publishDateSearch | 2011 |
publishDateSort | 2011 |
publisher | Pearson |
record_format | marc |
spellingShingle | Rabiner, Lawrence R. 1943- Theory and applications of digital speech processing international version Speech processing systems Sprachsignal (DE-588)4056494-0 gnd Digitale Sprachverarbeitung (DE-588)4233857-8 gnd Datenverarbeitung (DE-588)4011152-0 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd |
subject_GND | (DE-588)4056494-0 (DE-588)4233857-8 (DE-588)4011152-0 (DE-588)4003961-4 |
title | Theory and applications of digital speech processing international version |
title_auth | Theory and applications of digital speech processing international version |
title_exact_search | Theory and applications of digital speech processing international version |
title_full | Theory and applications of digital speech processing international version Lawrence R. Rabiner ; Ronald W. Schafer |
title_fullStr | Theory and applications of digital speech processing international version Lawrence R. Rabiner ; Ronald W. Schafer |
title_full_unstemmed | Theory and applications of digital speech processing international version Lawrence R. Rabiner ; Ronald W. Schafer |
title_short | Theory and applications of digital speech processing |
title_sort | theory and applications of digital speech processing international version |
title_sub | international version |
topic | Speech processing systems Sprachsignal (DE-588)4056494-0 gnd Digitale Sprachverarbeitung (DE-588)4233857-8 gnd Datenverarbeitung (DE-588)4011152-0 gnd Automatische Spracherkennung (DE-588)4003961-4 gnd |
topic_facet | Speech processing systems Sprachsignal Digitale Sprachverarbeitung Datenverarbeitung Automatische Spracherkennung |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020473434&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT rabinerlawrencer theoryandapplicationsofdigitalspeechprocessinginternationalversion AT schaferronaldw theoryandapplicationsofdigitalspeechprocessinginternationalversion |
Inhaltsverzeichnis
Schweinfurt Zentralbibliothek Lesesaal
Signatur: |
2000 ZN 6060 R116 |
---|---|
Exemplar 1 | ausleihbar Checked out – Rückgabe bis: 10.02.2025 Vormerken |