Pitch Determination of Speech Signals: Algorithms and Devices
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Berlin, Heidelberg
Springer Berlin Heidelberg
1983
|
Schriftenreihe: | Springer Series in Information Sciences
3 |
Schlagworte: | |
Online-Zugang: | Volltext |
Beschreibung: | Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measurement. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz) |
Beschreibung: | 1 Online-Ressource (XIV, 700 p) |
ISBN: | 9783642819261 9783642819285 |
ISSN: | 0720-678X |
DOI: | 10.1007/978-3-642-81926-1 |
Internformat
MARC
LEADER | 00000nmm a2200000zcb4500 | ||
---|---|---|---|
001 | BV042413759 | ||
003 | DE-604 | ||
005 | 20171009 | ||
007 | cr|uuu---uuuuu | ||
008 | 150316s1983 |||| o||u| ||||||eng d | ||
020 | |a 9783642819261 |c Online |9 978-3-642-81926-1 | ||
020 | |a 9783642819285 |c Print |9 978-3-642-81928-5 | ||
024 | 7 | |a 10.1007/978-3-642-81926-1 |2 doi | |
035 | |a (OCoLC)863820006 | ||
035 | |a (DE-599)BVBBV042413759 | ||
040 | |a DE-604 |b ger |e aacr | ||
041 | 0 | |a eng | |
049 | |a DE-91 |a DE-83 | ||
082 | 0 | |a 534 |2 23 | |
084 | |a PHY 000 |2 stub | ||
100 | 1 | |a Hess, Wolfgang |e Verfasser |4 aut | |
245 | 1 | 0 | |a Pitch Determination of Speech Signals |b Algorithms and Devices |c by Wolfgang Hess |
264 | 1 | |a Berlin, Heidelberg |b Springer Berlin Heidelberg |c 1983 | |
300 | |a 1 Online-Ressource (XIV, 700 p) | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
490 | 1 | |a Springer Series in Information Sciences |v 3 |x 0720-678X | |
500 | |a Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measurement. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz) | ||
650 | 4 | |a Physics | |
650 | 4 | |a Acoustics | |
650 | 0 | 7 | |a Grundfrequenzbestimmung |0 (DE-588)4158381-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachsignal |0 (DE-588)4056494-0 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Sprachsignal |0 (DE-588)4056494-0 |D s |
689 | 0 | 1 | |a Grundfrequenzbestimmung |0 (DE-588)4158381-4 |D s |
689 | 0 | |8 1\p |5 DE-604 | |
689 | 1 | 0 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |D s |
689 | 1 | |8 2\p |5 DE-604 | |
830 | 0 | |a Springer Series in Information Sciences |v 3 |w (DE-604)BV000008063 |9 3 | |
856 | 4 | 0 | |u https://doi.org/10.1007/978-3-642-81926-1 |x Verlag |3 Volltext |
912 | |a ZDB-2-PHA |a ZDB-2-BAE | ||
940 | 1 | |q ZDB-2-PHA_Archive | |
999 | |a oai:aleph.bib-bvb.de:BVB01-027849252 | ||
883 | 1 | |8 1\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk | |
883 | 1 | |8 2\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk |
Datensatz im Suchindex
_version_ | 1804153078489808896 |
---|---|
any_adam_object | |
author | Hess, Wolfgang |
author_facet | Hess, Wolfgang |
author_role | aut |
author_sort | Hess, Wolfgang |
author_variant | w h wh |
building | Verbundindex |
bvnumber | BV042413759 |
classification_tum | PHY 000 |
collection | ZDB-2-PHA ZDB-2-BAE |
ctrlnum | (OCoLC)863820006 (DE-599)BVBBV042413759 |
dewey-full | 534 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 534 - Sound and related vibrations |
dewey-raw | 534 |
dewey-search | 534 |
dewey-sort | 3534 |
dewey-tens | 530 - Physics |
discipline | Physik |
doi_str_mv | 10.1007/978-3-642-81926-1 |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03466nmm a2200517zcb4500</leader><controlfield tag="001">BV042413759</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20171009 </controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">150316s1983 |||| o||u| ||||||eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642819261</subfield><subfield code="c">Online</subfield><subfield code="9">978-3-642-81926-1</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642819285</subfield><subfield code="c">Print</subfield><subfield code="9">978-3-642-81928-5</subfield></datafield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/978-3-642-81926-1</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)863820006</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV042413759</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">aacr</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91</subfield><subfield code="a">DE-83</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">534</subfield><subfield code="2">23</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">PHY 000</subfield><subfield code="2">stub</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Hess, Wolfgang</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Pitch Determination of Speech Signals</subfield><subfield code="b">Algorithms and Devices</subfield><subfield code="c">by Wolfgang Hess</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin, Heidelberg</subfield><subfield code="b">Springer Berlin Heidelberg</subfield><subfield code="c">1983</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (XIV, 700 p)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Springer Series in Information Sciences</subfield><subfield code="v">3</subfield><subfield code="x">0720-678X</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measurement. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Physics</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Acoustics</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Grundfrequenzbestimmung</subfield><subfield code="0">(DE-588)4158381-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachsignal</subfield><subfield code="0">(DE-588)4056494-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Sprachsignal</subfield><subfield code="0">(DE-588)4056494-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Grundfrequenzbestimmung</subfield><subfield code="0">(DE-588)4158381-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="8">1\p</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="8">2\p</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Springer Series in Information Sciences</subfield><subfield code="v">3</subfield><subfield code="w">(DE-604)BV000008063</subfield><subfield code="9">3</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://doi.org/10.1007/978-3-642-81926-1</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-2-PHA</subfield><subfield code="a">ZDB-2-BAE</subfield></datafield><datafield tag="940" ind1="1" ind2=" "><subfield code="q">ZDB-2-PHA_Archive</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-027849252</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="a">cgwrk</subfield><subfield code="d">20201028</subfield><subfield code="q">DE-101</subfield><subfield code="u">https://d-nb.info/provenance/plan#cgwrk</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">2\p</subfield><subfield code="a">cgwrk</subfield><subfield code="d">20201028</subfield><subfield code="q">DE-101</subfield><subfield code="u">https://d-nb.info/provenance/plan#cgwrk</subfield></datafield></record></collection> |
id | DE-604.BV042413759 |
illustrated | Not Illustrated |
indexdate | 2024-07-10T01:20:53Z |
institution | BVB |
isbn | 9783642819261 9783642819285 |
issn | 0720-678X |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-027849252 |
oclc_num | 863820006 |
open_access_boolean | |
owner | DE-91 DE-BY-TUM DE-83 |
owner_facet | DE-91 DE-BY-TUM DE-83 |
physical | 1 Online-Ressource (XIV, 700 p) |
psigel | ZDB-2-PHA ZDB-2-BAE ZDB-2-PHA_Archive |
publishDate | 1983 |
publishDateSearch | 1983 |
publishDateSort | 1983 |
publisher | Springer Berlin Heidelberg |
record_format | marc |
series | Springer Series in Information Sciences |
series2 | Springer Series in Information Sciences |
spelling | Hess, Wolfgang Verfasser aut Pitch Determination of Speech Signals Algorithms and Devices by Wolfgang Hess Berlin, Heidelberg Springer Berlin Heidelberg 1983 1 Online-Ressource (XIV, 700 p) txt rdacontent c rdamedia cr rdacarrier Springer Series in Information Sciences 3 0720-678X Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measurement. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz) Physics Acoustics Grundfrequenzbestimmung (DE-588)4158381-4 gnd rswk-swf Sprachverarbeitung (DE-588)4116579-2 gnd rswk-swf Sprachsignal (DE-588)4056494-0 gnd rswk-swf Sprachsignal (DE-588)4056494-0 s Grundfrequenzbestimmung (DE-588)4158381-4 s 1\p DE-604 Sprachverarbeitung (DE-588)4116579-2 s 2\p DE-604 Springer Series in Information Sciences 3 (DE-604)BV000008063 3 https://doi.org/10.1007/978-3-642-81926-1 Verlag Volltext 1\p cgwrk 20201028 DE-101 https://d-nb.info/provenance/plan#cgwrk 2\p cgwrk 20201028 DE-101 https://d-nb.info/provenance/plan#cgwrk |
spellingShingle | Hess, Wolfgang Pitch Determination of Speech Signals Algorithms and Devices Springer Series in Information Sciences Physics Acoustics Grundfrequenzbestimmung (DE-588)4158381-4 gnd Sprachverarbeitung (DE-588)4116579-2 gnd Sprachsignal (DE-588)4056494-0 gnd |
subject_GND | (DE-588)4158381-4 (DE-588)4116579-2 (DE-588)4056494-0 |
title | Pitch Determination of Speech Signals Algorithms and Devices |
title_auth | Pitch Determination of Speech Signals Algorithms and Devices |
title_exact_search | Pitch Determination of Speech Signals Algorithms and Devices |
title_full | Pitch Determination of Speech Signals Algorithms and Devices by Wolfgang Hess |
title_fullStr | Pitch Determination of Speech Signals Algorithms and Devices by Wolfgang Hess |
title_full_unstemmed | Pitch Determination of Speech Signals Algorithms and Devices by Wolfgang Hess |
title_short | Pitch Determination of Speech Signals |
title_sort | pitch determination of speech signals algorithms and devices |
title_sub | Algorithms and Devices |
topic | Physics Acoustics Grundfrequenzbestimmung (DE-588)4158381-4 gnd Sprachverarbeitung (DE-588)4116579-2 gnd Sprachsignal (DE-588)4056494-0 gnd |
topic_facet | Physics Acoustics Grundfrequenzbestimmung Sprachverarbeitung Sprachsignal |
url | https://doi.org/10.1007/978-3-642-81926-1 |
volume_link | (DE-604)BV000008063 |
work_keys_str_mv | AT hesswolfgang pitchdeterminationofspeechsignalsalgorithmsanddevices |