Phonetic search methods for large speech databases:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
New York
Springer
2013
|
Schriftenreihe: | Springer briefs in electrical and computer engineering
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Includes bibliographical references (pages 49-53) |
Beschreibung: | X, 53 S. graph. Darst. 24 cm |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV041255953 | ||
003 | DE-604 | ||
005 | 20131008 | ||
007 | t | ||
008 | 130905s2013 d||| |||| 00||| eng d | ||
020 | |z 9781461464884 |c paperback |9 978-1-4614-6488-4 | ||
020 | |z 1461464889 |c paperback |9 1-4614-6488-9 | ||
020 | |z 9781461464891 (ebook) |9 9781461464891 (ebook) | ||
035 | |a (OCoLC)852377249 | ||
035 | |a (DE-599)BVBBV041255953 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-12 | ||
084 | |a 24,1 |2 ssgn | ||
100 | 1 | |a Moyal, Ami |e Verfasser |0 (DE-588)1042574774 |4 aut | |
245 | 1 | 0 | |a Phonetic search methods for large speech databases |c Ami Moyal, Vered Aharonson, Ella Tetariy, Michal Gishri |
264 | 1 | |a New York |b Springer |c 2013 | |
300 | |a X, 53 S. |b graph. Darst. |c 24 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Springer briefs in electrical and computer engineering | |
500 | |a Includes bibliographical references (pages 49-53) | ||
650 | 4 | |a Database searching | |
650 | 4 | |a Keyword searching | |
650 | 4 | |a Natural language processing (Computer science) | |
650 | 4 | |a Speech processing systems | |
650 | 0 | 7 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Information Retrieval |0 (DE-588)4072803-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Big Data |0 (DE-588)4802620-7 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Information Retrieval |0 (DE-588)4072803-1 |D s |
689 | 0 | 1 | |a Automatische Spracherkennung |0 (DE-588)4003961-4 |D s |
689 | 0 | 2 | |a Big Data |0 (DE-588)4802620-7 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m Digitalisierung BSB Muenchen 21 - ADAM Catalogue Enrichment |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026229892&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-026229892 | ||
942 | 1 | 1 | |c 400 |e 22/bsb |
942 | 1 | 1 | |c 025.04 |e 22/bsb |
Datensatz im Suchindex
_version_ | 1804150713420349440 |
---|---|
adam_text | Contents
1
Keyword Spotting Out of Continuous Speech
................... 1
1.1
Introduction
......................................... 1
1.2
Problem Formulation: KWS in Large Speech Databases
......... 5
1.3
Target Applications of Keyword Spotting
................... 6
2
Keyword Spotting Methods
................................ 7
2.1
LVCSR-Based KWS
.................................. 7
2.2
Acoustic KWS
....................................... 7
2.3
Phonetic Search KWS
................................. 8
2.4
Discussion: Why Phonetic Search?
........................ 9
2.4.1
Response Time
................................. 9
2.4.2
KWS Performance
............................... 10
2.4.3
Keyword Flexibility
.............................. 10
3
Phonetic Search
......................................... 13
3.1
The Search Mechanism
................................. 13
3.2
Using Phonetic Search for KWS
.......................... 15
3.3
Computational Complexity Analysis
....................... 16
4
Search Space Complexity Reduction
.......................... 19
4.1
Overview
........................................... 19
4.2
Complexity Reduction in Phonetic Search
................... 21
4.3
Anchor-Based Phonetic Search
........................... 23
5
Evaluating Phonetic Search KWS
............................ 29
5.1
Performance Metrics
.................................. 29
5.2
Evaluation Process
.................................... 32
5.3
Evaluation Databases
.................................. 33
6
Evaluation Results
....................................... 35
6.1
Exhaustive Search
.................................... 35
6.1.
1 Textual Benchmark
.............................. 36
IX
x
Contents
6.1.2
KWS on
Speech................................ 37
6.1.2.1 Single
Threshold.........................
37
6.1.2.2 Multiple
Thresholds.......................
38
6.2
Anchor-Based Search
.................................. 39
6.2.1
Textual Benchmark
.............................. 39
6.2.2
Reduced Complexity KWS on Speech
................ 39
6.2.2.1
Single Threshold
......................... 39
6.2.3
Multiple Thresholds
.............................. 42
6.3
Lessons Learned from the Evaluation
...................... 43
7
Summary
.............................................. 45
Glossary of Acronyms
....................................... 47
References
................................................ 49
|
any_adam_object | 1 |
author | Moyal, Ami |
author_GND | (DE-588)1042574774 |
author_facet | Moyal, Ami |
author_role | aut |
author_sort | Moyal, Ami |
author_variant | a m am |
building | Verbundindex |
bvnumber | BV041255953 |
ctrlnum | (OCoLC)852377249 (DE-599)BVBBV041255953 |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01997nam a2200481 c 4500</leader><controlfield tag="001">BV041255953</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20131008 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">130905s2013 d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781461464884</subfield><subfield code="c">paperback</subfield><subfield code="9">978-1-4614-6488-4</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">1461464889</subfield><subfield code="c">paperback</subfield><subfield code="9">1-4614-6488-9</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781461464891 (ebook)</subfield><subfield code="9">9781461464891 (ebook)</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)852377249</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV041255953</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-12</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">24,1</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Moyal, Ami</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1042574774</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Phonetic search methods for large speech databases</subfield><subfield code="c">Ami Moyal, Vered Aharonson, Ella Tetariy, Michal Gishri</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">New York</subfield><subfield code="b">Springer</subfield><subfield code="c">2013</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">X, 53 S.</subfield><subfield code="b">graph. Darst.</subfield><subfield code="c">24 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Springer briefs in electrical and computer engineering</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references (pages 49-53)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Database searching</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Keyword searching</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Natural language processing (Computer science)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Speech processing systems</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Automatische Spracherkennung</subfield><subfield code="0">(DE-588)4003961-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung BSB Muenchen 21 - ADAM Catalogue Enrichment</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026229892&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-026229892</subfield></datafield><datafield tag="942" ind1="1" ind2="1"><subfield code="c">400</subfield><subfield code="e">22/bsb</subfield></datafield><datafield tag="942" ind1="1" ind2="1"><subfield code="c">025.04</subfield><subfield code="e">22/bsb</subfield></datafield></record></collection> |
id | DE-604.BV041255953 |
illustrated | Illustrated |
indexdate | 2024-07-10T00:43:18Z |
institution | BVB |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-026229892 |
oclc_num | 852377249 |
open_access_boolean | |
owner | DE-12 |
owner_facet | DE-12 |
physical | X, 53 S. graph. Darst. 24 cm |
publishDate | 2013 |
publishDateSearch | 2013 |
publishDateSort | 2013 |
publisher | Springer |
record_format | marc |
series2 | Springer briefs in electrical and computer engineering |
spelling | Moyal, Ami Verfasser (DE-588)1042574774 aut Phonetic search methods for large speech databases Ami Moyal, Vered Aharonson, Ella Tetariy, Michal Gishri New York Springer 2013 X, 53 S. graph. Darst. 24 cm txt rdacontent n rdamedia nc rdacarrier Springer briefs in electrical and computer engineering Includes bibliographical references (pages 49-53) Database searching Keyword searching Natural language processing (Computer science) Speech processing systems Automatische Spracherkennung (DE-588)4003961-4 gnd rswk-swf Information Retrieval (DE-588)4072803-1 gnd rswk-swf Big Data (DE-588)4802620-7 gnd rswk-swf Information Retrieval (DE-588)4072803-1 s Automatische Spracherkennung (DE-588)4003961-4 s Big Data (DE-588)4802620-7 s DE-604 Digitalisierung BSB Muenchen 21 - ADAM Catalogue Enrichment application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026229892&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Moyal, Ami Phonetic search methods for large speech databases Database searching Keyword searching Natural language processing (Computer science) Speech processing systems Automatische Spracherkennung (DE-588)4003961-4 gnd Information Retrieval (DE-588)4072803-1 gnd Big Data (DE-588)4802620-7 gnd |
subject_GND | (DE-588)4003961-4 (DE-588)4072803-1 (DE-588)4802620-7 |
title | Phonetic search methods for large speech databases |
title_auth | Phonetic search methods for large speech databases |
title_exact_search | Phonetic search methods for large speech databases |
title_full | Phonetic search methods for large speech databases Ami Moyal, Vered Aharonson, Ella Tetariy, Michal Gishri |
title_fullStr | Phonetic search methods for large speech databases Ami Moyal, Vered Aharonson, Ella Tetariy, Michal Gishri |
title_full_unstemmed | Phonetic search methods for large speech databases Ami Moyal, Vered Aharonson, Ella Tetariy, Michal Gishri |
title_short | Phonetic search methods for large speech databases |
title_sort | phonetic search methods for large speech databases |
topic | Database searching Keyword searching Natural language processing (Computer science) Speech processing systems Automatische Spracherkennung (DE-588)4003961-4 gnd Information Retrieval (DE-588)4072803-1 gnd Big Data (DE-588)4802620-7 gnd |
topic_facet | Database searching Keyword searching Natural language processing (Computer science) Speech processing systems Automatische Spracherkennung Information Retrieval Big Data |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026229892&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT moyalami phoneticsearchmethodsforlargespeechdatabases |