Verfügbarkeit: 1997 International Conference on Acoustics, Speech and Signal Processing

1997 International Conference on Acoustics, Speech and Signal Processing: ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing

Gespeichert in:

Bibliographische Detailangaben
Körperschaft:	ICASSP München (VerfasserIn)
Format:	Tagungsbericht Buch
Sprache:	English
Veröffentlicht:	Munich 1997
Schlagworte:	Konferenzschrift
Online-Zugang:	Inhaltsverzeichnis
Beschreibung:	S. 711 - 1610 Ill., graph. Darst.

Internformat

MARC


LEADER	00000nam a2200000 cc4500
001	BV011407364
003	DE-604
005	00000000000000.0
007	t
008	970701s1997 ad\|\| \|\|\|\| 10\|\|\| eng d
035			\|a (OCoLC)632730018
035			\|a (DE-599)BVBBV011407364
040			\|a DE-604 \|b ger \|e rakddb
041	0		\|a eng
049			\|a DE-29T \|a DE-91G \|a DE-91
111	2		\|a ICASSP \|n 22 \|d 1997 \|c München \|j Verfasser \|0 (DE-588)1901193-3 \|4 aut
245	1	0	\|a 1997 International Conference on Acoustics, Speech and Signal Processing \|b ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center \|n 2 \|p Speech processing
264		1	\|c 1997
264		1	\|a Munich
300			\|a S. 711 - 1610 \|b Ill., graph. Darst.
336			\|b txt \|2 rdacontent
337			\|b n \|2 rdamedia
338			\|b nc \|2 rdacarrier
655		7	\|0 (DE-588)1071861417 \|a Konferenzschrift \|2 gnd-content
773	0	8	\|w (DE-604)BV011372148 \|g 2
856	4	2	\|m Digitalisierung TU Muenchen \|q application/pdf \|u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA \|3 Inhaltsverzeichnis
999			\|a oai:aleph.bib-bvb.de:BVB01-007669545

Datensatz im Suchindex

_version_	1804125921592999936
adam_text	Table of Contents Volume II Speech Processing Conference Committee................................................................................................................ xix IEEE Signa] Processing Society ...................................................................................................xx ICASSP 1998 in Seattle ...............................................................................................................xxi Call for Papers .............................................................................................................................xxii ICASSP-De Paper Cover Sheet ..................................................................................................xxiii IEEE Copyright Form ...............................................................................................................xxiv Recognizing Broadcast News Transcription of Broadcast News - System Robustness Issues and Adaptation Techniques ...............................................................................................................711 R. Bakis, S. Schen, P. Gopalakrishnan, R. Gopinath, S. Maes and L. Polymenakos Transcribing Broadcast News Shows ........................................................................................715 J. Gauvain, G. Adda, L. Lamel and M. Adda-Decker Broadcast News Transcription Using НТК ...............................................................................719 P. Woodland, M. Gales, D. Pye and S. Young Transcription of Broadcast Television and Radio News: The 1996 Abbot System ........................................................................... ..............................................................723 С Cook, D. Kershaw, J. Christie, C. Seymour and S. Waterhouse Improved Topic Discrimination of Broadcast News Using a Model of Multiple Simultaneous Topics ....................................................................................................727 T. Imai, R. Schwartz, F. Rubala and L. Nguyen CELP Speech Coding Enhanced Full Rate Speech Codec for IS-136 Digital Cellular System ..................................731 T. Honkanen, J. Vainio, K. Järvinen, P. Haamsto, R. Salami, C. Laßamme and J. Adoul A CELP Variable Rate Speech Codec with Low Average Rate ................................................735 L. Zhang, T. Wang and V. Cuperman HCELP: Low Bit Rate Speech Coder for Voice Storage Applications .....................................739 M. Bouraoui, F. Druilhe and G. Feng Low-Rate CELP Speech Coding Using an Improved Weighting Function .............................743 С Kwon and C. Un Toll Quality Variable-Rate Speech Codec .................................................................................747 P. Ojala A Variable-Rate Multimodal Speech Coder with Gain-Matched Analysis-by-Synthesis .................................................................................................................751 E. Paksoy, A. McCree and V. Viswanathan Design of a Toll-Quality 4-Kbit/s Speech Coder Based on Phase-Adaptive PSI-CELP .........................................................................................................755 K. Mano A High-Quality Bi-CELP Speech Coder at 8 Kbit/s and Below ...............................................759 S. Kwon, H. Park and H. Chang Low Complexity VQ for Multi-Tap Pitch Predictor Coding .....................................................763 J. Patel A 4 Kbit/s Renewal Code Excited Linear Prediction Speech Coder ........................................767 H. Kim, Y. Cho, M. Kim and S. Kim GSM Enhanced Full Rate Speech Codec ...................................................................................771 K. Järvinen, J. Vainio, P. Kapanen, T. Honkanen, P. Haavisto, R. Salami, C. Laflamme and J. Adoul Description of ITU-T Recommendation G.729 Annex A: Reduced Complexity 8 Kbit/s CS-ACELP Codec ..........................................................................................................775 R. Salami, C. Laflamme, B. Bessette and J. Adoul Language modeling Semantic Clustering for Adaptive Language Modeling ...........................................................779 R. Kneser and J. Peters Task Adaptation Using MAP Estimation in N-Gram Language Modeling .............................783 H. Masataki, Y. Sagisaka, К. Hisaki and T. Kawahara Distant Bigram Language Modelling Using Maximum Entropy ............................................787 M. Simons, H. Ney and S. Martin Nonuniform Markov Models ......................................................................................................791 E. Ristad and R. Thomas Modelling Word-Pair Relations in a Category-Based Language Model ..................................795 T. Niesler and P. Woodland Language Model Adaptation Using Mixtures and an Exponentially Decaying Cache ...........................................................................................................................799 P. Clarkson and A. Robinson Confidence-Driven Estimator Perturbation: BMPC .................................................................803 S. Besling and H. Meier Domain Adaptation with Clustered Language Models ............................................................807 J. Ueberla Improving Parsing of Spontaneous Speech with the Help of Prosodie Boundaries ..................................................................................................................................811 R. Котре, Α. Kießling, H. Niemann, E. Nöth, A. Batliner, S. Schachtl, T. Rulând and H. Block Specialized Language Models Using Dialogue Predictions ......................................................815 C. Popovici and P. Baggia K-TLSS(S) Language Models for Speech Recognition ..............................................................819 G. Bordel and A. Varona Language Model Adaptation for Conversational Speech Recognition Using Automatically Tagged Pseudo-Morphological Classes .............................................................823 C. Crespo, D. Tapias, G. Escalada and J. Alvarez Noise Robustness Model Adaptation Based on HMM Decomposition for Reverberant Speech Recognition ..................................................................................................................................827 Γ. Takiguchi, S. Nakamura, K. Shikano and Q Huo Model Compensation for Noises in Training and Test Data ....................................................831 D. Matrouf and J. Gauvain Jacobian Approach to Fast Acoustic Model Adaptation ...........................................................835 S. Sagayama, Y. Yamaguchi, S. Takahashi and J. Takahashi A Unified Maximum Likelihood Approach to Acoustic Mismatch Compensation: Application to Noisy Lombard Speech Recognition ........................................839 M. Afify, Y. Gong and J Haton Enhancement and Recognition of Noisy Speech Within an Autoregressive Hidden Markov Model Framework Using Noise Estimates from the Noisy Signal ...........................................................................................................................................843 B. Logan and A. Robinson Fast Speech Recognition Algorithm Under Noisy Environment Using Modified CMS-PMC and Improved IDMM+SQ ........................................................................847 H. Yamamoto, T. Kosáka, M. Yamada, Y. Komori and M. Fujiła The Effects of Background Music on Speech Recognition Accuracy ........................................851 B. Raj, V. Parikh and R. Stern Joint Model and Feature Space Optimization for Robust Speech Recognition ..................................................................................................................................855 J. Hwang and C. Wang Co-Channel Speech Separation for Robust Automatic Speech Recognition: Stability and Efficiency .............................................................................................................859 K. Yen and Y. Zhao Missing Data Techniques for Robust Speech Recognition .......................................................863 M. Cooke, A. Morris and P. Green Spectral Subtraction and Rasta-Filtering in Text-Dependent HMM-Based Speaker Verification ...................................................................................................................867 D. Hardt and K. Fellbaum Noise Robust Speech Recognition with State Duration Constraints .......................................871 ■K. Laurila Word Spotting with Confidence Confidence Measures for Spontaneous Speech Recognition ....................................................875 T. Schaaf and T. Kemp A Probabilistic Approach to Confidence Estimation and Evaluation ......................................879 L. Gillick, Y. Ito and J. Young Word-Based Confidence Measures As a Guide for Stack Search in Speech Recognition ..................................................................................................................................883 С Neti, S. Roukos and E. Eide Neural - Network Based Measures of Confidence for Word Recognition ................................887 M. Weintraub, F. Beaufays, Z. Rivlin, Y. König and A. Stoicke Improving Utterance Verification Using Hierarchical Confidence Measures in Continuous Natural Numbers Recognition .........................................................891 J. Caminero, L. Hernandez-Gomez, C. de la Torre and C. Martin On the Influence of Frame-Asynchronous Grammar Scoring in a CSR System ....................895 A. Rubio, J. Diaz, P. Garcia and J. Segura A Segment-Based Wordspotter Using Phonetic Filler Models ................................................899 A. Manos and V. Zue A Multi-Phase Approach for Fast Spotting of Large Vocabulary Chinese Keywords from Mandarin Speech Using Prosodie Information ..............................................903 B. Bai, С. Tseng and L. Lee Accurate Keyword Spotting Using Strictly Lexical Fillers ......................................................907 R. El Méliani and D. O Shaughnessy Failure Simulation for a Phoneme HMM Based Keyword Spotter .........................................911 M. Holzapfel, G. Ruske and H. Höge Wordspotting Using a Predictive Neural Model for the Telephone Speech Corpus ..........................................................................................................................................915 S. Suhardi and K. Fellbaum Speech Synthesis Shape-Invariant Pitch and Time-Scale Modification of Speech by Variable Order Phase Interpolation ..........................................................................................919 M. Pollard, B. Cheetham, C. Goodyear and M. Edgington A Chinese Text-to-Speech System Based on Part-of-Speech Analysis, Prosodie Modeling and Non-Uniform Units ..............................................................................923 F. Chou, С. Tseng, К. Chen and L. Lee Automatic Prosodie Modeling for Speaker and Task Adaptation in Text-to-Speech .............................................................................................................................927 E. Lopez-Gonzalo, J. Rodriguez-Garcia, L. Hernandez-Gomez and J. Villar Prosody Generation with a Neural Network: Weighing the Importance of Input Parameters ........................................................................................................................931 G. Sonntag, T. Portele and В. Heuft Evaluation of a Speech Synthesis Method for Nonlinear Modeling of Vocal Folds Vibration Effect .......................................................................................................935 H. Ohmura and K. Tanaka Generation of Fo Contour Using Stochastic Mapping and Vector Quantization Control Parameters .............................................................................................939 B. Heo-Jin, K. Yeon-Jun and O. Yung-Hwan Spectral Normalization Employing Hidden Markov Modeling of Line Spectrum Pair Frequencies ........................................................................................................943 B. Pellom and J. Hansen Time Domain Technique for Pitch Modification and Robust Voice Transformation ...........................................................................................................................947 R. Vergin, D. O Shaughnessy and A. Farhat A New Fundamental Frequency Modification Algorithm with Transformation of Spectrum Envelope According to Fo ...........................................................951 K. Tanaka and M. Abe Reliability Assessment and Evaluation of Objectively Measured Descriptors for Perceptual Speaker Characterization ..............................................................955 B. Necioglu, M. Clements and T. Barnwell Recent Improvements on Microsoft s Trainable Text-to-Speech System - Whistler .......................................................................................................................................959 X. Huang, A. Acero, H. Hon, Y. Ju, J. Liu, S. Meredith and M. Plumpe Automatic Generation of Speech Synthesis Units Based on Closed Loop Training .......................................................................................................................................963 T. Kagoshima and M. Akamine Speech Features and Acoustic Modeling Isolated Word Recognition Using the HMM Structure Selected by the Genetic Algorithm .......................................................................................................................967 T. Takara, K. Higa and I. Nagayama Discrete Mixture HMM ..............................................................................................................971 S. Takahashi, К. Aikawa and S. Sagayama Using Word Temporal Structure in HMM Speech Recognition ..............................................975 L. Fissore, F. Ravera and P. Laface Smoothness Analysis for Trajectory Features ..........................................................................979 Z. Ни and E. Barnard Frequency-Warping and Speaker-Normalization .....................................................................983 S. Umesh, L. Cohen and D. Nelson Integrating Syllable Boundary Information Into Speech Recognition ....................................987 S. Wu, M. Shire, S. Greenberg and N. Morgan Explicit, N-Best Formant Features for Vowel Classification ...................................................991 P. Schmid and E. Barnard Dual-Channel Auditory Spectrum Modeling ............................................................................995 J. Billa Direct Identification Vs. Correlated Models to Process Acoustic and Articulatory Informations in Automatic Speech Recognition ..................................................999 R. André-Obrecht and B. Jacob Adapting PSN Recognition Models to the GSM Environment by Using Spectral Transformation ..........................................................................................................1003 T. Soulas, С. Mokbel, D. Jouvet and J. Monne Integrated-Multilingual Speech Recognition Using Universal Phonological Features in a Functional Speech Production Model .........................................1007 L. Deng Phone Classification with Segmental Features and a Binary-Pair Partitioned Neural Network Classifier ...................................................................................1011 S. Zahorian, P. Silsbee and X. Wang Speaker Adaptation and Normalisation Smoothed N-Best-Based Speaker Adaptation for Speech Recognition .................................1015 T. Matsui, T. Matsuoka and S. Furui A Fast Algorithm for Unsupervised Incremental Speaker Adaptation .................................1019 M. Schüßler, F. Gallwitz and S. Harbeck Improved Estimation of Supervision in Unsupervised Speaker Adaptation ........................1023 S. Homma, K. Aikawa and S. Sagayama Improved Bayesian Learning of Hidden Markov Models for Speaker Adaptation .................................................................................................................................1027 J. Chien, H. Wang and С. Lee Studies in Transformation-Based Adaptation ........................................................................1031 V. Nagesha and L. Gillick Speaker Adaptation in the Philips System for Large Vocabulary Continuous Speech Recognition ...............................................................................................1035 E. Thelen, X. Aubert and P. Beyerlein Speaker Normalization Based on Frequency Warping ..........................................................1039 P. Zhan and M. Westphal Speaker Adaptive Training: A Maximum Likelihood Approach to Speaker Normalization ...........................................................................................................................1043 T. Anastasakos, J. McDonough and J. Makhoul Experiments in Speaker Normalisation and Adaptation for Large Vocabulary Speech Recognition ...............................................................................................1047 D. Pye and P. Woodland Effectiveness of Speaker Normalized HMM by Projection to Speaker Subspace ....................................................................................................................................1051 Y. Ariki Speaker Normalization and Adaptation Based on Linear Transformation ..........................1055 J. Ishii and M. Tonomura Speaker-Adapted Training on the Switchboard Corpus ........................................................1059 J. McDonough, T. Anastasakos, G. Zavaliagkos and H. Gish Speaker Verification and Identification Model Transformation for Robust Speaker Recognition from Telephone Data ...........................................................................................................................................1063 F. Beaufays and M. Weintraub Speaker Recognition with the Switchboard Corpus ...............................................................1067 L. Lamel and J. Gauvain Handset-Dependent Background Models for Robust Text-Independent Speaker Recognition .................................................................................................................1071 L. Heck and M. Weintraub Telephone Based Speaker Recognition Using Multiple Binary Classifier and Gaussian Mixture Models .................................................................................................1075 P. Castellano, S. Słomka and S. Sridharan Comparison of Whole Word and Subword Modeling Techniques for Speaker Verification with Limited Training Data ................................................................................1079 S. Euler, R. Langlitz and J. Zinke A Comparison of Model Estimation Techniques for Speaker Verification ............................1083 M. Carey, E. Parris, S. Bennett and H. Lloyd-Thomas Speaker Verification Using Frame and Utterance Level Likelihood Normalization ...........................................................................................................................1087 S. Nakagawa and K. Markov A New Codebook Training Algorithm for VQ-Based Speaker Recognition ...........................1091 J. He, L. Liu and G. Palm Bispectrum Features for Robust Speaker Identification ........................................................1095 S. Wenndt and S. Shamsunder Speaker Identification Based Text to Audio Alignment for an Audio Retrieval System .......................................................................................................................1099 D. Roy and C. M alamud Robust Speaker Recognition through Acoustic Array Processing and Spectral Normalization .............................................................................................................1103 J. Gonzalez-Rodriguez and J. Ortega-Garcia Providing Single and Multi-Channel Acoustical Robustness to Speaker Identification Systems ..............................................................................................................1107 J. Ortega-García and J. Gonzalez-Rodriguez Language and Speaker Identification Robust Spoken Language Identification Using Large Vocabulary Speech Recognition ................................................................................................................................1111 J. Hieronymus and S. Kadambe Double Bigram-Decoding in Phonotactic Language Identification .......................................1115 J. Navrátil and W. Zühlke Random Walk Theory Applied to Language Identification ....................................................1119 E. Marcherei and M. Savie Frequency Characteristics of Foreign Accented Speech .........................................................1123 L. Arslan and J. Hansen A Study on Improving Decisions in Closed Set Speaker Identification ................................1127 M. Demirekler and A. Saranli The Use of Harmonic Features in Speaker Recognition ........................................................1131 B. Imperi, Z. Kacie and В. Horvat An Approach to Speaker Identification Using Multiple Classifiers ......................................1135 V. Radová and J. Psutka Spoken Language Systems Development and Evaluation of the ATOS Spontaneous Speech Conversational System .............................................................................................................1139 J. Alvarez, D. Tapias, С. Crespo, I. Cortázar and F. Martinez A Spoken Language System for Automated Call Routing .....................................................1143 G. Riccardi, A. Gorin, A. Ljolje and M. Riley Dialogos: A Robust System for Human-Machine Spoken Dialogue on the Telephone ..................................................................................................................................1147 D. Albesano, P. Baggia, M. Danieli, R. Gemello, E. Gerbino and C. Rullent Surfin the World Wide Web with Japanese ............................................................................1151 К. Kondo and С. Hemphill Internet Chinese Information Retrieval Using Unconstrained Mandarin Speech Queries Based on a Client-Server Architecture and a PAT-Tree-Based Language Model ...........................................................................................1155 L. Chien, M. Chen, H. Wang, L. Lee, S. Lin, J. Hong and J. Shen Combining Key-Phrase Detection and Subword-Based Verification for Flexible Speech Understanding ...............................................................................................1159 T. Kawahara, C. Lee and B. Juang Controlling Limited-Domain Applications by Probabilistic Semantic Decoding of Natural Speech .....................................................................................................1163 H. Stahl, J. Müller and M. Lang Speech Enhancement Multi-Channel Speech Enhancement in a Car Environment Using Wiener Filtering and Spectral Subtraction ..........................................................................................1167 J. Meyer and K. Simmer Weighted Matching Algorithms and Reliability in Noise Cancelling by Spectral Subtraction .................................................................................................................1171 N. Yoma, F. Mclnnes and M. Jack HMM-Based Speech Enhancement Using Harmonic Modeling ............................................1175 M. Deisher and A. Spanias Model Based Speech Pause Detection .....................................................................................1179 B. McKinley and G. Whipple Integrated Speech Enhancement and Coding in the Time-Frequency Domain ...................1183 A. Drygajlo and B. Carnero Quality Enhancement of Narrowband CELP-Coded Speech via Wideband Harmonic Re-Synthesis ............................................................................................................1187 C. Chan and W. Hui Speech Enhancement Using CSS-Based Array Processing ...................................................1191 F. Asano and S. Hayamizu Co-Channel Speaker Separation Using Constrained Nonlinear Optimization ....................1195 D. Benincasa and M. Savie A Contextual Blind Separation of Delayed and Convolved Sources .....................................1199 T. Lee and R. Orglmeister Segregation of Concurrent Speech with the Reassigned Spectrum ......................................1203 G. Meyer, F. Plante and F. Berthommier Enhancement of Esophageal Speech by Injection Noise Rejection .......................................1207 H. Javkin, M. Galler and N. Niedzielski Real-Time Digital Speech Processing Strategies for the Hearing Impaired ....................................................................................................................................1211 N. Magotra and S. Sirivara Iterative-Batch and Sequential Algorithms for Single Microphone Speech Enhancement ...............................................................................................................1215 S. Gannot, D. Burshtein and E. Weinstein Kalman Filtering for Low Distortion Speech Enhancement in Mobile Communication .........................................................................................................................1219 P. Sörqvist, P. Handel and B. Ottersten Features for ASR Exploiting the Potential of Auditory Preprocessing for Robust Speech Recognition by Locally Recurrent Neural Networks ..............................................................1223 K. Kasper, H. Reininger and D. Wolf Feature Adaptation Using Deviation Vector for Robust Speech Recognition in Noisy Environment ..........................................................................................1227 T. Hwang, L. Lee and H. Wang Binaural Phoneme Recognition Using the Auditory Image Model and Cross-Correlation ......................................................................................................................1231 K. Francis and T. Anderson Utterance Dependent Parametric Warping for a Talker-Independent HMM-Based Recognizer ...........................................................................................................1235 D. Mashao and J. Adcock Phase-Corrected RASTA for Automatic Speech Recognition Over the Phone ......................1239 J. de Veth and L. Boves A Binaural Speech Processing Method Using Subband-Crosscorrelation Analysis for Noise Robust Recognition ....................................................................................1243 S. Knjita, K. Takeda and F. Itakura Modelling Asynchrony in Speech Using Elementary Single-Signal Decomposition ...........................................................................................................................1247 M. Tomlinson, M. Russell, R. Moore, A. Buckland and M. Fawley Subband-Based Speech Recognition ........................................................................................1251 H. Bourlard and S. Dupont Sub-Band Based Recognition of Noisy Speech ........................................................................1255 S. Tibrewala and H. Hermansky Recognizing Reverberant Speech with RASTA-PLP ..............................................................1259 B. Kingsbury and N. Morgan Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based Speech Recognition ...................................................................................................................1263 S. Vaseghi, N. Harte and B. Milner Maximum Likelihood Weighting of Dynamic Speech Features for CDHMM Speech Recognition ...................................................................................................................1267 J. Hernando Speech Recognition Using Automatically Derived Acoustic Baseforms ................................1271 R. Rose and E. Lleida On Combining Frequency Warping and Spectral Shaping in HMM Based Speech Recognition ...................................................................................................................1275 A. Potamianos and R. Rose Speech Analysis Recursive Linear Prediction Using OBE Identification with Automatic Bound Estimation .....................................................................................................................1279 J. Délier, T. Lin and M. Nayeri Nonlinear Long-Term Prediction of Speech Signals ...............................................................1283 M. Birgmeier, H. Bernhard and G. Kubin Vocal Tract Shape Trajectory Estimation Using MLP Analysis-by-Synthesis ...............................................................................................................1287 H. Richards, J. Mason, J. Bridle and M. Hunt Fast and Robust Joint Estimation of Vocal Tract and Voice Source Parameters ................................................................................................................................1291 D. Wen and H. Norio Spectral Correlates of Glottal Waveform Models: An Analytic Study ...................................1295 B. Doval and C. d Alessandro A Time Varying ARMAX Speech Modeling with Phase Compensation Using Glottal Source Model ................................................................................................................1299 K. Funaki, Y. Miyanaga and K. Tochinai Speech Representation and Transformation Using Adaptive Interpolation of Weighted Spectrum: VOCODER Revisited .........................................................................1303 H. Kawahara The Weft: A Representation for Periodic Sounds ...................................................................1307 D. Ellis A Computationally Efficient Algorithm for Calculating Loudness Patterns of Narrowband Speech ..............................................................................................1311 M. Hauenstein Two-Channel Blind Deconvolution for Non-Minimum Phase Impulse Responses ..................................................................................................................................1315 K. Furuya and Y. Kaneda Variable Time-Scale Modification of Speech Using Transient Information .........................1319 S. Lee and H. Kim Speech Enhancement with Reduction of Noise Components in the Wavelet Domain ......................................................................................................................................1323 J. Seok and K. Вае Blind Separation and Restoration of Signals Mixed in Convolutive Environment .............................................................................................................................1327 J. Xi and J. Reilly Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator ............................................................................................................................1331 E. Scheirer and M. Slaney Topics in Speech Coding I Encoding of Speech Spectral Parameters Using Adaptive Quantization Methods .....................................................................................................................................1335 I. Lee and H. Woo Optimal Transformation of LSP Parameters Using Neural Network ...................................1339 H. Vu and L. Lois Speech Spectrum Representation and Coding Using Multigrams with Distance .....................................................................................................................................1343 J. Cernocký, G. Baudoin and G. Chollet Incorporating Perception Into LSF Quantization - Some Experiments ................................1347 R. Cohn and J. Collura Predictive VQ for Noisy Channel Spectrum Coding: AR Or MA? ..........................................1351 J. Skoglund and J. Linden Efficient Encoding of Mel-Generalized Cepstrum for CELP Coders .....................................1355 K. Koishida, T. Kobayashi, S. Imai and K. Tokuda A Candidate Coder for the ITU-T s New Wideband Speech Coding Standard .....................1359 J. Chen Perceptual Speech Coding Using Time and Frequency Masking Constraints .....................1363 B. Carnero and A. Drygajlo A Multi-Band CELP Wideband Speech Coder ........................................................................1367 A. Ubale and A. Gersho A Design of Transform Coder for Both Speech and Audio Signals at 1 Bit/sample ..................................................................................................................................1371 T. Moriya, N. Iwakami, A. Jin, K. Ikeda and S. Miki Speech Quality Assessment of Compounded Digital Telecommunication Systems: Perceptual Dimensions .............................................................................................1375 K. Petersen, S. Hansen and J. Sorensen Performance Assessment of Tandem Connection of Cellular and Satellite-Mobile Coders ............................................................................................................1379 S. Campos Neto, F. Corcoran and A. Karahisar The Consequences of Linguistic Perception on Low-Rate Speech Coding ............................1383 J. Parry and I. Burnett Using a Quantitative Psychoacoustical Signal Representation for Objective Speech Quality Measurement .................................................................................1387 M. Hansen and B. Kollmeier Speech Models and Features A Method of Extracting Time-Varying Acoustic Features Effective for Speech Recognition ...................................................................................................................1391 K. Tanaka and H. Kojima Elimination of Trajectory Folding Phenomenon: HMM, Trajectory Mixture HMM and Mixture Stochastic Trajectory Model ....................................................................1395 I. Illina and Y. Gong Linear Dynamic Segmental HMMs: Variability Representation and Training Procedure ...................................................................................................................1399 W. Holmes and M. Russell Model Parameter Estimation for Mixture Density Polynomial Segment Models ........................................................................................................................................1403 T. Fukada, Y. Sagisaka and K. Paliwal The Importance of Segmentation Probability in Segment Based Speech Recognizers ................................................................................................................................1407 J. Verhasselt, J. Martens, I. Illina, J. Haton and Y. Gong Adaptation of Polynomial Trajectory Segment Models for Large Vocabulary Speech Recognition ...............................................................................................1411 A. Kannan and M. Ostendorf Speaker Adaptation Experiments Using Nonstationary-State Hidden Markov Models: A MAP Approach ...........................................................................................1415 C. Rathinavelu and L. Deng Vocabulary Optimization Based on Perplexity .......................................................................1419 K. Hwang REMAP for Video Soundtrack Indexing ..................................................................................1423 P. Gelin and C. Wellekens Robust Pitch Detection of Speech Signals Using Steerable Filters .......................................1427 J. Cai and Z. Liu Evaluation of the Relationship Between Emotional Concepts and Emotional Parameters on Speech ............................................................................................1431 T. Moriyama, H. Saito and S. Ozawa Time-Frequency Analysis of the Glottal Opening ..................................................................1435 W. Wokurek Time-Frequency Structured Decorrelation of Speech Signals via Nonseparable Gabor Frames ....................................................................................................1439 W. Kozek and H. Feichtinger Generalized Mixture of HMMs for Continuous Speech Recognition .....................................1443 F. Korkmazskiy, B. Juang and F. Soong Topics in ASR Writer Adaptation of a HMM Handwriting Recognition System ..........................................1447 Л. Senior and K. Nathan In-Service Adaptation of Multilingual Hidden-Markov-Models ............................................1451 U. Bub, J. Köhler and B. Imperi Development of Dialect-Specific Speech Recognizers Using Adaptation Methods ..................................................................................................................................... 1455 V. Diakoloukas, V. Digalakis, L. Neumeyer and J. Kaja Syllable-Based Relevance Feedback Techniques for Mandarin Voice Record Retrieval Using Speech Queries ..................................................................................1459 B. Bai, L. Chien and L. Lee Automatic Alternative Transcription Generation and Vocabulary Selection for Flexible Word Recognizers .................................................................................1463 D. Torre, L. Villarrubia, J. Elvira and L. Hernandez-Gomez An Advanced System to Generate Pronunciations of Proper Nouns .....................................1467 N. Deshmukh, J. Ngan, J. Hamaker and J. Picone Automatic Pronunciation Scoring for Language Instruction .................................................1471 H. Franco, L. Neumeyer, Y. Kim and O. Ronen Speaker-Independent Name Dialing with Out-of-Vocabulary Rejection ..............................1475 С Ramalingam, L. Netsch and Y. Kao Hidden Understanding Models for Statistical Sentence Understanding ..............................1479 R. Schwartz, S. Miller, D. Stallard and J. Makhoul An Alternative Scheme for Perplexity Estimation .................................................................1483 F. Bimbót, M. El-Beze and M. Jardino Extensions to Phone-State Decision-Tree Clustering: Single Tree and Tagged Clustering .....................................................................................................................1487 D. Paul Evaluation of Fast Algorithms for Finding the Nearest Neighbor ........................................1491 S. Lubiarz and P. Lockwood Fusion of Visual and Acoustic Signals for Command-Word Recognition ..............................1495 R. Kober, U. Harz and J. Schiffers Difference in Visual Information Between Face to Face and Telephone Dialogues ...................................................................................................................................1499 Y. Iwano, Y. Sugita, Y. Kasahara, S. Nakazato and K. Shirai Compensation (Speaker, Channel, Noise) Cepstrum-Based Filter-Bank Design Using Discriminative Feature Extraction Training at Various Levels ....................................................................................1503 A. Biem and S. Katagiri Minimum Error Rate Training for Designing Tree-Structured Probability Density Function .......................................................................................................................1507 W. Chou A Frequency-Weighted HMM Based on Minimum Error Classification for Noisy Speech Recognition .........................................................................................................1511 H. Matsumoto and M. Ono Dictionary-Based Discriminative HMM Parameter Estimation for Continuous Speech Recognition Systems ................................................................................1515 D. Willett, C. Neukirchen and J. Rottland A DFE-Based Algorithm for Feature Selection in Speech Recognition .................................1519 A. de la Torre, A. Peinado, A. Rubio and V. Sanchez Robustness Issues and Solutions in Speech Recognition Based Telephony Services ......................................................................................................................................1523 V. Raman and V. Ramanujam Speaker-Dependent Speech Recognition Based on Phone-Like Units Models - Application to Voice Dialing ...................................................................................................1527 V. Fontaine and H. Bourlard Enhanced Control and Estimation of Parameters for a Telephone Based Isolated Digit Recognizer .........................................................................................................1531 J. Bauer HTIMIT and LLHDB: Speech Corpora for the Study of Handset Transducer Effects ........................................................................................................................................1535 D. Reynolds Robustness Improvements in Continuously Spelled Names Over the Telephone ..................................................................................................................................1539 M. Galler and J. Junqua A Fast Algorithm for Stochastic Matching with Application to Robust Speaker Verification .................................................................................................................1543 Q. Li, S. Parthasarathy and A. Rosenberg A Bayesian Predictive Classification Approach to Robust Speech Recognition ................................................................................................................................1547 Q. Huo, H. Jiang and C. Lee Robust Speech Recognition Based on Viterbi Bayesian Predictive Classification .............................................................................................................................1551 H. Jiang, K. Hirose and Q. Huo Speech Coding at Low Bit Rate Efficient Mixed Excitation Models in LPC Based Prototype Interpolation Speech Coders ....................................................................................................1555 С Papanastasiou and C. Xydeas High Quality Split Band LPC Vocoder Operating at Low Bit Rates .....................................1559 I. Atkinson, S. Yeldener and A. Kondoz Non-Linear Techniques for Pitch and Waveform Enhancement in PWI Coders ..................1563 H. Li and G. Lockhart Multi-Prototype Waveform Coding Using Frame-by-Frame Analysis-by-Synthesis ...............................................................................................................1567 I. Burnett and D. Pham Multiband Prototype Waveform Analysis for Very Low Bit Rate Speech Coding ........................................................................................................................................1571 K. Yaghmaie and A. Kondoz A Formant Vocoder Based on Mixtures of Gaussians ............................................................1575 P. Zolfaghari and T. Robinson Natural Quality Variable-Rate Spectral Speech Coding Below 3.0 Kbps .............................1579 E. Erzin, A. Kumar and A. Gersho A New 2-Kbit/s Speech Coder Based on Normalized Pitch Waveform ..................................1583 Y. Hiwasaki and K. Mano A Comparison of the New 2400 Bps MELP Federal Standard with Other Standard Coders .......................................................................................................................1587 M. Kohier MELP: The New Federal Standard at 2400 Bps .....................................................................1591 L. Supplee, R. Cohn, J. Collura and A. McCree Using a Perception-Based Frequency Scale in Waveform Interpolation ..............................1595 J. Thyssen, В. Kleijn and R. Hagen Very Low Complexity Interpolative Speech Coding at 1.2 to 2.4 Kbps .................................1599 Y. Shoham Modified Multiband Excitation Model at 2400 Bps ................................................................1603 M. Jamrozik and J. Gowdy Variable Bit Rate MBELP Speech Coding via V/UV Distribution Dependent Spectral Quantization ...............................................................................................................1607 E. Yu and С Chan Author Index ............................................................................................................................. Al
any_adam_object	1
author_corporate	ICASSP München
author_corporate_role	aut
author_facet	ICASSP München
author_sort	ICASSP München
building	Verbundindex
bvnumber	BV011407364
ctrlnum	(OCoLC)632730018 (DE-599)BVBBV011407364
format	Conference Proceeding Book
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01203nam a2200289 cc4500</leader><controlfield tag="001">BV011407364</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">970701s1997 ad\|\| \|\|\|\| 10\|\|\| eng d</controlfield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)632730018</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV011407364</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield><subfield code="a">DE-91G</subfield><subfield code="a">DE-91</subfield></datafield><datafield tag="111" ind1="2" ind2=" "><subfield code="a">ICASSP</subfield><subfield code="n">22</subfield><subfield code="d">1997</subfield><subfield code="c">München</subfield><subfield code="j">Verfasser</subfield><subfield code="0">(DE-588)1901193-3</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">1997 International Conference on Acoustics, Speech and Signal Processing</subfield><subfield code="b">ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center</subfield><subfield code="n">2</subfield><subfield code="p">Speech processing</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">1997</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Munich</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">S. 711 - 1610</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)1071861417</subfield><subfield code="a">Konferenzschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="w">(DE-604)BV011372148</subfield><subfield code="g">2</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung TU Muenchen</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-007669545</subfield></datafield></record></collection>
genre	(DE-588)1071861417 Konferenzschrift gnd-content
genre_facet	Konferenzschrift
id	DE-604.BV011407364
illustrated	Illustrated
indexdate	2024-07-09T18:09:14Z
institution	BVB
institution_GND	(DE-588)1901193-3
language	English
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-007669545
oclc_num	632730018
open_access_boolean
owner	DE-29T DE-91G DE-BY-TUM DE-91 DE-BY-TUM
owner_facet	DE-29T DE-91G DE-BY-TUM DE-91 DE-BY-TUM
physical	S. 711 - 1610 Ill., graph. Darst.
publishDate	1997
publishDateSearch	1997
publishDateSort	1997
record_format	marc
spelling	ICASSP 22 1997 München Verfasser (DE-588)1901193-3 aut 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing 1997 Munich S. 711 - 1610 Ill., graph. Darst. txt rdacontent n rdamedia nc rdacarrier (DE-588)1071861417 Konferenzschrift gnd-content (DE-604)BV011372148 2 Digitalisierung TU Muenchen application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis
spellingShingle	1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center
subject_GND	(DE-588)1071861417
title	1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center
title_auth	1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center
title_exact_search	1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center
title_full	1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing
title_fullStr	1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing
title_full_unstemmed	1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing
title_short	1997 International Conference on Acoustics, Speech and Signal Processing
title_sort	1997 international conference on acoustics speech and signal processing icassp 97 april 21 24 1997 munich germany gasteig munich s cultural center speech processing
title_sub	ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center
topic_facet	Konferenzschrift
url	http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA
volume_link	(DE-604)BV011372148
work_keys_str_mv	AT icasspmunchen 1997internationalconferenceonacousticsspeechandsignalprocessingicassp97april21241997munichgermanygasteigmunichsculturalcenter2

Verfügbarkeit

Es ist kein Print-Exemplar vorhanden.

Fernleihe Bestellen Achtung: Nicht im THWS-Bestand! Inhaltsverzeichnis

MARC

Datensatz im Suchindex

Es ist kein Print-Exemplar vorhanden.

Ähnliche Einträge