1997 International Conference on Acoustics, Speech and Signal Processing: ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing
Gespeichert in:
Körperschaft: | |
---|---|
Format: | Tagungsbericht Buch |
Sprache: | English |
Veröffentlicht: |
Munich
1997
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | S. 711 - 1610 Ill., graph. Darst. |
Internformat
MARC
LEADER | 00000nam a2200000 cc4500 | ||
---|---|---|---|
001 | BV011407364 | ||
003 | DE-604 | ||
005 | 00000000000000.0 | ||
007 | t | ||
008 | 970701s1997 ad|| |||| 10||| eng d | ||
035 | |a (OCoLC)632730018 | ||
035 | |a (DE-599)BVBBV011407364 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
049 | |a DE-29T |a DE-91G |a DE-91 | ||
111 | 2 | |a ICASSP |n 22 |d 1997 |c München |j Verfasser |0 (DE-588)1901193-3 |4 aut | |
245 | 1 | 0 | |a 1997 International Conference on Acoustics, Speech and Signal Processing |b ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center |n 2 |p Speech processing |
264 | 1 | |c 1997 | |
264 | 1 | |a Munich | |
300 | |a S. 711 - 1610 |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
655 | 7 | |0 (DE-588)1071861417 |a Konferenzschrift |2 gnd-content | |
773 | 0 | 8 | |w (DE-604)BV011372148 |g 2 |
856 | 4 | 2 | |m Digitalisierung TU Muenchen |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-007669545 |
Datensatz im Suchindex
_version_ | 1804125921592999936 |
---|---|
adam_text | Table
of
Contents
Volume
II
Speech Processing
Conference Committee................................................................................................................
xix
IEEE Signa] Processing Society
...................................................................................................xx
ICASSP
1998
in Seattle
...............................................................................................................xxi
Call for Papers
.............................................................................................................................xxii
ICASSP-De
Paper
Cover Sheet
..................................................................................................xxiii
IEEE Copyright Form
...............................................................................................................xxiv
Recognizing Broadcast News
Transcription of Broadcast News
-
System Robustness Issues and
Adaptation Techniques
...............................................................................................................711
R.
Bakis,
S.
Schen,
P. Gopalakrishnan,
R.
Gopinath,
S.
Maes
and L. Polymenakos
Transcribing Broadcast News Shows
........................................................................................715
J. Gauvain, G. Adda,
L. Lamel
and M. Adda-Decker
Broadcast News Transcription Using
НТК
...............................................................................719
P. Woodland, M. Gales, D. Pye and S. Young
Transcription of Broadcast Television and Radio News: The
1996
Abbot
System
........................................................................... ..............................................................723
С
Cook, D. Kershaw, J. Christie, C. Seymour and S. Waterhouse
Improved Topic Discrimination of Broadcast News Using a Model of
Multiple Simultaneous Topics
....................................................................................................727
T. Imai, R. Schwartz, F.
Rubala
and L. Nguyen
CELP Speech Coding
Enhanced Full Rate Speech Codec for IS-136 Digital Cellular System
..................................731
T. Honkanen, J. Vainio,
K. Järvinen,
P. Haamsto, R. Salami, C.
Laßamme
and J. Adoul
A CELP Variable Rate Speech Codec with Low Average Rate
................................................735
L. Zhang, T. Wang and V. Cuperman
HCELP: Low Bit Rate Speech Coder for Voice Storage Applications
.....................................739
M. Bouraoui, F. Druilhe and G. Feng
Low-Rate CELP Speech Coding Using an Improved Weighting Function
.............................743
С
Kwon and
C. Un
Toll Quality Variable-Rate Speech Codec
.................................................................................747
P.
Ojala
A Variable-Rate
Multimodal
Speech Coder with Gain-Matched
Analysis-by-Synthesis
.................................................................................................................751
E. Paksoy, A. McCree and V. Viswanathan
Design of a Toll-Quality 4-Kbit/s Speech Coder Based on
Phase-Adaptive PSI-CELP
.........................................................................................................755
K.
Mano
A High-Quality Bi-CELP Speech Coder at
8
Kbit/s and Below
...............................................759
S. Kwon, H. Park and H. Chang
Low Complexity VQ for Multi-Tap Pitch Predictor Coding
.....................................................763
J.
Patel
A
4
Kbit/s Renewal Code Excited Linear Prediction Speech Coder
........................................767
H. Kim, Y. Cho, M. Kim and S. Kim
GSM Enhanced Full Rate Speech Codec
...................................................................................771
K. Järvinen,
J. Vainio, P. Kapanen, T. Honkanen, P. Haavisto, R.
Salami, C. Laflamme and J. Adoul
Description of ITU-T Recommendation G.729 Annex A: Reduced Complexity
8
Kbit/s CS-ACELP Codec
..........................................................................................................775
R. Salami, C. Laflamme, B. Bessette and J. Adoul
Language modeling
Semantic Clustering for Adaptive Language Modeling
...........................................................779
R. Kneser and J. Peters
Task Adaptation Using MAP Estimation in N-Gram Language Modeling
.............................783
H.
Masataki,
Y.
Sagisaka,
К.
Hisaki and T. Kawahara
Distant
Bigram
Language Modelling Using Maximum Entropy
............................................787
M. Simons, H. Ney and S. Martin
Nonuniform
Markov Models
......................................................................................................791
E. Ristad and R. Thomas
Modelling Word-Pair Relations in a Category-Based Language Model
..................................795
T. Niesler and P. Woodland
Language Model Adaptation Using Mixtures and an Exponentially
Decaying Cache
...........................................................................................................................799
P.
Clarkson
and A. Robinson
Confidence-Driven Estimator Perturbation: BMPC
.................................................................803
S. Besling and H. Meier
Domain Adaptation with Clustered Language Models
............................................................807
J. Ueberla
Improving Parsing of Spontaneous Speech with the Help of
Prosodie
Boundaries
..................................................................................................................................811
R.
Котре,
Α.
Kießling,
H.
Niemann,
E.
Nöth,
A.
Batliner,
S. Schachtl, T.
Rulând
and H.
Block
Specialized Language Models Using Dialogue Predictions
......................................................815
C. Popovici and P. Baggia
K-TLSS(S)
Language Models for Speech Recognition
..............................................................819
G. Bordel
and
A. Varona
Language Model Adaptation for Conversational Speech Recognition Using
Automatically Tagged Pseudo-Morphological Classes
.............................................................823
C.
Crespo,
D.
Tapias,
G. Escalada and J. Alvarez
Noise
Robustness
Model
Adaptation
Based on
HMM
Decomposition for Reverberant Speech
Recognition
..................................................................................................................................827
Γ.
Takiguchi, S. Nakamura, K. Shikano and
Q
Huo
Model
Compensation
for Noises in Training and Test Data
....................................................831
D. Matrouf and J. Gauvain
Jacobian Approach to Fast Acoustic Model Adaptation
...........................................................835
S. Sagayama, Y. Yamaguchi, S. Takahashi and J. Takahashi
A Unified Maximum Likelihood Approach to Acoustic Mismatch
Compensation: Application to Noisy Lombard Speech Recognition
........................................839
M. Afify, Y. Gong and J
Haton
Enhancement and Recognition of Noisy Speech Within an
Autoregressive
Hidden Markov Model Framework Using Noise Estimates from the Noisy
Signal
...........................................................................................................................................843
B. Logan and A. Robinson
Fast Speech Recognition Algorithm Under Noisy Environment Using
Modified CMS-PMC and Improved IDMM+SQ
........................................................................847
H. Yamamoto,
T. Kosáka,
M.
Yamada,
Y.
Komori
and
M.
Fujiła
The Effects of Background Music on Speech Recognition Accuracy
........................................851
B. Raj, V. Parikh and R. Stern
Joint Model and Feature Space Optimization for Robust Speech
Recognition
..................................................................................................................................855
J. Hwang and C. Wang
Co-Channel Speech Separation for Robust Automatic Speech Recognition:
Stability and Efficiency
.............................................................................................................859
K. Yen and Y. Zhao
Missing Data Techniques for Robust Speech Recognition
.......................................................863
M. Cooke, A. Morris and P. Green
Spectral Subtraction and Rasta-Filtering in Text-Dependent HMM-Based
Speaker Verification
...................................................................................................................867
D.
Hardt
and K.
Fellbaum
Noise Robust Speech Recognition with State Duration Constraints
.......................................871
■K. Laurila
Word Spotting with Confidence
Confidence Measures for Spontaneous Speech Recognition
....................................................875
T.
Schaaf
and T. Kemp
A Probabilistic Approach to Confidence Estimation and Evaluation
......................................879
L. Gillick, Y.
Ito
and J. Young
Word-Based Confidence Measures As a Guide for Stack Search in Speech
Recognition
..................................................................................................................................883
С
Neti, S. Roukos and E.
Eide
Neural
-
Network Based Measures of Confidence for Word Recognition
................................887
M. Weintraub, F. Beaufays, Z. Rivlin, Y.
König
and
A. Stoicke
Improving Utterance Verification Using Hierarchical Confidence
Measures in Continuous Natural Numbers Recognition
.........................................................891
J. Caminero, L. Hernandez-Gomez,
C. de
la Torre and C. Martin
On the Influence of Frame-Asynchronous Grammar Scoring in a CSR System
....................895
A. Rubio,
J. Diaz, P. Garcia and J.
Segura
A Segment-Based
Wordspotter
Using Phonetic Filler Models
................................................899
A. Manos
and V. Zue
A Multi-Phase Approach for Fast Spotting of Large Vocabulary Chinese
Keywords from Mandarin Speech Using
Prosodie
Information
..............................................903
B.
Bai,
С.
Tseng and L. Lee
Accurate Keyword Spotting Using Strictly Lexical Fillers
......................................................907
R.
El Méliani
and D. O Shaughnessy
Failure Simulation for a Phoneme
HMM
Based Keyword Spotter
.........................................911
M.
Holzapfel,
G.
Ruske
and
H.
Höge
Wordspotting
Using a Predictive Neural Model for the Telephone Speech
Corpus
..........................................................................................................................................915
S. Suhardi and K.
Fellbaum
Speech Synthesis
Shape-Invariant Pitch and Time-Scale Modification of Speech by
Variable Order Phase Interpolation
..........................................................................................919
M. Pollard, B. Cheetham, C. Goodyear and M. Edgington
A Chinese Text-to-Speech System Based on Part-of-Speech Analysis,
Prosodie
Modeling and Non-Uniform Units
..............................................................................923
F. Chou,
С.
Tseng,
К.
Chen and L. Lee
Automatic
Prosodie
Modeling for Speaker and Task Adaptation in
Text-to-Speech
.............................................................................................................................927
E. Lopez-Gonzalo, J. Rodriguez-Garcia, L. Hernandez-Gomez
and J.
Villar
Prosody Generation with a Neural Network: Weighing the Importance of
Input Parameters
........................................................................................................................931
G.
Sonntag,
T.
Portele
and
В.
Heuft
Evaluation of a Speech Synthesis Method for Nonlinear Modeling of
Vocal Folds Vibration Effect
.......................................................................................................935
H. Ohmura and K. Tanaka
Generation of Fo Contour Using Stochastic Mapping and Vector
Quantization Control Parameters
.............................................................................................939
B. Heo-Jin, K. Yeon-Jun and O. Yung-Hwan
Spectral Normalization Employing Hidden Markov Modeling of Line
Spectrum Pair Frequencies
........................................................................................................943
B. Pellom and J.
Hansen
Time Domain Technique for Pitch Modification and Robust Voice
Transformation
...........................................................................................................................947
R.
Vergin,
D.
O Shaughnessy and A. Farhat
A New Fundamental Frequency Modification Algorithm with
Transformation of Spectrum Envelope According to Fo
...........................................................951
K. Tanaka and M. Abe
Reliability Assessment and Evaluation of Objectively Measured
Descriptors for Perceptual Speaker Characterization
..............................................................955
B. Necioglu, M. Clements and T. Barnwell
Recent Improvements on Microsoft s Trainable Text-to-Speech System
-
Whistler
.......................................................................................................................................959
X. Huang,
A. Acero,
H. Hon, Y. Ju, J. Liu, S.
Meredith
and M.
Plumpe
Automatic
Generation of Speech Synthesis Units Based on Closed Loop
Training
.......................................................................................................................................963
T. Kagoshima and M. Akamine
Speech Features and Acoustic Modeling
Isolated Word Recognition Using the
HMM
Structure Selected by the
Genetic Algorithm
.......................................................................................................................967
T. Takara, K.
Higa
and I. Nagayama
Discrete Mixture
HMM
..............................................................................................................971
S.
Takahashi,
К.
Aikawa and
S. Sagayama
Using Word Temporal Structure in
HMM
Speech Recognition
..............................................975
L. Fissore, F. Ravera and P. Laface
Smoothness Analysis for Trajectory Features
..........................................................................979
Z.
Ни
and E. Barnard
Frequency-Warping and Speaker-Normalization
.....................................................................983
S. Umesh, L. Cohen and D. Nelson
Integrating Syllable Boundary Information Into Speech Recognition
....................................987
S. Wu, M. Shire, S.
Greenberg
and
N.
Morgan
Explicit, N-Best
Formant
Features for Vowel Classification
...................................................991
P.
Schmid
and E. Barnard
Dual-Channel Auditory Spectrum Modeling
............................................................................995
J.
Billa
Direct Identification Vs. Correlated Models to Process Acoustic and
Articulatory Informations in Automatic Speech Recognition
..................................................999
R.
André-Obrecht
and B. Jacob
Adapting PSN Recognition Models to the GSM Environment by Using
Spectral Transformation
..........................................................................................................1003
T.
Soulas,
С.
Mokbel,
D.
Jouvet
and
J.
Monne
Integrated-Multilingual
Speech Recognition Using Universal
Phonological Features in a Functional Speech Production Model
.........................................1007
L. Deng
Phone Classification with
Segmental
Features and a Binary-Pair
Partitioned Neural Network Classifier
...................................................................................1011
S. Zahorian, P. Silsbee and X. Wang
Speaker Adaptation and Normalisation
Smoothed N-Best-Based Speaker Adaptation for Speech Recognition
.................................1015
T. Matsui, T. Matsuoka and S. Furui
A Fast Algorithm for Unsupervised Incremental Speaker Adaptation
.................................1019
M.
Schüßler, F. Gallwitz
and
S. Harbeck
Improved Estimation of Supervision in Unsupervised Speaker Adaptation
........................1023
S. Homma, K. Aikawa and S. Sagayama
Improved Bayesian Learning of Hidden Markov Models for Speaker
Adaptation
.................................................................................................................................1027
J.
Chien, H.
Wang and
С.
Lee
Studies in Transformation-Based Adaptation
........................................................................1031
V. Nagesha and L. Gillick
Speaker
Adaptation
in the
Philips System
for Large Vocabulary
Continuous Speech Recognition
...............................................................................................1035
E. Thelen, X. Aubert and P.
Beyerlein
Speaker Normalization Based on Frequency Warping
..........................................................1039
P. Zhan and M. Westphal
Speaker Adaptive Training: A Maximum Likelihood Approach to Speaker
Normalization
...........................................................................................................................1043
T. Anastasakos, J. McDonough and J. Makhoul
Experiments in Speaker Normalisation and Adaptation for Large
Vocabulary Speech Recognition
...............................................................................................1047
D. Pye and P. Woodland
Effectiveness of Speaker Normalized
HMM
by Projection to Speaker
Subspace
....................................................................................................................................1051
Y. Ariki
Speaker Normalization and Adaptation Based on Linear Transformation
..........................1055
J. Ishii and M.
Tonomura
Speaker-Adapted Training on the Switchboard Corpus
........................................................1059
J. McDonough, T. Anastasakos, G. Zavaliagkos and H. Gish
Speaker Verification and Identification
Model Transformation for Robust Speaker Recognition from Telephone
Data
...........................................................................................................................................1063
F. Beaufays and M. Weintraub
Speaker Recognition with the Switchboard Corpus
...............................................................1067
L. Lamel
and J. Gauvain
Handset-Dependent Background Models for Robust Text-Independent
Speaker Recognition
.................................................................................................................1071
L. Heck and M. Weintraub
Telephone Based Speaker Recognition Using Multiple Binary Classifier
and Gaussian Mixture Models
.................................................................................................1075
P.
Castellano,
S.
Słomka
and
S.
Sridharan
Comparison of Whole Word and
Subword
Modeling Techniques for Speaker
Verification with Limited Training Data
................................................................................1079
S.
Euler,
R.
Langlitz and J.
Zinke
A Comparison of Model Estimation Techniques for Speaker Verification
............................1083
M. Carey, E. Parris, S. Bennett and H. Lloyd-Thomas
Speaker Verification Using Frame and Utterance Level Likelihood
Normalization
...........................................................................................................................1087
S. Nakagawa and K. Markov
A New
Codebook
Training Algorithm for VQ-Based Speaker Recognition
...........................1091
J. He, L. Liu and G. Palm
Bispectrum Features for Robust Speaker Identification
........................................................1095
S. Wenndt and S. Shamsunder
Speaker Identification Based Text to Audio Alignment for an Audio
Retrieval System
.......................................................................................................................1099
D. Roy and
C. M
alamud
Robust Speaker Recognition through Acoustic Array Processing and
Spectral Normalization
.............................................................................................................1103
J. Gonzalez-Rodriguez and J. Ortega-Garcia
Providing Single and Multi-Channel Acoustical Robustness to Speaker
Identification Systems
..............................................................................................................1107
J.
Ortega-García
and J. Gonzalez-Rodriguez
Language and Speaker Identification
Robust Spoken Language Identification Using Large Vocabulary Speech
Recognition
................................................................................................................................1111
J. Hieronymus and S. Kadambe
Double Bigram-Decoding in Phonotactic Language Identification
.......................................1115
J.
Navrátil
and W.
Zühlke
Random Walk Theory Applied to Language Identification
....................................................1119
E. Marcherei
and M.
Savie
Frequency Characteristics of Foreign Accented Speech
.........................................................1123
L. Arslan and J.
Hansen
A Study on Improving Decisions in Closed Set Speaker Identification
................................1127
M. Demirekler and A. Saranli
The Use of Harmonic Features in Speaker Recognition
........................................................1131
B.
Imperi,
Z. Kacie
and
В.
Horvat
An Approach to Speaker Identification Using Multiple Classifiers
......................................1135
V. Radová
and J. Psutka
Spoken Language Systems
Development and Evaluation of the ATOS Spontaneous Speech
Conversational System
.............................................................................................................1139
J. Alvarez, D.
Tapias,
С.
Crespo, I. Cortázar
and
F.
Martinez
A Spoken Language System for Automated Call Routing
.....................................................1143
G. Riccardi, A. Gorin, A. Ljolje and M. Riley
Dialogos:
A Robust System for Human-Machine Spoken Dialogue on the
Telephone
..................................................................................................................................1147
D. Albesano, P. Baggia, M.
Danieli,
R.
Gemello, E. Gerbino
and C. Rullent
Surfin
the
World
Wide
Web
with Japanese
............................................................................1151
К.
Kondo and
С.
Hemphill
Internet Chinese Information Retrieval Using Unconstrained Mandarin
Speech Queries Based on a Client-Server Architecture and a
PAT-Tree-Based Language Model
...........................................................................................1155
L. Chien, M.
Chen,
H.
Wang,
L. Lee,
S. Lin, J. Hong and J. Shen
Combining Key-Phrase Detection and Subword-Based Verification for
Flexible Speech Understanding
...............................................................................................1159
T. Kawahara, C. Lee and B. Juang
Controlling Limited-Domain Applications by Probabilistic Semantic
Decoding of Natural Speech
.....................................................................................................1163
H.
Stahl,
J.
Müller
and M. Lang
Speech Enhancement
Multi-Channel Speech Enhancement in a Car Environment Using Wiener
Filtering and Spectral Subtraction
..........................................................................................1167
J. Meyer and K. Simmer
Weighted Matching Algorithms and Reliability in Noise Cancelling by
Spectral Subtraction
.................................................................................................................1171
N.
Yoma, F. Mclnnes and M. Jack
HMM-Based Speech Enhancement Using Harmonic Modeling
............................................1175
M. Deisher and A. Spanias
Model Based Speech Pause Detection
.....................................................................................1179
B.
McKinley
and G. Whipple
Integrated Speech Enhancement and Coding in the Time-Frequency Domain
...................1183
A. Drygajlo and B.
Carnero
Quality Enhancement of Narrowband CELP-Coded Speech via Wideband
Harmonic Re-Synthesis
............................................................................................................1187
C. Chan and W.
Hui
Speech Enhancement Using CSS-Based Array Processing
...................................................1191
F.
Asano
and S. Hayamizu
Co-Channel Speaker Separation Using Constrained Nonlinear Optimization
....................1195
D. Benincasa and M.
Savie
A Contextual Blind Separation of Delayed and Convolved Sources
.....................................1199
T. Lee and R. Orglmeister
Segregation of Concurrent Speech with the Reassigned Spectrum
......................................1203
G. Meyer, F.
Plante
and F. Berthommier
Enhancement of Esophageal Speech by Injection Noise Rejection
.......................................1207
H. Javkin, M.
Galler
and
N.
Niedzielski
Real-Time Digital Speech Processing Strategies for the Hearing
Impaired
....................................................................................................................................1211
N.
Magotra and S.
Sirivara
Iterative-Batch and Sequential Algorithms for Single Microphone
Speech Enhancement
...............................................................................................................1215
S. Gannot, D. Burshtein and E.
Weinstein
Kalman
Filtering for Low Distortion Speech Enhancement in Mobile
Communication
.........................................................................................................................1219
P.
Sörqvist,
P. Handel and B.
Ottersten
Features for ASR
Exploiting the Potential of Auditory Preprocessing for Robust Speech
Recognition by Locally Recurrent Neural Networks
..............................................................1223
K.
Kasper,
H.
Reininger and D. Wolf
Feature Adaptation Using Deviation Vector for Robust Speech
Recognition in Noisy Environment
..........................................................................................1227
T. Hwang, L. Lee and H. Wang
Binaural Phoneme Recognition Using the Auditory Image Model and
Cross-Correlation
......................................................................................................................1231
K. Francis and T. Anderson
Utterance Dependent Parametric Warping for a Talker-Independent
HMM-Based Recognizer
...........................................................................................................1235
D. Mashao and J. Adcock
Phase-Corrected
RASTA
for Automatic Speech Recognition Over the Phone
......................1239
J.
de
Veth and L. Boves
A Binaural Speech Processing Method Using Subband-Crosscorrelation
Analysis for Noise Robust Recognition
....................................................................................1243
S. Knjita, K. Takeda and F. Itakura
Modelling Asynchrony in Speech Using Elementary Single-Signal
Decomposition
...........................................................................................................................1247
M. Tomlinson, M. Russell, R. Moore, A. Buckland and M. Fawley
Subband-Based Speech Recognition
........................................................................................1251
H. Bourlard and
S. Dupont
Sub-Band Based Recognition of Noisy Speech
........................................................................1255
S. Tibrewala and H.
Hermansky
Recognizing Reverberant Speech with RASTA-PLP
..............................................................1259
B. Kingsbury and
N.
Morgan
Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based
Speech Recognition
...................................................................................................................1263
S. Vaseghi,
N. Harte
and B. Milner
Maximum Likelihood Weighting of Dynamic Speech Features for CDHMM
Speech Recognition
...................................................................................................................1267
J.
Hernando
Speech Recognition Using Automatically Derived Acoustic
Baseforms
................................1271
R.
Rose and E. Lleida
On Combining Frequency Warping and Spectral Shaping in
HMM
Based
Speech Recognition
...................................................................................................................1275
A. Potamianos and R. Rose
Speech Analysis
Recursive Linear Prediction Using
OBE
Identification with Automatic
Bound Estimation
.....................................................................................................................1279
J.
Délier, T.
Lin and
M. Nayeri
Nonlinear Long-Term Prediction of Speech Signals
...............................................................1283
M. Birgmeier, H.
Bernhard
and G. Kubin
Vocal Tract Shape Trajectory Estimation Using
MLP
Analysis-by-Synthesis
...............................................................................................................1287
H. Richards, J. Mason, J. Bridle and M. Hunt
Fast and Robust Joint Estimation of Vocal Tract and Voice Source
Parameters
................................................................................................................................1291
D. Wen and H.
Norio
Spectral Correlates of Glottal Waveform Models: An Analytic Study
...................................1295
B.
Doval
and
C. d
Alessandro
A Time Varying ARMAX Speech Modeling with Phase Compensation Using
Glottal Source Model
................................................................................................................1299
K. Funaki, Y. Miyanaga and K. Tochinai
Speech Representation and Transformation Using Adaptive Interpolation
of Weighted Spectrum: VOCODER Revisited
.........................................................................1303
H. Kawahara
The Weft: A Representation for Periodic Sounds
...................................................................1307
D. Ellis
A Computationally Efficient Algorithm for Calculating Loudness
Patterns of Narrowband Speech
..............................................................................................1311
M.
Hauenstein
Two-Channel Blind Deconvolution for Non-Minimum Phase Impulse
Responses
..................................................................................................................................1315
K. Furuya and Y. Kaneda
Variable Time-Scale Modification of Speech Using Transient Information
.........................1319
S. Lee and H. Kim
Speech Enhancement with Reduction of Noise Components in the Wavelet
Domain
......................................................................................................................................1323
J. Seok and K.
Вае
Blind Separation and Restoration of Signals Mixed in Convolutive
Environment
.............................................................................................................................1327
J.
Xi
and J. Reilly
Construction and Evaluation of a Robust Multifeature Speech/Music
Discriminator
............................................................................................................................1331
E. Scheirer and M. Slaney
Topics in Speech Coding I
Encoding of Speech Spectral Parameters Using Adaptive Quantization
Methods
.....................................................................................................................................1335
I. Lee and H. Woo
Optimal Transformation of LSP Parameters Using Neural Network
...................................1339
H. Vu
and L. Lois
Speech Spectrum Representation and Coding Using Multigrams with
Distance
.....................................................................................................................................1343
J. Cernocký,
G. Baudoin
and
G. Chollet
Incorporating Perception Into LSF Quantization
-
Some Experiments
................................1347
R. Cohn and J. Collura
Predictive VQ for Noisy Channel Spectrum Coding:
AR
Or MA?
..........................................1351
J.
Skoglund
and J. Linden
Efficient Encoding of Mel-Generalized Cepstrum for CELP Coders
.....................................1355
K. Koishida, T. Kobayashi, S. Imai and K. Tokuda
A Candidate Coder for the ITU-T s New Wideband Speech Coding Standard
.....................1359
J. Chen
Perceptual Speech Coding Using Time and Frequency Masking Constraints
.....................1363
B.
Carnero
and A. Drygajlo
A Multi-Band CELP Wideband Speech Coder
........................................................................1367
A. Ubale
and A. Gersho
A Design of Transform Coder for Both Speech and Audio Signals at
1
Bit/sample
..................................................................................................................................1371
T. Moriya,
N.
Iwakami, A. Jin, K. Ikeda and S.
Miki
Speech Quality Assessment of Compounded Digital Telecommunication
Systems: Perceptual Dimensions
.............................................................................................1375
K. Petersen, S.
Hansen
and J. Sorensen
Performance Assessment of Tandem Connection of Cellular and
Satellite-Mobile Coders
............................................................................................................1379
S. Campos
Neto,
F.
Corcoran and A. Karahisar
The Consequences of Linguistic Perception on Low-Rate Speech Coding
............................1383
J. Parry and I. Burnett
Using a Quantitative Psychoacoustical Signal Representation for
Objective Speech Quality Measurement
.................................................................................1387
M.
Hansen
and B. Kollmeier
Speech Models and Features
A Method of Extracting Time-Varying Acoustic Features Effective for
Speech Recognition
...................................................................................................................1391
K. Tanaka and H.
Kojima
Elimination of Trajectory Folding Phenomenon:
HMM,
Trajectory Mixture
HMM
and Mixture Stochastic Trajectory Model
....................................................................1395
I. Illina and Y. Gong
Linear Dynamic
Segmental
HMMs: Variability Representation and
Training Procedure
...................................................................................................................1399
W. Holmes and M. Russell
Model Parameter Estimation for Mixture Density Polynomial Segment
Models
........................................................................................................................................1403
T. Fukada, Y. Sagisaka and K. Paliwal
The Importance of Segmentation Probability in Segment Based Speech
Recognizers
................................................................................................................................1407
J. Verhasselt, J. Martens, I. Illina, J.
Haton
and Y. Gong
Adaptation of Polynomial Trajectory Segment Models for Large
Vocabulary Speech Recognition
...............................................................................................1411
A. Kannan
and M.
Ostendorf
Speaker Adaptation Experiments Using Nonstationary-State Hidden
Markov Models: A MAP Approach
...........................................................................................1415
C. Rathinavelu and L. Deng
Vocabulary Optimization Based on Perplexity
.......................................................................1419
K. Hwang
REMAP for Video Soundtrack Indexing
..................................................................................1423
P. Gelin and C.
Wellekens
Robust Pitch Detection of Speech Signals Using Steerable Filters
.......................................1427
J.
Cai
and Z. Liu
Evaluation of the Relationship Between Emotional Concepts and
Emotional Parameters on Speech
............................................................................................1431
T. Moriyama, H.
Saito
and S. Ozawa
Time-Frequency Analysis of the Glottal Opening
..................................................................1435
W. Wokurek
Time-Frequency Structured Decorrelation of Speech Signals via
Nonseparable
Gabor
Frames
....................................................................................................1439
W. Kozek and H. Feichtinger
Generalized Mixture of HMMs for Continuous Speech Recognition
.....................................1443
F. Korkmazskiy, B. Juang and F. Soong
Topics in ASR
Writer Adaptation of a HMM Handwriting Recognition System
..........................................1447
Л.
Senior and K. Nathan
In-Service Adaptation of Multilingual Hidden-Markov-Models
............................................1451
U. Bub, J.
Köhler
and B.
Imperi
Development
of Dialect-Specific Speech Recognizers Using Adaptation
Methods
.....................................................................................................................................
1455
V. Diakoloukas, V. Digalakis, L. Neumeyer and J.
Kaja
Syllable-Based Relevance Feedback Techniques for Mandarin Voice
Record Retrieval Using Speech Queries
..................................................................................1459
B.
Bai,
L.
Chien
and
L.
Lee
Automatic
Alternative Transcription Generation and Vocabulary
Selection for Flexible Word Recognizers
.................................................................................1463
D. Torre, L. Villarrubia, J. Elvira and L. Hernandez-Gomez
An Advanced System to Generate Pronunciations of Proper Nouns
.....................................1467
N.
Deshmukh, J. Ngan, J.
Hamaker
and J. Picone
Automatic Pronunciation Scoring for Language Instruction
.................................................1471
H. Franco, L. Neumeyer, Y. Kim and O.
Ronen
Speaker-Independent Name Dialing with Out-of-Vocabulary Rejection
..............................1475
С
Ramalingam, L. Netsch and Y.
Kao
Hidden Understanding Models for Statistical Sentence Understanding
..............................1479
R. Schwartz, S. Miller, D.
Stallard
and J. Makhoul
An Alternative Scheme for Perplexity Estimation
.................................................................1483
F.
Bimbót,
M.
El-Beze and
M.
Jardino
Extensions
to Phone-State Decision-Tree Clustering: Single Tree and
Tagged Clustering
.....................................................................................................................1487
D. Paul
Evaluation of Fast Algorithms for Finding the Nearest Neighbor
........................................1491
S. Lubiarz and P.
Lockwood
Fusion of Visual and Acoustic Signals for Command-Word Recognition
..............................1495
R.
Kober,
U.
Harz
and J.
Schiffers
Difference in
Visual Information
Between Face to Face and Telephone
Dialogues
...................................................................................................................................1499
Y. Iwano, Y. Sugita, Y. Kasahara, S. Nakazato and K. Shirai
Compensation (Speaker, Channel, Noise)
Cepstrum-Based Filter-Bank Design Using Discriminative Feature
Extraction Training at Various Levels
....................................................................................1503
A. Biem
and S. Katagiri
Minimum Error Rate Training for Designing Tree-Structured Probability
Density Function
.......................................................................................................................1507
W.
Chou
A Frequency-Weighted
HMM
Based on Minimum Error Classification for
Noisy Speech Recognition
.........................................................................................................1511
H. Matsumoto and
M. Ono
Dictionary-Based Discriminative
HMM
Parameter Estimation for
Continuous Speech Recognition Systems
................................................................................1515
D. Willett, C.
Neukirchen
and J.
Rottland
A DFE-Based Algorithm for Feature Selection in Speech Recognition
.................................1519
A. de la
Torre,
A. Peinado, A. Rubio
and V. Sanchez
Robustness Issues and Solutions in Speech Recognition Based Telephony
Services
......................................................................................................................................1523
V. Raman and V. Ramanujam
Speaker-Dependent Speech Recognition Based on Phone-Like Units Models
-
Application to Voice Dialing
...................................................................................................1527
V. Fontaine and H. Bourlard
Enhanced Control and Estimation of Parameters for a Telephone Based
Isolated Digit Recognizer
.........................................................................................................1531
J. Bauer
HTIMIT and LLHDB: Speech Corpora for the Study of Handset Transducer
Effects
........................................................................................................................................1535
D. Reynolds
Robustness Improvements in Continuously Spelled Names Over the
Telephone
..................................................................................................................................1539
M.
Galler
and J. Junqua
A Fast Algorithm for Stochastic Matching with Application to Robust
Speaker Verification
.................................................................................................................1543
Q. Li, S. Parthasarathy and A. Rosenberg
A Bayesian Predictive Classification Approach to Robust Speech
Recognition
................................................................................................................................1547
Q.
Huo,
H.
Jiang and C. Lee
Robust Speech Recognition Based on Viterbi Bayesian Predictive
Classification
.............................................................................................................................1551
H. Jiang, K. Hirose and Q.
Huo
Speech Coding at Low Bit Rate
Efficient Mixed Excitation Models in LPC Based Prototype
Interpolation Speech Coders
....................................................................................................1555
С
Papanastasiou and C. Xydeas
High Quality Split Band LPC Vocoder Operating at Low Bit Rates
.....................................1559
I. Atkinson, S. Yeldener and A. Kondoz
Non-Linear Techniques for Pitch and Waveform Enhancement in PWI Coders
..................1563
H. Li and
G. Lockhart
Multi-Prototype Waveform Coding Using Frame-by-Frame
Analysis-by-Synthesis
...............................................................................................................1567
I. Burnett and D. Pham
Multiband Prototype Waveform Analysis for Very Low Bit Rate Speech
Coding
........................................................................................................................................1571
K. Yaghmaie and A. Kondoz
A Formant
Vocoder Based on Mixtures of Gaussians
............................................................1575
P. Zolfaghari and T. Robinson
Natural Quality Variable-Rate Spectral Speech Coding Below
3.0
Kbps
.............................1579
E. Erzin, A. Kumar and A. Gersho
A New 2-Kbit/s Speech Coder Based on Normalized Pitch Waveform
..................................1583
Y. Hiwasaki and K.
Mano
A Comparison of the New
2400
Bps MELP
Federal Standard with Other
Standard Coders
.......................................................................................................................1587
M.
Kohier
MELP: The New Federal Standard at
2400
Bps
.....................................................................1591
L.
Supplee,
R.
Cohn,
J. Collura and A. McCree
Using a Perception-Based Frequency Scale in Waveform Interpolation
..............................1595
J.
Thyssen,
В.
Kleijn and R.
Hagen
Very Low Complexity Interpolative Speech Coding at
1.2
to
2.4
Kbps
.................................1599
Y. Shoham
Modified Multiband Excitation Model at
2400
Bps
................................................................1603
M.
Jamrozik and J. Gowdy
Variable Bit Rate MBELP Speech Coding via V/UV Distribution Dependent
Spectral Quantization
...............................................................................................................1607
E. Yu and
С
Chan
Author Index
.............................................................................................................................
Al
|
any_adam_object | 1 |
author_corporate | ICASSP München |
author_corporate_role | aut |
author_facet | ICASSP München |
author_sort | ICASSP München |
building | Verbundindex |
bvnumber | BV011407364 |
ctrlnum | (OCoLC)632730018 (DE-599)BVBBV011407364 |
format | Conference Proceeding Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01203nam a2200289 cc4500</leader><controlfield tag="001">BV011407364</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">970701s1997 ad|| |||| 10||| eng d</controlfield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)632730018</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV011407364</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield><subfield code="a">DE-91G</subfield><subfield code="a">DE-91</subfield></datafield><datafield tag="111" ind1="2" ind2=" "><subfield code="a">ICASSP</subfield><subfield code="n">22</subfield><subfield code="d">1997</subfield><subfield code="c">München</subfield><subfield code="j">Verfasser</subfield><subfield code="0">(DE-588)1901193-3</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">1997 International Conference on Acoustics, Speech and Signal Processing</subfield><subfield code="b">ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center</subfield><subfield code="n">2</subfield><subfield code="p">Speech processing</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">1997</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Munich</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">S. 711 - 1610</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)1071861417</subfield><subfield code="a">Konferenzschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="w">(DE-604)BV011372148</subfield><subfield code="g">2</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung TU Muenchen</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-007669545</subfield></datafield></record></collection> |
genre | (DE-588)1071861417 Konferenzschrift gnd-content |
genre_facet | Konferenzschrift |
id | DE-604.BV011407364 |
illustrated | Illustrated |
indexdate | 2024-07-09T18:09:14Z |
institution | BVB |
institution_GND | (DE-588)1901193-3 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-007669545 |
oclc_num | 632730018 |
open_access_boolean | |
owner | DE-29T DE-91G DE-BY-TUM DE-91 DE-BY-TUM |
owner_facet | DE-29T DE-91G DE-BY-TUM DE-91 DE-BY-TUM |
physical | S. 711 - 1610 Ill., graph. Darst. |
publishDate | 1997 |
publishDateSearch | 1997 |
publishDateSort | 1997 |
record_format | marc |
spelling | ICASSP 22 1997 München Verfasser (DE-588)1901193-3 aut 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing 1997 Munich S. 711 - 1610 Ill., graph. Darst. txt rdacontent n rdamedia nc rdacarrier (DE-588)1071861417 Konferenzschrift gnd-content (DE-604)BV011372148 2 Digitalisierung TU Muenchen application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center |
subject_GND | (DE-588)1071861417 |
title | 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center |
title_auth | 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center |
title_exact_search | 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center |
title_full | 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing |
title_fullStr | 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing |
title_full_unstemmed | 1997 International Conference on Acoustics, Speech and Signal Processing ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center 2 Speech processing |
title_short | 1997 International Conference on Acoustics, Speech and Signal Processing |
title_sort | 1997 international conference on acoustics speech and signal processing icassp 97 april 21 24 1997 munich germany gasteig munich s cultural center speech processing |
title_sub | ICASSP 97 ; April 21 - 24, 1997, Munich, Germany, Gasteig, Munich's cultural center |
topic_facet | Konferenzschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=007669545&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV011372148 |
work_keys_str_mv | AT icasspmunchen 1997internationalconferenceonacousticsspeechandsignalprocessingicassp97april21241997munichgermanygasteigmunichsculturalcenter2 |