Speaker-independent phoneme recognition on TIMIT database using integrated time-delay neural networks (TDNNs):

Abstract: "This paper describes a new structure of Neural Networks (NNs) for speaker-independent and context-independent phoneme recognition. This structrure is based on the integration of Time-Delay Neural Networks (TDNN, Waibel et al.[1988]) which have several TDNNs separated according to the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Hataoka, Nobuo (VerfasserIn), Waibel, Alex H. (VerfasserIn)
Format: Buch
Sprache:English
Veröffentlicht: Pittsburgh, Pa. 1989
Schriftenreihe:Carnegie-Mellon University <Pittsburgh, Pa.> / Computer Science Department: CMU-CS 89,190
Schlagworte:
Zusammenfassung:Abstract: "This paper describes a new structure of Neural Networks (NNs) for speaker-independent and context-independent phoneme recognition. This structrure is based on the integration of Time-Delay Neural Networks (TDNN, Waibel et al.[1988]) which have several TDNNs separated according to the duration of phonemes. As a result, the proposed structure has the advantage that it deals with phonemes of varying duration more efficiently. In the experimental evaluation of the proposed new structure, 16-English vowel recognition was performed using 5268 vowel tokens picked from 480 sentences spoken by 140 speakers (98 males and 42 females) on the TIMIT(TI-MIT) database
The number of training tokens and testing tokens was 4326 from 100 speakers (69 males and 31 females) and 942 from 40 speakers (29 males and 11 females), respectively. The result was a 60.5% recognition rate (around 70% for a collapsed 13-vowel case), which was improved from 56% in the single TDNN structure, showing the effectiveness of the proposed new structure to use temporal information.
Beschreibung:21 S.

Es ist kein Print-Exemplar vorhanden.

Fernleihe Bestellen Achtung: Nicht im THWS-Bestand!