
Faites connaître cet article à vos amis:
Speech Recognition by Man and Machine: Influence of Speaking Rate, Style, and Effort on the Recognition Performance of Human Listeners and Automatic Classifiers
Bernd T. Meyer
Speech Recognition by Man and Machine: Influence of Speaking Rate, Style, and Effort on the Recognition Performance of Human Listeners and Automatic Classifiers
Bernd T. Meyer
While human listeners have little problems in dealing with the strong variation in spoken language, the same cannot be said about automatic speech recognition (ASR). This work compares recognition performance of man and machine with the aim of learning from the distinct errors between these two. Based on the differences, the signal processing mechanisms are analyzed that are suitable to increase the robustness of ASR. The comparison focuses on the influence of intrinsic variation of speech, i.e., changes in speaking rate, effort and style, as well as dialect and accent. The outcome of the experiments suggests that the processing of temporal cues in ASR bears room for improvement. Therefore, spectro-temporal features are employed as input to ASR systems, which results in an increase of recognition performance for varying speaking effort and speaking style compared to standard features. This documents the usefulness of spectro-temporal and temporal information for automatic recognizers.
Médias | Livres Paperback Book (Livre avec couverture souple et dos collé) |
Validé | 3 novembre 2010 |
ISBN13 | 9783838121550 |
Éditeurs | Suedwestdeutscher Verlag fuer Hochschuls |
Pages | 140 |
Dimensions | 226 × 8 × 150 mm · 213 g |
Langue et grammaire | English |
Voir tous les Bernd T. Meyer ( par ex. Paperback Book )