Normal view MARC view ISBD view

Contemporary Methods for Speech Parameterization [electronic resource] /by Todor Ganchev.

by Ganchev, Todor [author.]; SpringerLink (Online service).

Material type: materialTypeLabel

BookSeries: SpringerBriefs in Electrical and Computer Engineering: Publisher: New York, NY : Springer New York, 2011.Description: X, 114p. 32 illus., 23 illus. in color. online resource.ISBN: 9781441984470.Subject(s): Engineering | Computer science | Translators (Computer programs) | Engineering | Signal, Image and Speech Processing | Language Translation and Linguistics | User Interfaces and Human Computer InteractionDDC classification: 621.382 Online resources: Click here to access online

Contents:

Basic Concepts and Applicability of Speech Parameterization -- Survey on speech parameterization -- Fourier transform based methods -- Wavelet packets based methods -- Evaluation on the speech recognition task -- Evaluation on the speaker recognition task -- Practical considerations -- Links to code and further sources of information.

In: Springer eBooksSummary: Contemporary Methods for Speech Parameterization offers a general view of short-time cepstrum-based speech parameterization and provides a common ground for further in-depth studies on the subject. Specifically, it offers a comprehensive description, comparative analysis, and empirical performance evaluation of eleven contemporary speech parameterization methods, which compute short-time cepstrum-based speech features. Among these are five discrete wavelet packet transform (DWPT)-based, six discrete Fourier transform (DFT)-based speech features and some of their variants which have been used on the speech recognition, speaker recognition, and other related speech processing tasks. The main similarities and differences in their computation are discussed and empirical results from performance evaluation in common experimental conditions are presented. The recognition accuracy obtained on the monophone recognition, continuous speech recognition and speaker recognition tasks is contrasted against the one obtained for the well-known and widely used Mel Frequency Cepstral Coefficients (MFCC). It is shown that many of these methods lead to speech features that do offer competitive performance on a certain speech processing setup when compared to the venerable MFCC. The last does not target the promotion of certain speech features but instead aims to enhance the common understanding about the advantages and disadvantages of the various speech parameterization techniques available today and to provide the basis for selection of an appropriate speech parameterization in each particular case.

Tags from this library: No tags from this library for this title. Add tag(s)

average rating: 0.0 (0 votes)

Holdings ( 3 )
Title notes
Comments ( 0 )

Item type	Current location	Call number	Status
		TA1637-1638 (Browse shelf)	Available
		TK7882.S65 (Browse shelf)	Available
Long Loan	MAIN LIBRARY	TK5102.9 (Browse shelf)	Available

Close shelf browser

Previous								Next
Previous	TK7882.S65 Spoken Dialogue Systems Technology and Design	TK7882.S65 Entropy and Information Theory	TK7882.S65 Analysis of Engineering Drawings and Raster Map Images	TK7882.S65 Contemporary Methods for Speech Parameterization	TK7882.S65 Video Segmentation and Its Applications	TK7882.S65 Electronics for Guitarists	TK7882.S65 Digital Signal Processing for In-Vehicle Systems and Safety	Next

Contemporary Methods for Speech Parameterization offers a general view of short-time cepstrum-based speech parameterization and provides a common ground for further in-depth studies on the subject. Specifically, it offers a comprehensive description, comparative analysis, and empirical performance evaluation of eleven contemporary speech parameterization methods, which compute short-time cepstrum-based speech features. Among these are five discrete wavelet packet transform (DWPT)-based, six discrete Fourier transform (DFT)-based speech features and some of their variants which have been used on the speech recognition, speaker recognition, and other related speech processing tasks. The main similarities and differences in their computation are discussed and empirical results from performance evaluation in common experimental conditions are presented. The recognition accuracy obtained on the monophone recognition, continuous speech recognition and speaker recognition tasks is contrasted against the one obtained for the well-known and widely used Mel Frequency Cepstral Coefficients (MFCC). It is shown that many of these methods lead to speech features that do offer competitive performance on a certain speech processing setup when compared to the venerable MFCC. The last does not target the promotion of certain speech features but instead aims to enhance the common understanding about the advantages and disadvantages of the various speech parameterization techniques available today and to provide the basis for selection of an appropriate speech parameterization in each particular case.

There are no comments for this item.

Koha online

Contemporary Methods for Speech Parameterization [electronic resource] /by Todor Ganchev.

by Ganchev, Todor [author.]; SpringerLink (Online service).

Close shelf browser