Full metadata record
DC FieldValueLanguage
dc.contributor.authorChi, Taishihen_US
dc.contributor.authorShamma, Shihab A.en_US
dc.date.accessioned2014-12-08T15:16:18Z-
dc.date.available2014-12-08T15:16:18Z-
dc.date.issued2006-07-01en_US
dc.identifier.issn1558-7916en_US
dc.identifier.urihttp://dx.doi.org/10.1109/TSA.2005.860828en_US
dc.identifier.urihttp://hdl.handle.net/11536/12092-
dc.description.abstractWe examine the encoding of acoustic spectra by parameters derived from singularities found in their multiscale auditory representations. The multiscale representation is a wavelet transform of an auditory version of the spectrum, formulated based on findings of perceptual experiments and physiological research in the auditory cortex. The multiscale representation of a spectral pattern usually contains well-defined singularities in its phase function that reflect prominent features of the underlying spectrum such as its relative peak locations and amplitudes. Properties (locations and strength) of these singularities are examined and employed to reconstruct the original spectrum by using an iterative projection algorithm. Although the singularities form a nonconvex set, simulations demonstrate that a well-chosen initial pattern usually converges on a good approximation of the input spectrum. Perceptually intelligible speech can be resynthesized from the reconstructed auditory spectrograms, and hence these singularities can potentially serve as efficient features in speech compression. Besides, the singularities are very noise-robust which makes them useful features in various applications such as vowel recognition and speaker identification.en_US
dc.language.isoen_USen_US
dc.subjectauditory modelen_US
dc.subjectconvex projectionen_US
dc.subjectphase singularityen_US
dc.subjectspectrum restorationen_US
dc.titleSpectrum restoration from multiscale auditory phase singularities by generalized projectionsen_US
dc.typeArticleen_US
dc.identifier.doi10.1109/TSA.2005.860828en_US
dc.identifier.journalIEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSINGen_US
dc.citation.volume14en_US
dc.citation.issue4en_US
dc.citation.spage1179en_US
dc.citation.epage1192en_US
dc.contributor.department電信工程研究所zh_TW
dc.contributor.departmentInstitute of Communications Engineeringen_US
dc.identifier.wosnumberWOS:000238709200010-
dc.citation.woscount0-
Appears in Collections:Articles


Files in This Item:

  1. 000238709200010.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.