完整後設資料紀錄
DC 欄位語言
dc.contributor.authorHsu, Sheng-Binen_US
dc.contributor.authorLee, Chang-Hsingen_US
dc.contributor.authorChang, Pei-Chunen_US
dc.contributor.authorHan, Chin-Chuanen_US
dc.contributor.authorFan, Kuo-Chinen_US
dc.date.accessioned2019-04-02T05:59:13Z-
dc.date.available2019-04-02T05:59:13Z-
dc.date.issued2018-12-01en_US
dc.identifier.issn1520-9210en_US
dc.identifier.urihttp://dx.doi.org/10.1109/TMM.2018.2834866en_US
dc.identifier.urihttp://hdl.handle.net/11536/148467-
dc.description.abstractInvestigating the identity, distribution, and evolution of bird species is important for both biodiversity assessment and environmental conservation. The discrete wavelet transform (DWT) has been widely exploited to extract time-frequency features for acoustic signal analysis. Traditional approaches usually compute statistical measures (e.g., maximum, mean, standard deviation) of the DWT coefficients in each subband independently to yield the feature descriptor, without considering the intersubband correlation. A new acoustic descriptor, called the local wavelet acoustic pattern (LWAP), is proposed to characterize the correlation of the DWT coefficients in different subbands for birdsong recognition. First, we divide a variable-length birdsong segment into a number of fixed-duration texture windows. For each texture window, several LWAP descriptors are extracted. The vector of locally aggregated descriptors (VLAD) is then used to aggregate the set of LWAP descriptors into a single VLAD vector. Finally, principal component analysis (PCA) plus linear discriminant analysis (LDA) are employed to reduce the feature dimensionality for classification purposes. Experiments on two birdsong datasets show that the proposed LWAP descriptor outperforms other local descriptors, including linear predictive coding cepstral coefficients, Mel-frequency cepstral coefficients, perceptual linear prediction cepstral coefficients, chroma features, and prosody features. Furthermore, the proposed LWAP descriptor, followed by VLAD encoding, PCA plus LDA feature extraction, and a simple distance-based classifier, yields promising results that are competitive with those obtained by the state-of-the-art convolutional neural networks.en_US
dc.language.isoen_USen_US
dc.subjectBirdsong recognitionen_US
dc.subjectdiscrete wavelet transform (DWT)en_US
dc.subjectvector of locally aggregated descriptors (VLAD)en_US
dc.titleLocal Wavelet Acoustic Pattern: A Novel Time-Frequency Descriptor for Birdsong Recognitionen_US
dc.typeArticleen_US
dc.identifier.doi10.1109/TMM.2018.2834866en_US
dc.identifier.journalIEEE TRANSACTIONS ON MULTIMEDIAen_US
dc.citation.volume20en_US
dc.citation.spage3187en_US
dc.citation.epage3199en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000450212600001en_US
dc.citation.woscount0en_US
顯示於類別:期刊論文