Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chien, Jen-Tzung | en_US |
dc.contributor.author | Chueh, Chuang-Hua | en_US |
dc.date.accessioned | 2014-12-08T15:28:42Z | - |
dc.date.available | 2014-12-08T15:28:42Z | - |
dc.date.issued | 2012 | en_US |
dc.identifier.isbn | 978-1-4673-1026-0 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/20765 | - |
dc.description.abstract | Topic model can be established by using Dirichlet distributions as the prior model to characterize latent topics in natural language. However, topics in real-world stream data are non-stationary. Training a reliable topic model is a challenging study. Further, the usage of words in different paragraphs within a document is varied due to different composition styles. This study presents a hierarchical segmentation model by compensating the heterogeneous topics in stream level and the heterogeneous words in document level. The topic similarity between sentences is calculated to form a beta prior for stream-level segmentation. This segmentation prior is adopted to group topic-coherent sentences into a document. For each pseudo-document, we incorporate a Markov chain to detect stylistic segments within a document. The words in a segment are generated by identical composition style. This new model is inferred by a variational Bayesian EM procedure. Experimental results show benefits by using the proposed model in terms of perplexity and F measure. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Machine Learning | en_US |
dc.subject | Topic Model | en_US |
dc.subject | Graphical Model | en_US |
dc.subject | Hierarchical Segmentation | en_US |
dc.title | LATENT DIRICHLET LEARNING FOR HIERARCHICAL SEGMENTATION | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.journal | 2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | en_US |
dc.contributor.department | 電機資訊學士班 | zh_TW |
dc.contributor.department | Undergraduate Honors Program of Electrical Engineering and Computer Science | en_US |
dc.identifier.wosnumber | WOS:000311966000063 | - |
Appears in Collections: | Conferences Paper |