Title: An HMM-Based Algorithm for Content Ranking and Coherence-Feature Extraction
Authors: Liu, Chien-Liang
Hsaio, Wen-Hoar
Lee, Chia-Hoang
Chi, Hsiao-Cheng
Department of Computer Science
Keywords: Coherence-feature extraction; hidden Markov model (HMM); input devices and strategies; natural language processing (NLP); predictive content
Publication date: 1-Mar-2013
Abstract: This paper proposes the coherence hidden Markov model (HMM), an algorithm for extracting coherence features and ranking content. Coherence HMM is a variant of the HMM that models essay writing as a stochastic process: topics are the hidden states and the sequence of clauses is the observations. Its parameters are estimated with probabilistic latent semantic analysis. For coherence-feature extraction, support vector regression (SVR) trained on surface features and coherence features is used to grade essays. The experimental results indicate that SVR benefits from the coherence features: the adjacent agreement rate and the exact agreement rate are 95.24% and 59.80%, respectively. When the same experiment is run on high-scoring essays, the adjacent and exact agreement rates are 98.33% and 64.50%, respectively. For content ranking, we design and implement an intelligent assisted blog-writing system based on the coherence-HMM ranking model, which draws on several corpora to help users compose blog articles efficiently. When a user finishes composing a clause or sentence, the system offers candidate texts for reference based on the content of that clause or sentence. The experimental results show that all participants benefited from the system and saved considerable time when writing articles.
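The exact and adjacent agreement rates quoted in the abstract can be illustrated with a short sketch. The abstract does not define the two metrics, so this assumes the definitions commonly used in automated essay scoring: exact agreement is the share of essays where the predicted score equals the human score, and adjacent agreement is the share where the two differ by at most one point. The function name `agreement_rates`, the `tolerance` parameter, and the sample scores are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple


def agreement_rates(
    predicted: Sequence[int],
    human: Sequence[int],
    tolerance: int = 1,
) -> Tuple[float, float]:
    """Return (exact_rate, adjacent_rate) as fractions in [0, 1].

    Assumed definitions (not stated in the abstract):
      exact    -> predicted score equals the human score
      adjacent -> predicted score is within `tolerance` points of it
    """
    if len(predicted) != len(human):
        raise ValueError("score lists must have the same length")
    if len(predicted) == 0:
        raise ValueError("score lists must not be empty")

    pairs = list(zip(predicted, human))
    exact = sum(1 for p, h in pairs if p == h)
    adjacent = sum(1 for p, h in pairs if abs(p - h) <= tolerance)
    return exact / len(pairs), adjacent / len(pairs)


# Hypothetical scores for five essays, for illustration only.
human_scores = [4, 3, 5, 2, 4]
model_scores = [4, 4, 5, 4, 3]

exact_rate, adjacent_rate = agreement_rates(model_scores, human_scores)
print(f"exact: {exact_rate:.2%}, adjacent: {adjacent_rate:.2%}")
```

Under these definitions every exact match also counts as adjacent, so the adjacent rate is never lower than the exact rate, which is consistent with the figures reported in the abstract.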
URI: http://dx.doi.org/10.1109/TSMCA.2012.2207104
http://hdl.handle.net/11536/21738
ISSN: 2168-2216
DOI: 10.1109/TSMCA.2012.2207104
Journal: IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
Volume: 43
Issue: 2
Start page: 440
End page: 450
Appears in Collections: Articles


Files in This Item:

  1. 000317614400017.pdf
