完整後設資料紀錄
DC 欄位語言
dc.contributor.authorLee, HJen_US
dc.contributor.authorWang, JSen_US
dc.date.accessioned2014-12-08T15:01:58Z-
dc.date.available2014-12-08T15:01:58Z-
dc.date.issued1997-03-01en_US
dc.identifier.issn0167-8655en_US
dc.identifier.urihttp://hdl.handle.net/11536/695-
dc.description.abstractA scientific document usually consists of text and mathematical expressions. In this paper, we present a system for segmenting and understanding text and mathematical expressions in a document, The system can be divided into six stages: page segmentation and labeling, character segmentation, feature extraction, character recognition, expression formation, and error correction and expression extraction. After we extract all text lines in a document, we separate all symbols in each text line and calculate direction-feature vectors and aspect ratios for those symbols. Then, a nearest-neighbor algorithm recognizes characters. In the expression formation stage, we build a symbol relation tree for each text line that represents the relationships among the symbols in the text line. Each text line is decomposed into a collection of primitive tokens: operands, operators and separators. Heuristic rules based on these primitive tokens are used to correct text recognition errors. Finally, we extract all mathematical expressions according to basic expression forms. Several pages of documents were scanned to test the method. All mathematical expressions are understood. In the expressions generated, a few symbols are misrecognized. The average recognition rate was 96.16%. (C) 1997 Elsevier Science B.V.en_US
dc.language.isoen_USen_US
dc.subjectcharacter segmentationen_US
dc.subjectcharacter recognitionen_US
dc.subjectexpression formationen_US
dc.subjecterror correctionen_US
dc.titleDesign of a mathematical expression understanding systemen_US
dc.typeArticleen_US
dc.identifier.journalPATTERN RECOGNITION LETTERSen_US
dc.citation.volume18en_US
dc.citation.issue3en_US
dc.citation.spage289en_US
dc.citation.epage298en_US
dc.contributor.department交大名義發表zh_TW
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentNational Chiao Tung Universityen_US
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:A1997WZ62900008-
dc.citation.woscount25-
顯示於類別:期刊論文


文件中的檔案:

  1. A1997WZ62900008.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。