Full metadata record
DC FieldValueLanguage
dc.contributor.authorLee, HJen_US
dc.contributor.authorWang, JSen_US
dc.date.accessioned2014-12-08T15:01:58Z-
dc.date.available2014-12-08T15:01:58Z-
dc.date.issued1997-03-01en_US
dc.identifier.issn0167-8655en_US
dc.identifier.urihttp://hdl.handle.net/11536/695-
dc.description.abstractA scientific document usually consists of text and mathematical expressions. In this paper, we present a system for segmenting and understanding text and mathematical expressions in a document, The system can be divided into six stages: page segmentation and labeling, character segmentation, feature extraction, character recognition, expression formation, and error correction and expression extraction. After we extract all text lines in a document, we separate all symbols in each text line and calculate direction-feature vectors and aspect ratios for those symbols. Then, a nearest-neighbor algorithm recognizes characters. In the expression formation stage, we build a symbol relation tree for each text line that represents the relationships among the symbols in the text line. Each text line is decomposed into a collection of primitive tokens: operands, operators and separators. Heuristic rules based on these primitive tokens are used to correct text recognition errors. Finally, we extract all mathematical expressions according to basic expression forms. Several pages of documents were scanned to test the method. All mathematical expressions are understood. In the expressions generated, a few symbols are misrecognized. The average recognition rate was 96.16%. (C) 1997 Elsevier Science B.V.en_US
dc.language.isoen_USen_US
dc.subjectcharacter segmentationen_US
dc.subjectcharacter recognitionen_US
dc.subjectexpression formationen_US
dc.subjecterror correctionen_US
dc.titleDesign of a mathematical expression understanding systemen_US
dc.typeArticleen_US
dc.identifier.journalPATTERN RECOGNITION LETTERSen_US
dc.citation.volume18en_US
dc.citation.issue3en_US
dc.citation.spage289en_US
dc.citation.epage298en_US
dc.contributor.department交大名義發表zh_TW
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentNational Chiao Tung Universityen_US
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:A1997WZ62900008-
dc.citation.woscount25-
Appears in Collections:Articles


Files in This Item:

  1. A1997WZ62900008.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.