The formula recognition problem is divided into two parts: character segmentation and structure analysis. This paper studies the whole process of mathematical formula recognition systematically. Using an adaptive character segmentation method, in which characters are over-segmented and then re-merged, together with a baseline (BST) structure analysis algorithm, general mathematical formulas are recognized successfully and with a relatively high recognition rate. The experimental results show that this recognition method based on baseline structure analysis can handle most printed formulas.
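To make the idea of baseline structure analysis concrete, the following is a minimal sketch and not the algorithm used in this paper. It assumes that character segmentation has already produced labelled bounding boxes, and it places each symbol as inline, superscript, or subscript by comparing its vertical centre with the current baseline symbol. The names `Box`, `relation`, and `to_latex`, and the threshold `t = 0.25`, are hypothetical choices made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """A segmented, already-recognized symbol (hypothetical structure)."""
    label: str
    x: float       # left edge
    top: float     # y of the top edge (image coordinates, y grows downward)
    bottom: float  # y of the bottom edge


def relation(base: Box, sym: Box, t: float = 0.25) -> str:
    """Classify sym relative to base as 'super', 'sub', or 'inline'.

    t is an assumed threshold: the fraction of the base symbol's height
    that counts as the superscript or subscript zone.
    """
    height = base.bottom - base.top
    centre = (sym.top + sym.bottom) / 2
    if centre < base.top + t * height:
        return "super"
    if centre > base.bottom - t * height:
        return "sub"
    return "inline"


def to_latex(boxes: list[Box]) -> str:
    """Scan symbols left to right and emit a flat LaTeX string."""
    out = []
    base = None
    for box in sorted(boxes, key=lambda b: b.x):
        if base is None:
            base = box
            out.append(box.label)
            continue
        rel = relation(base, box)
        if rel == "super":
            out.append("^{" + box.label + "}")
        elif rel == "sub":
            out.append("_{" + box.label + "}")
        else:
            # An inline symbol becomes the new baseline reference.
            out.append(box.label)
            base = box
    return "".join(out)


symbols = [
    Box("x", 0, 10, 30),
    Box("2", 12, 2, 12),
    Box("+", 25, 14, 26),
    Box("y", 40, 10, 30),
    Box("i", 52, 26, 36),
]
result = to_latex(symbols)
```

This sketch handles only one level of superscripts and subscripts on a single baseline. Nested scripts, fractions, and radicals would require a recursive, tree-shaped representation.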