哈萨克斯拉夫图像文本经过行切分和列切分后,存在水平方向接触和垂直方向重叠的粘连字符。为提高字符识别率,依据字符连通域的最小外接矩形切分开垂直方向重叠的粘连字符图像块;利用判决条件:字符宽度概率密度分布图、字符图像块垂直投影的波峰数目和字符图像块垂直投影波峰的对称性,分离初始粘连字符图像块中正确的单个字符图像块和实际接触的粘连字符图像块;在允许的字符宽度范围内,寻找粘连字符图像垂直投影图的极小值点,以切分实际接触的粘连字符。实验结果表明,该方法泛化能力较好且识别率有明显提高。
After line and column segmentation of the Kazakh Slavic image text,there is adhesion between characters.To improve the character recognition rate,according to the minimum circumscribed rectangle of connected domain,the vertical overlapping image block of characters was cut.Decision conditions adopted included word probability density distribution of wide,vertical projection wave number,and vertical projection wave symmetry,those were used to separate the correct individual characters of image block and the actual contacted touching character image block.In the range of allowed characters width,minimum points of touching character image vertical projection were searched to cut the actual contacted adhesive characters.The experimental results show this method makes recognition rate improved.