东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于最大-最小相似度学习方法的文本提取

ISSN号：1000-9825
期刊名称：《软件学报》
时间：0
分类：TP391[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]北京理工大学计算机科学与技术学院智能信息技术北京市重点实验室,北京100081, [2]北京林业大学信息学院,北京100083, [3]中国科学院自动化研究所模式识别国家重点实验室,北京100080
相关基金：Supported by the National Natural Science Foundation of China under Grant No.60473049 （国家自然科学基金）; the National Basic Research Program of China under Grant No.2006CB303105（国家重点基础研究发展计划（973））; the Excellent Young Scholars Research Fund of Beijing Institute of Technology of China under Grant No.2006Y1202 （北京理工大学优秀青年教师资助计划）

关键词：文本提取, 高斯混合模型, 判别学习, 最大-最小相似度学习, 最小分类错误学习, text extraction, Gaussian mixture modeling, discriminative training, maximum-minimum similarity training, minimum classification error training

中文摘要：

应用最大-最小相似度（maximum-minimum similarity,简称MMS）学习方法，对基于高斯混合模型的文本区域提取方法中的有关参数进行优化．该学习方法通过最大化正样本相似度和最小，化反样本相似度获得最佳分类能力．根据这种判别学习思想，建立了相应的目标函数，并利用最速梯度下降法寻找目标函数最小值，以得到文本区域提取方法的最优参数集合．文本区域提取实验结果表明：在用期望最大化（expectation maximization，简称EM）算法获得参数的极大似然估计值后，使用最大．最小相似度学习方法，使文本提取综合性能明显提高，开放实验的召回率和准确率分别达到98．55％和93．56％．在实验中，最大．最小相似度学习方法的表现还优于常用的判别学习方法——最小分类错误（minimum classification error,简称MCE）学习方法．

英文摘要：

This paper proposes a maximum-minimum similarity training algorithm to optimize the parameters in the effective method of text extraction based on Gaussian mixture modeling of neighbor characters. The maximum-minimum similarity training （MMS） methods optimize recognizer performance through maximizing the similarities of positive samples and minimizing the similarities of negative samples. Based on this approach to discriminative training, it defines the objective function for text extraction, and uses the gradient descent method to search the minimum of the objective function and the optimum parameters for the text extraction method. The experimental results of text extraction show the effectiveness of MMS training in text extraction, Compared with the maximum likelihood estimation of parameters from expectation maximization （EM） algorithm, the training results after MMS has the performance of text extraction improved greatly. The recall rate of 98.55% and the precision rate of 93.56% are achieved. The experimental results also show that the maximum-minimum similarity （MMS） training behaves better than the commonly used discriminative training of the minimum classification error （MCE）.

同期刊论文项目

基于立体视觉的动态手势识别与语义描述方法

期刊论文 21 会议论文 10 获奖 2

同项目期刊论文

Face recognition with local st

Symmetrical null space LDA for

DETECTOR:基于关系数据库通用的在线关键词查询系统

A Bottom-up algorithm for find

EyeScreen: A Vision-Based Gest

一种基于视觉的手指屏幕交互方法

一种基于最小割算法的稠密深度恢

基于自适应聚合的立体视觉合作算

Fisher non-negative matrix fac

Precise shape measurement of d

基于非参数信念传播的可行C-空间

人体三维运动实时跟踪与建模系统

基于小波和匹配跟踪的分层图像编码算法

基于Gabor字典的低速率视频编码

基于互补子空间线性判别分析的人脸识别

图像中多语种文本提取的高斯混合建模方法

基于非参数信念传播的可行C-空间关节人手跟踪方法

层级潜变量空间中的三维人手跟踪方法

EyeScreen： A Vision-Based Gesture Interaction System

Hand Motion Tracking Using Simulated Annealing Method in a Discrete Space

期刊信息

《软件学报》
北大核心期刊（2011版）

主管单位:中国科学院
主办单位:中国科学院软件研究所中国计算机学会
主编：赵琛
地址：北京8718信箱中国科学院软件研究所
邮编：100190
邮箱：jos@iscas.ac.cn
电话：010-62562563

国际标准刊号：ISSN：1000-9825
国内统一刊号：ISSN：11-2560/TP
邮发代号:82-367

获奖情况:
2001年入选中国期刊方阵“双百期刊”,2000年荣获中国科学院优秀科技期刊一等奖

国内外数据库收录:
俄罗斯文摘杂志,美国数学评论（网络版）,波兰哥白尼索引,德国数学文摘,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:54609