东篱科研大数据发现系统（DRDS）

欢迎您！东篱公司退出

申报数据库
1. 申报指南
立项数据库
成果数据库
1. 期刊论文
2. 会议论文
3. 著作
4. 专利
项目获奖数据库

位置：成果数据库 > 期刊 > 期刊详情页

基于多核融合的中文领域实体关系抽取

ISSN号：1003-0077
期刊名称：《中文信息学报》
时间：0
分类：TP301[自动化与计算机技术—计算机系统结构;自动化与计算机技术—计算机科学与技术]
作者机构：[1]昆明理工大学信息工程与自动化学院,昆明650500, [2]昆明理工大学智能信息处理重点实验室,昆明650500
相关基金：国家自然科学基金（61262041,61472168,61562052）资助项目;云南省自然科学基金重点项目（2013FA030）资助项目.

作者：郭剑毅, 陈鹏, 余正涛, 线岩团, 毛存礼, 赵君

关键词：条件随机场模型, 越南语分词, 词法, 基本特征, 最大熵, 歧义模型, condition random fields（CRFs） , Vietnamese segmentation, morphology, essential characteristics, maximum entropy, ambiguity model

中文摘要：

通过对越南语词法特点的研究，把越南语的基本特征融入到条件随机场中（Condition random fields，CRFs），提出了一种基于CRFs和歧义模型的越南语分词方法。通过机器标注、人工校对的方式获取了25981条越南语分词语料作为CRFs的训练语料。越南语中交叉歧义广泛分布在句子中，为了克服交叉歧义的影响，通过词典的正向和逆向匹配算法从训练语料中抽取了5377条歧义片段，并通过最大熵模型训练得到一个歧义模型，并融入到分词模型中。把训练语料均分为10份做交叉验证实验，分词准确率达到了96．55％。与已有越南语分词工具VnTokenizer比较，实验结果表明该方法提高了越南语分词的准确率、召回率和F值。

英文摘要：

The Vietnamese lexical features are discussed and essential characteristics ot Vmtnamese are integrated into condition random fields （CRFs） to propose a Vietnamese word segmentation method based on CRFs and ambiguity model. The segmentation corpus consisting of 25 981 Vietnamese is ob tained as a training corpus of CRFs by computer marking and artificial proofreading. Vietnamese crossing ambiguity is widely distributed in the sentence. To eliminate the effects of crossing ambiguity, 5 377 am- biguity fragments are extracted from training corpus through dictionary of the forward and reverse matc- hing algorithm. An ambiguity model is obtained by training the maximum entropy model. Then they are both ineorparted into the segmentation model. The training corpus is divided into ten copies evenly for cross validation experiments. The segmentation accuracy reaches 96.55 % in the experiment. Experimen- tal results show that the method improves the segmentation accuracy rate, the recall rate and the F value of Vietnamese word obviously, compared with Vietnamese segmentation tool VnTokenizer.

同期刊论文项目

特定领域实体关系获取与实体链接

期刊论文 2

专家检索资源获取与学习排序方法研究

期刊论文 57 会议论文 15

同项目期刊论文

Question answering oriented muti-kernel support vector data description user modeling

Research on semantic label extraction of domain entity relation based on CRF and rules

结合概率型神经网络(PNN)和学习矢量量化(LVQ)算法的文本分类方法

Sparseness of least squares support vector machines based on active learning

领域本体概念实例、属性和属性值的抽取及关系预测

Chinese question classification transfer learning method based on feature mapping

Multi-page Chinese expert metadata extraction method based on the 3D model

基于自适应聚类的虚假评论检测

专家证据文档识别无向图模型

Restricted domain question-answering text retrieval method based on supervised latent dirichlet allo

Approaches to Detect Micro-blog User Interest Communities through theIntegration of Explicit User Re

The Expert Ranking MethodBased on Listwise with Associated Features

A Chinese expert disambiguation method based on semi-supervised graph clustering

Concept name similarity calculation based on wordnet and ontology

基于随机游走策略的专家关系网络构建

Fusion of long distance dependency features for chinese named entity recognition based on Markov log

Multi-page Chinese expert metadata extraction model based on the fuzzy clustering

结合FCA与Jena的领域本体半自动构建方法研究

Expert ranking method based on ListNet with multiple features

基于有指导LDA用户兴趣模型的微博主题挖掘

基于特征映射的微博用户标签兴趣聚类方法

基于Listwise的深度学习专家排序方法

基于主题信息的项目评审专家推荐方法

基于凸组合核函数的中文领域实体关系抽取

Chinese Question Classification Based on Question Property Kernel

Expert List-wise RankingMethod based on Sparse Learning

A news event ranking method based on list with attributes and relationship

Unstructured data extraction of Chinese expert web page

基于深度神经网络的有色金属领域实体识别

基于标签传播算法的新词情感极性识别

基于实体消歧的中文实体关系抽取

基于LM算法的领域概念实体属性关系抽取

基于主题-对立情感依赖模型的虚假评论检测方法

融合领域知识短语树核函数的中文领域实体关系抽取

The CIV semantic relations extraction based on Markov logic networks

融合词频特性及邻接变化数的微博新词识别

基于半监督图聚类的项目主题模型构建方法

Review expert collaborative recommendation algorithm based on topic relationship

一种融合 PageRank 的协同过滤帖子推荐方法

一种基于特征映射的中文专家消歧方法

融合结构和内容特征提取多类型网页文本要素

基于半监督主动学习的虚假评论检测

基于多核学习算法的钢铁生产轧钢过程故障检测

基于特征加权重叠度的中文实体协同消歧方法

基于灰色关联分析的中文新闻事件关联性识别

期刊信息

《中文信息学报》
北大核心期刊（2011版）

主管单位:中国科学技术协会
主办单位:中国中文信息学会中国科学院软件研究所
主编：孙茂松
地址：北京海淀中关村南四街4号中科院软件所
邮编：100190
邮箱：jcip@iscas.ac.cn
电话：010-62562916

国际标准刊号：ISSN：1003-0077
国内统一刊号：ISSN：11-2325/N
邮发代号:

获奖情况:

国内外数据库收录:
日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:9136