Research on Recovering Locative Reference Points in Chinese (汉语中方位参考点恢复研究)

Journal: Journal of Computer Research and Development (计算机研究与发展), 2007, 44(02): 265-268; key journal
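Classification: TP391.2 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
Author affiliations: [1] School of Computer Science, Suzhou Vocational University, Suzhou, Jiangsu 215104; [2] College of Information and Computer Engineering, Northeast Forestry University, Harbin 150040; [3] School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001
Funding: National Natural Science Foundation of China (Grant No. 60575041)
Related project: Ontology-based 3D visualization of spatial concepts described in natural language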

Keywords: multi-agent, chunking, distributed

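Chinese abstract (translated):

To compare the performance of different methods, chunking is usually evaluated on public training and test sets. However, the public training and test sets used for experiments tend to be small and limited to particular domains, so chunking precision drops sharply on real cross-domain corpora. Using real open corpora, this paper designs several groups of experiments to study how different part-of-speech tagging results, corpora from different domains, and different knowledge bases affect chunking, and examines whether a distributed English chunking strategy based on a multi-agent architecture can be applied in practical systems. The experiments show that this strategy reaches an F-measure of 92% on real open corpora, which is largely sufficient for practical use.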

English abstract:

Public corpora are often used in research in order to compare the performance of different methods. However, public corpora are intended only for experimentation, so they are usually small and limited to a particular domain. As a result, chunking accuracy drops on real corpora from different domains. In this paper, several experiments are designed to study how different part-of-speech tagging results, corpora from different domains, and different repositories influence chunking. The feasibility of using a distributed multi-agent English chunking strategy in real application systems is examined. In tests on a real open corpus, the F-score of English chunking with the multi-agent model reaches 92%, which largely satisfies practical needs.
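The abstract reports an F-measure of 92% for chunking but this record does not say how the score was computed. As a minimal sketch, assuming the common chunk-level convention (a predicted chunk counts as correct only if its type and both boundaries match the gold chunk exactly) and assuming BIO-style tags, the measure could be computed as below. The function names `extract_chunks` and `f_measure` and the sample tag sequences are hypothetical and are not taken from the paper.

```python
from typing import List, Set, Tuple

# Hypothetical representation: (chunk type, start index, end index exclusive)
Chunk = Tuple[str, int, int]


def extract_chunks(tags: List[str]) -> Set[Chunk]:
    """Collect (type, start, end) spans from a BIO tag sequence."""
    chunks: Set[Chunk] = set()
    start, ctype = None, None
    for i, tag in enumerate(tags + ["O"]):  # trailing "O" closes the last chunk
        continues = tag.startswith("I-") and ctype == tag[2:]
        if continues:
            continue
        if ctype is not None:
            chunks.add((ctype, start, i))
        if tag.startswith(("B-", "I-")):
            # An "I-" tag with no open chunk of that type is treated as a start.
            start, ctype = i, tag[2:]
        else:
            start, ctype = None, None
    return chunks


def f_measure(gold_tags: List[str], pred_tags: List[str], beta: float = 1.0):
    """Return (precision, recall, F_beta) over exactly matching chunks."""
    gold, pred = extract_chunks(gold_tags), extract_chunks(pred_tags)
    correct = len(gold & pred)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    b2 = beta * beta
    f = (1 + b2) * precision * recall / (b2 * precision + recall)
    return precision, recall, f


if __name__ == "__main__":
    # Made-up example data, for illustration only.
    gold = ["B-NP", "I-NP", "B-VP", "B-NP", "O"]
    pred = ["B-NP", "I-NP", "B-VP", "O", "O"]
    p, r, f = f_measure(gold, pred)
    print(f"precision={p:.2f} recall={r:.2f} F={f:.2f}")
```

With `beta = 1.0` this is the harmonic mean of precision and recall; whether the paper used this weighting is not stated in the record.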
