为了提高维吾尔语中机构名的自动识别准确率,从维吾尔语的语言特点出发,对维吾尔语中机构名的组织结构进行了分类并将其形式化表示;根据此特征设计出有效地识别规则,创建了特征词库、地名库和修饰词库等知识库;设计并实现了基于状态转移原理的高效识别算法。实验结果表明,该算法识别的F值达到83.05%,获得了较好结果。
To improve the automatic recognation of organization name in Uyghur, through anaiyms of the charactersitics of Uyghur organization name, the following work was done. First, the organization name in Uyghur was classified depending on its structure and it was formally described. After then, effective recognizing rules were desingned according to these features, knowledge base was created such as features word base, place name base and qualifier word base. Finally, efficient recognition algorithm was designed and implemented based on the principles of state transition. Representative examples from the Tianshan net news were selected to build the test set for organization name recognition, experimental results showed that, this method achieved better results with the F measure of 83.05 %.