依据依存句法理论,该文制订了藏语句法标注体系及层次结构。通过分析构建藏语依存树库中存在的问题,提出了半自动依存树库构建模式,针对藏语特性提出了融合丰富特征的词对依存分类模型和依存边标注模型,实现了依存树库构建可视化工具,校对构建了1.1万句藏语依存句法树后,在基线系统下经实验验证,依存识别正确率提高了3%,使构建藏语依存树库工作取得了有效进展。
According dependency syntactic theory this paper gave Tibetan typed dependencies and its hierarchy,and then we analyzed some problems in building Tibetan dependency Treebank.We proposed a mode to construct dependency tree semi-automatically,it includes word-pairs dependency classification model and dependency edges annotation model with rich features template based on Tibetan language grammar.And we implemented visualized tool which used to build and proofreading 11thousand sentences Treebank.On the baseline system the experimental results show that,the dependency recognition accuracy obtains an improvement of 3%.