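To address the drop in dependency parsing performance caused by Tibetan compound sentences, this paper proposes a discriminative method for segmenting and labeling Tibetan compound sentences. Using the function-word grammatical structure and conjunction features inherent to Tibetan, a compound sentence is first segmented and labeled into basic units for syntactic parsing. After parsing, the parts are merged according to the main-subordinate clause relations to produce the complete analysis of the compound sentence. Experimental results show that the method reduces, to some extent, the complexity of dependency parsing for Tibetan compound sentences, and the final parsing accuracy reaches 88.72%.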
This paper proposes a discriminative clause identification method to address the drop in dependency parsing performance caused by Tibetan compound sentences. In this method, a compound sentence is first divided into syntactic analysis units according to the inherent features of conjunctions. Each clause is then parsed independently. Finally, the whole dependency tree is generated by merging the parses of the individual clauses. Experimental results show that the method reduces the complexity of parsing and raises parsing accuracy to 88.72%.
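
The split, parse, and merge pipeline summarized above can be illustrated with a minimal Python sketch. It is not the paper's implementation: the marker set `CLAUSE_MARKERS`, the stand-in parser `head_final_parse`, and the rule that the last clause is the main clause are all assumptions made for illustration. In particular, the paper uses a discriminative model to find clause boundaries, whereas this sketch swaps in a plain marker lookup.

```python
from typing import Callable, List, Sequence, Set, Tuple

# Hypothetical placeholder marker. A real system would rely on Tibetan
# conjunctions and function words, scored by a trained discriminative model.
CLAUSE_MARKERS: Set[str] = {"CONJ"}


def split_clauses(tokens: Sequence[str],
                  markers: Set[str] = CLAUSE_MARKERS) -> List[Tuple[int, int]]:
    """Return (start, end) spans; each span ends right after a marker token."""
    spans: List[Tuple[int, int]] = []
    start = 0
    for i, tok in enumerate(tokens):
        if tok in markers:
            spans.append((start, i + 1))
            start = i + 1
    if start < len(tokens):
        spans.append((start, len(tokens)))  # trailing clause without a marker
    return spans


def head_final_parse(clause: Sequence[str]) -> List[int]:
    """Stand-in parser (assumption): attach every token to the clause-final
    token. Returns clause-local head indices, with -1 for the clause root."""
    last = len(clause) - 1
    return [-1 if i == last else last for i in range(len(clause))]


def parse_compound(tokens: Sequence[str],
                   parse_clause: Callable[[Sequence[str]], List[int]] = head_final_parse,
                   markers: Set[str] = CLAUSE_MARKERS) -> List[int]:
    """Parse each clause independently, then merge into one head array.
    The result uses sentence-level indices; -1 marks the sentence root."""
    heads = [-1] * len(tokens)
    clause_roots: List[int] = []
    for start, end in split_clauses(tokens, markers):
        local_heads = parse_clause(tokens[start:end])
        for offset, head in enumerate(local_heads):
            if head == -1:
                clause_roots.append(start + offset)
            else:
                heads[start + offset] = start + head  # shift to sentence index
    # Assumption: the last clause is the main clause, so every earlier
    # clause root is attached to the root of the last clause.
    if clause_roots:
        main_root = clause_roots[-1]
        for root in clause_roots[:-1]:
            heads[root] = main_root
    return heads
```

For example, `parse_compound(["a", "b", "CONJ", "c", "d"])` returns `[2, 2, 4, 4, -1]`: the first clause is parsed on its own with its root at index 2, and that root is then attached to the main-clause root at index 4. Because each clause is parsed separately, the parser only ever sees short inputs, which is the source of the complexity reduction the abstract reports.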