复句关系类别的识别是对复句分句之间语义关系的甄别,是分析复句语义关系的关键。在现代汉语复句中,二句式和三句式复句占绝大多数,而三句式复句又可以拆分为二句式复句,所以多句式复句的研究归结起来就是二句式复句的研究。以二句式非充盈态有标复句为研究对象,结合汉语复句的句法理论、关系标记搭配理论,以汉语复句语料库以及搜索引擎获取的复句为语料,进行二句式非充盈态有标复句关系类别的自动标志。该方法对二句式非充盈态有标复句关系类别进行自动识别,准确率达到89%,实验结果证明了该方法的有效性。
Recognition of relation category is screening for semantic relation of clauses in a compound sentence, and it is the key to analyze semantic relationships of Chinese compound sentences, Most of Chinese compound sentences have two or three clauses and compound sentences with three clauses can be divided into two clauses. Therefore, the study of compound sentences with multiple clauses comes down to the study of compound sentences with two clauses. This paper described a study of non-saturated compound sentences with two clauses, using syntactic theory and the collocation theory of the relation markers of Chinese compound sentences. The data source came from the corpus of Chinese compound sentences and some compound sentences from the search engine. The study finally recognized relation category of non-saturated compound sentences with two clauses. The experimental results show that the accuracy of automatic identification of non-saturated compound sentences with two clauses is 89% ,the new measure is more effective.