汉语缺乏严格的形态束缚,在句子里双音词经常由于嵌入相关句法成分而呈现出分离。为了提高自然语言处理中语段分词、词性标注及基于规则的句子语义计算的准确性与有效度等,文章系统考察了现代汉语里典型词与离合词的分离现象,挖掘并刻画出它们各自形成的语法动因、条件、意义以及分离而成的成分的语法特性等,从而对现代汉语中双音词的分离做出了系统多层级的理论思考。文章指出典型词的分离是语用作用的结果,分离结构的意义是主观的;而离合词分离是句法语义作用的产物,意义是客观的。
With flexible morphological rules, Chinese is common to be observed for the disyllable word split by int- setting some syntactical constituents of a sentence. To facilitate the word segmentation, part-of-speech tagging and semantic computing, the paper systematically discusses two types of the disyllable word split in Mandarin: one for the the Separable word, and the other is for the Prototype word. It examines describes the grammatical movitation of their formation~ properties, and meanings, indicating that 1) the Prototype word split is caused by pragmatic fac- tors, with subjective structures and grammatical meanings; yet 2) the Separable word split is caused by the syntactic and semantic expression, with objective structures and meanings.