现有语言学角度的义项区分/归纳在系统性、原则和标准等方面还有待改善。尝试采取以下措施改善现有研究:在知网分类体系的框架内,根据词语词性的不同,分别参考配价理论、事件类型和生成词库理论的物性结构等,按照一定的步骤和参数,对词语的义项加以区分、归纳,提出确定词语义项数目的原则性基础和3个区分义项的标准:研究显示,义项区分/归纳的可操作性、完全性和离散性都得到了较好的提高。
Existing study in word sense induction (WSI) from linguistic perspective need further im-provements in systemacity, principle, and methods. The following measures are used to improve exist-ing studies: (1) WSI is carried out against the background of linguistic ontology; (2) WSI is carried out according to the part of speech of words, certain steps, and parameters, and refferring to valency theory, event type theory, and qualia structure theory of the Generative Lexicon. A principle is offered to decide on the sense number of a word and 3 standards are offered for WSI. Studies show that operat- ability, completeness and discreteness of WSI are improved obviously.