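Acquiring sentiment words is the foundation of text orientation analysis. To overcome the inefficiency of manual identification, and to offer methods and ideas for research on Uyghur sentiment words and for building a sentiment lexicon, this paper first analyzes the features that Uyghur sentiment words exhibit in context. Combining these with the grammatical features of Uyghur itself, it builds an extended feature model for Uyghur and pairs that model with the term frequency-inverse document frequency (TF-IDF) algorithm to recognize Uyghur sentiment words. Experimental results show that the feature model effectively improves the recognition rate of sentiment words.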
Sentiment vocabulary is essential for sentiment analysis. To address the inefficiency of manual acquisition, this paper proposes an extended feature set based on the grammatical and contextual characteristics of Uyghur sentiment words. Combined with the TF-IDF measure, the algorithm is shown experimentally to improve the recognition of sentiment words.
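For readers unfamiliar with TF-IDF, the following is a minimal sketch of how candidate words might be scored and then re-weighted by an additional feature signal. It is not the paper's model: the abstract does not say how the features and TF-IDF are combined, so the multiplicative `feature_bonus` weighting, the function names, and the English placeholder tokens are all assumptions made for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: tf-idf} dict per tokenized document.

    Uses the textbook form tf * log(N / df); other variants exist.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def rank_candidates(docs, feature_bonus):
    """Sum tf-idf over all documents, then scale each term by
    (1 + bonus). The bonus dict is a hypothetical stand-in for
    grammar/context features; the combination rule is an assumption.
    """
    totals = Counter()
    for doc_scores in tf_idf(docs):
        for term, score in doc_scores.items():
            totals[term] += score
    boosted = {
        term: score * (1.0 + feature_bonus.get(term, 0.0))
        for term, score in totals.items()
    }
    return sorted(boosted.items(), key=lambda kv: kv[1], reverse=True)


# Placeholder corpus (English tokens stand in for segmented Uyghur text).
docs = [
    ["good", "film", "very", "good"],
    ["bad", "film"],
    ["film", "very", "long"],
]
# Hypothetical feature signal, e.g. the word follows a degree adverb.
ranking = rank_candidates(docs, {"good": 0.5, "bad": 0.5})
```

In this toy corpus, a word that occurs in every document ("film") receives a score of zero, while words concentrated in a single document and supported by the feature signal rise to the top of the ranking.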