随着计算机和互联网技术的发展和普及,计算机病毒所带来的安全威胁日趋严重。基于特征码扫描的病毒检测技术是目前检测已知病毒最为简单、有效的方法,但病毒特征码需要经验丰富的计算机病毒分析师手动从病毒中提取出来,其效率并不高。提出了一种基于N-Gram的病毒特征码自动提取方法,将N-Gram统计语言模型应用到病毒特征码提取中。通过实验证明了该算法能有效提取病毒特征码。
With the development of computer and Internet, security threats brought by computer virus become more and more serious. Virus detection based on signature code is the simplest method. But virus analysts with rich experience are needed to extract signature from virus, and it is inefficient. A computer virus signature automatic extraction method based on N Gram is presented by this paper, and the N gram statistical language model is applied in virus signature extraction. It is proved that the algorithm can extract virus signature efficientiy