针对蜜罐中数据分析系统的薄弱性,提出了基于PCA和改进的ReliefF方法的告警日志分析系统。通过主成分分析,去除特征之间的冗余性,能够有效降低算法的复杂性,再利用改进的ReliefF算法,选择出最能代表样本的特征,构成有效特征子集,实现特征的降维。该方法能够在保证较高分类精度的同时,显著提高分类速度,并在一定程度上实现了数据分析的智能化和自动化,实验结果表明了其正确性。
An analysis system of the alert log based on PCA and improved ReliefF is proposed to resolve the problem of weak data analysis system in honeypot. Through the analysis of the principle component, it can effectively decrease the redundancy of features and reduce the complexity of the algorithm. Then by using improved ReliefF to take out the most representative features which constitute the effective feature sets, the dimensions of the features are decreased. The method effectively decreases the time of classification with a higher accuracy, and achieves the target of intelligent and automated data analysis. In the end, experimental results show that the method is correct.