软件缺陷数据是软件质量分析和改进的重要基础数据之一。如何在分析缺陷数据前对缺陷数据进行有效的预处理,如何根据缺陷特征对缺陷数据进行合理分类,如何对缺陷数据进行挖掘以及统计分析,是软件缺陷研究领域面临的问题。详细介绍了缺陷数据预处理、缺陷分类以及缺陷数据挖掘分析3个方面的研究内容、方法和技术,并对这些方法进行了比较和分析,最后提出了几个软件缺陷数据处理研究领域需要进一步研究的问题。
Software defect is important basic data for software quality analysis and improvement. The problems faced by software defect research is how to preprocess noisy defect data effectively before analysis, how to classify deject data according to their characters, and how to mine and analyze them. This paper described the contents, methods and technologies of the above mentioned three problems, then compared and analyzed them, at last proposed several problems of defect data which is worth further studying.