A Discussion of Special Sampling Methods for Rare Events in Credit Scoring Models

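ISSN: 1007-3116
Journal: 统计与信息论坛 (Statistics & Information Forum)
Date: 2012.11.11
Pages: 15-19
Classification: F222.3 [Economics and Management: National Economy]
Author affiliations: [1] Center for Applied Statistics, Renmin University of China, Beijing 100872; [2] School of Statistics, Renmin University of China, Beijing 100872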
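Funding: National Social Science Fund of China, Youth Project "中国银行业信用风险管理的理论与实践--基于稀有事件和拒绝推断的信用评分模" (09CTJ003); Renmin University of China Scientific Research Fund (Fundamental Research Funds for the Central Universities) "基于高频和超高维数据的中国金融市场若干重大问题研究" (10XNL007)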
Related project: Research on Measuring and Preventing Extreme Stock-Market Risk Based on High-Frequency Data

Keywords: credit scoring models, rare event, unbalanced data, special sampling

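Chinese abstract (translated):

The modeling sample for a credit scoring model is unbalanced data, made up of bad customers (a rare event) and good customers (a common event). The paper therefore characterizes the difficulty of identifying rare events in terms of the variance of the model residuals. Borrowing methods that machine learning uses for unbalanced data, it applies special sampling to the rare events in the modeling sample before fitting the model, and proves that after special sampling an empirical formula must be used to correct the sample bias. Empirical analysis shows that this is an effective way to improve the accuracy of credit scoring models.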

English abstract:

The modeling samples of credit scoring models are unbalanced data consisting of the rare event of being a bad customer and the common event of being a good one. The difficulty of rare-event detection is characterized in terms of the variance of the model residuals. Special sampling methods used for unbalanced data in machine learning are applied to the modeling samples of credit scoring models. It is shown that an empirical correction formula must be used to correct the sample bias caused by the special sampling. The results of the empirical study demonstrate the effectiveness of this method.
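
As a rough illustration of the workflow the abstract describes (resample the rare class, fit a model, then correct the sample bias), here is a minimal Python sketch. The abstract does not state the paper's sampling scheme or its empirical correction formula, so both are assumptions here: the sketch undersamples good customers and applies the standard prior (odds) correction used for case-control samples. The names `undersample_goods`, `correct_probability`, and the `"bad"` field are hypothetical.

```python
import random


def undersample_goods(records, target_bad_rate, seed=0):
    """Keep every bad customer and a random subset of good customers.

    Assumption: each record is a dict with a 0/1 "bad" field. Undersampling
    is one possible special sampling scheme, not necessarily the paper's.
    """
    rng = random.Random(seed)
    bads = [r for r in records if r["bad"] == 1]
    goods = [r for r in records if r["bad"] == 0]
    # Number of goods needed so that bads make up target_bad_rate of the sample.
    n_goods = round(len(bads) * (1.0 - target_bad_rate) / target_bad_rate)
    return bads + rng.sample(goods, min(n_goods, len(goods)))


def correct_probability(p_sample, sample_bad_rate, population_bad_rate):
    """Map a probability fitted on the resampled data back to the population.

    Assumption: standard prior correction, which rescales the odds by the
    ratio of population odds to sample odds. It stands in for the paper's
    empirical formula, which the abstract does not give.
    """
    bad_weight = population_bad_rate / sample_bad_rate
    good_weight = (1.0 - population_bad_rate) / (1.0 - sample_bad_rate)
    numerator = p_sample * bad_weight
    return numerator / (numerator + (1.0 - p_sample) * good_weight)


# Toy data: 20 bad customers among 1000, i.e. a 2% population bad rate.
records = [{"bad": 1}] * 20 + [{"bad": 0}] * 980
balanced = undersample_goods(records, target_bad_rate=0.5)

# A model fitted on the balanced sample would output probabilities on the
# sample scale; 0.5 there corresponds to the population base rate.
corrected = correct_probability(0.5, sample_bad_rate=0.5, population_bad_rate=0.02)
```

Without the correction step, scores fitted on the balanced sample would overstate the probability of default, because the sample contains far more bad customers than the population does.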
