随着互联网技术的迅猛发展,隐私保护已成为个人或机构关心的基本问题,各种数据挖掘工具的出现使得隐私泄露问题日益突出.通常移除标识符的方式发布数据是无法阻止隐私泄露的,攻击者仍然可以通过链接操作以很高的概率来获取用户的隐私数据.匿名化是目前数据发布环境下实现隐私保护的主要技术之一.论文筒要介绍了匿名化技术的相关概念和基本原理,主要从匿名化原则、匿名化方法和匿名化度量等方面对匿名化技术研究现状进行了深入分析和总结,最后指出匿名化技术的研究难点以及未来的研究方向.
With the rapid development of Internet technology, privacy preservation has been an essential issue for individuals or organizations. The emergence of kinds of data mining tools makes privacy disclosure issues be increasingly critical. The method of releasing data through removing identifier from the table could not truly prevent privacy disclosure. The attacker can also infer the private data of the corresponding individual with high probability by linking operation. Anonymization is one of the primary techniques reali- zing privacy protection in data dissemination environment. The paper simply introduces the general concept and basal principle of the anonymization techniques, and mainly analyzes and summarizes the progress of the anonymization principles, anonymization methods and anonymization metrics. Finally, the present problems and directions for future research are discussed.