高维大数据中如何快速查询有用信息是目前面临的问题之一。为了解决维度灾难带来的问题,本文分析了传统的相似性度量函数,然后进行了改进,提出了的新算法能够应用到高维数据空间,仿真实验表明,该算法在高维空间中查询的精确度优于其他传统距离函数,并对噪音有抵抗性。
How to query useful information quickly in high-dimension data is one of the problems. In order to solve the problem of dimension disaster, the traditional similarity measure function were analyzed and improved in this paper. The new algorithm can be applied to the high-dimensional data space. The simulation results show that the query algorithm is better than other traditional distance function in high dimension accuracy, and it can resistant to noise.