In the big data era, the volume of vector data is growing rapidly, and efficient storage methods for vector big data are urgently needed. This paper proposes a cloud storage strategy for vector data based on HBase. First, a quadtree subdivision method is applied to build a multi-level grid index, and the vector data are clustered and partitioned along a Hilbert space-filling curve. Second, a unique identifier for each vector feature is designed by combining the multi-level grid code with the Hilbert sequence code, and this identifier serves as the feature's row key in HBase. Third, the storage rules for vector data are designed in detail. Finally, two comparative experiments verify the clustering effect of the unique identifier and the query efficiency of the stored vector data; the results demonstrate the feasibility and effectiveness of the method.
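The idea of combining a grid level with a Hilbert code into a row key can be illustrated with a minimal sketch. It is not the encoding used in this paper: the key layout (two-digit level, zero-padded Hilbert code, underscore, eight-digit feature number), the global longitude/latitude extent, and the names `hilbert_index` and `make_row_key` are all assumptions made for illustration. The Hilbert mapping itself is the standard iterative cell-to-distance conversion. The codes are zero-padded because HBase orders row keys lexicographically by bytes, so fixed-width codes keep features that are adjacent along the curve adjacent in storage.

```python
def hilbert_index(order: int, x: int, y: int) -> int:
    """Map cell (x, y) of a 2**order x 2**order grid to its Hilbert distance."""
    n = 1 << order
    d = 0
    s = n >> 1
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/flip the quadrant so the sub-curve has the right orientation.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s >>= 1
    return d


def make_row_key(level: int, lon: float, lat: float, feature_id: int) -> str:
    """Build a hypothetical row key: grid level + Hilbert code + feature number.

    Assumes a quadtree over the whole globe, so level L has 2**L x 2**L cells.
    """
    n = 1 << level
    # Locate the grid cell containing the point; clamp the upper boundary.
    x = min(int((lon + 180.0) / 360.0 * n), n - 1)
    y = min(int((lat + 90.0) / 180.0 * n), n - 1)
    h = hilbert_index(level, x, y)
    # Fixed width so that string order matches numeric order within a level.
    width = len(str(4 ** level - 1))
    return f"{level:02d}{h:0{width}d}_{feature_id:08d}"


if __name__ == "__main__":
    # Two features in neighbouring level-3 cells.
    print(make_row_key(3, 10.0, 10.0, 1))
    print(make_row_key(3, 50.0, 10.0, 2))
```

Under this layout, a prefix scan on the level and a range of Hilbert codes would retrieve the features in a run of neighbouring cells with a single sequential read, which is the property the row-key design aims for.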