针对海量医学图像存储及检索效率低的问题,该文提出利用Hadoop平台分别实现大量医学图像的分布式存储以及并行处理模式下的基于内容的医学图像检索。利用HIPI图像处理接口将医学图像上传到分布式文件系统(HDFS)中;然后,分别提取图像的形状以及纹理特征,并将其特征向量存储到HDFS中;最后,利用MapReduce模型实现并行式检索并将图像检索结果按照相似度大小进行排序及显示。实验结果表明,在Hadoop云平台下大量医学图像的存储效率以及检索效率较高,且图像数量越多效率优势越明显。
Since the storage and retrieval efficiency of massive medical images is low,the Hadoop platform is used to realize the distributed storage of the massive medical images and content?based medical image retrieve in the parallel processing mode respectively. The Hadoop image processing interface(HIPI)is adopted to upload the medical images to the Hadoop distributed file system(HDFS). The shape and texture features of the images are extracted respectively,and their feature vectors are stored in HDFS. The MapReduce model is employed to realize the parallel retrieval,and sort and display the image retrieval results ac?cording to their similarity. The experimental results show that the efficiency of the massive medical images storage and retrieval by means of Hadoop cloud platform is high,and the efficiency advantage is significantly obvious with the increase of the image quantity.