基于图像中物体之间的空间关系的图像检索往往受困于待处理的图像中物体种类和空间位置难以自动准确地获取.文中基于物体识别算法的输出,提出一种对物体空间关系的三元组表示法,给出基于这种表示方法对图像索引、相似度计算和检索排序的方法及允许用户使用查询词和空间关系表达查询需求的二维输入界面,并实现原型系统.这种表示法具有良好的鲁棒性,可容忍物体识别算法一定程度的误差,将物体识别得到的置信度加入三元组表示法置信度计算和排序算法中,减少物体识别结果误差对检索性能的影响.在原型系统上的实验表明,该系统在实验中对包含物体位置关系的检索给出更准确的结果,在NDCG@m、MAP、F@m上均优于现有系统.
For the image retrieval system based on spatial relationship of objects in images, it is hard to automatically recognize objects and their spatial relations correctly. Based on the outputs of object detection algorithms, a triple representation of the spatial relationship in images is proposed. Based on the representation, a method for indexing images, computing similarities and ranking results is proposed. A 2D user-match interface is also developed for users to express their needs in terms of retrieval keywords and spatial relationships, and a prototype is established. The representation is robust against errors of object detection. Incorporating the confidence given by object detection into the triple representation and ranking method, the impact of object detection errors on the performance of image retrieval is reduced. With the queries comprising explicit spatial relationship, the proposed approach gives more accurate results in experiments. It performs better than the existing systems in terms of NDCG@ m, MAP and F@ m.