为了准确而高效地进行大规模场景理解,提出基于增量计算的条件随机场下的大规模场景致密语义地图构建方法.该方法利用双目视觉估算相机运动轨迹,根据图像序列语义标注结果构建语义地图.递增的语义地图的构建过程是关键,需要检测致密化处理后的输入帧相较于前一帧的新增体素,对新增体素内部三维点过分割成超体素,利用前后多帧的标注结果指导超体素的标注,如此逐帧地将新增体素融合到语义地图中.该方法将时序上的先验信息作为条件随机场中的数据项,依据超体素的邻接关系定义平滑项,利用图割法求解新增超体素的标签.实验表明,该方法能够获取准确的大规模语义地图,有效减少对冗余点的处理,改善图像上的标注结果.
In order to efficiently achieve accurate large-scale scene understanding result,A new large scale dense semantic mapping system was proposed.The system constructed a map by incrementally calculating with a conditional random field model.The method used stereo visual odometry to get the motion of the camera,and used the labeled image sequences to build semantic map.The key point was to incrementally build the semantic map which detected newly built voxels,over-segment the points within these voxels into supervoxels,labeled these supervoxels under the guidance of neighboring frames and used the rigid transformation matrix to fuse the newly labeled points with the already built map.A conditional random field model was constructed which took labeling results of sequential frames as the data term,took the coherent labeling constraint between neighboring supervoxels as the pairwise term and solved the model by graph cut.Experimental evaluations show that the approach can get an accurate large scale semantic map and decrease computational cost,The approach can improve the labeling results at image level.