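To address the difficulty that existing confusion network generation methods have in balancing speed and quality, this paper studies a lattice segmentation method based on cross-section consistency and a lattice segmentation method based on maximum confidence, and investigates how these two methods can reduce the impact of segmentation on confusion network quality. A fast method for generating high-quality confusion networks based on lattice segmentation is proposed. The method splits the original large-scale lattice into small lattices and generates a confusion network from each, which reduces the computation scale and speeds up network generation. The balance between speed and quality is adjusted through the number of segments. Experimental results show that, compared with the word-clustering algorithm, the proposed method significantly increases the speed of confusion network generation with very little effect on confusion network quality. In terms of decoding performance, at the same speed the proposed method achieves a lower error rate than the word-clustering algorithm with pruning.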
To address the problem that existing confusion network generation methods cannot achieve a good trade-off between generation speed and confusion network quality, this paper investigates two major lattice segmentation methods with the aim of reducing the impact of segmentation on the quality of confusion networks and, on this basis, presents a fast method for generating high-quality confusion networks based on lattice segmentation. The method segments the large-scale lattice produced by automatic speech recognition (ASR) into a sequence of smaller sub-lattices and then generates confusion networks from these sub-lattices, which markedly reduces the computation scale and increases the network generation speed. The balance between generation speed and network quality is controlled by the number of segments. The experimental results show that the proposed method significantly improves the speed of confusion network generation while maintaining almost the same quality as the traditional word-clustering method without lattice segmentation. At the same speed, the proposed method obtains a lower tonal syllable error rate than the word-clustering method with lattice pruning.