文本作为语言的视觉形式是人类最重要的交流工具,基于文本的信息隐藏算法具有很高的实用价值.文本的一个明显特点是高度凝练,信息冗余少,因而文本隐藏的容量较低.另一方面,传统的文本隐藏建立在修改载体的基础上,而文本语义对于修改非常敏感,微小的修改可能引起明显异常,因而文本隐藏的隐蔽性较差.针对以上问题,深入分析了网络文本大数据的特点,据此设计了一种基于网络文本大数据的信息隐藏算法,把对载体的修改转化成对载体的检索,并通过位置信息定位秘密消息,从而不需要修改载体即可嵌入信息,隐蔽性大大提高.另外,实验结果显示本算法具有接近通用字符编码效率的嵌入率(18比特/字符),是一种高效的信息隐藏算法.
As the visual form of language, text is the most important media of communication for human beings. Text based informa- tion hiding algorithm has high practical value in many fields. The hiding capacity of text is low because it is highly concise with less redundant information. On the other hand, traditional text information hiding is based on modifying carder text which is so sensitive to changes that even minor modifications can cause obvious abnormalities, making hiding easy to be detected. In this paper, we analyze the characteristics of Web text and big data, and design a novel information hiding algorithm based on it. Modification of carder is transformed into search of carrier,and position information can locate secret message. In this way,we can embed information without modifying the carder so that undetectability is greatly improved. Experimental results show that the embedding efficiency is close to general character encoding efficiency (18bits/character), so this algorithm is an efficient information hiding method.