Correctly identifying sentence boundaries is the first problem that Tibetan text processing must solve, and the peculiarities of punctuation in written Tibetan are the main reason sentence boundary detection (SBD) is difficult. This paper studies boundary detection for sentences that end in an auxiliary verb, a common pattern in modern written Tibetan, and proposes an SBD method for such sentences that draws on the characteristics of Tibetan punctuation.