No Braille corpus has yet been built, either in China or abroad. To build a multi-functional, comprehensive Chinese Braille corpus, we collect material from Braille publications, non-published Braille texts, and texts produced by blind people over the more than 60 years since the current Chinese Braille scheme was promulgated, aiming to cover the full range of Braille usage. This paper describes the main elements of material collection for the corpus in terms of material survey, material selection, and the collection workflow, and identifies the key issues and difficulties in material collection together with strategies for addressing them.