将染色质免疫共沉淀技术(ChIP)与下一代高通量测序技术相结合的染色质免疫共沉淀测序(ChIP-seq),已成为功能基因组学、特别是基因表达调控领域研究的关键技术。ChIP-seq实验带来的海量数据向生物信息学研究人员提出了新的挑战。由于此领域数据处理技术的发展大大滞后于实验技术进步,有必要系统地介绍和回顾ChIP-seq数据处理的各个方面,以便更多研究人员进入此领域设计或改进相应的算法。文章结合实例详细介绍了ChIP-seq数据整个流程,并重点讨论了其中的主要问题和关键环节,为这一研究领域的科研人员提供一个快速而深入的认识。
The next-generation sequencing coupled with chromatin immunoprecipitation (ChlP-seq) is becoming a key technology for the study of transcriptional regulation in the context of functional genomics. Due to the overwhelming amount of data generated from ChlP-seq experiments, the ChlP-seq data processing brings many new challenges in the field of bioinformatics. Considering the development of data processing skills largely behind that of the ChlP-seq experiment techniques, it is urgent to give a review on the ChlP-seq data processing for more and more oncoming researchers to build or improve algorithms. This paper provides a brief overview of the ChlP-seq data processing, highlighting the main problems and methods in detail, to allow scientists to understand rapidly and deeply.