针对传统DNA载体检测和清除工具的不足,实现一种基于智能检测的DNA载体检测新工具.该工具的核心思想是无需预先给定克隆载体序列模版、剪接位点和克隆适配片段等信息,通过建立从载体出现概率到序列权重的映射,把载体片段的检测转化为求给定序列中具有最大权重的k个不重叠相交的子序列问题,并且引入了罚函数控制避免对载体序列的过度清除.实验结果表明该工具能显著提高载体清除的效率和准确性,在超长序列处理的时候更稳定、错误率更低.
A novel tool for DNA vector detection and removal merging intelligent detection is proposed.This approach can automatically find and locate vector segments using the concept of mapping from vector segments detection to the discovery of k maximal scoring non-intersecting sub-segment sets without extra background information such as vector sequence,splice site and clone adapter.Moreover,the utilization of maximum complexity-penalty function can control the sensitivity of vector detection and avoid over trimming.Experiments show that this approach can significantly improve the efficiency and accuracy of vector removal and provide more stable performance than conventional methods do,particularly in high-throughput DNA sequence processing.