东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

面向时序基因表达数据的双聚类算法

ISSN号：1001-3695
期刊名称：《计算机应用研究》
时间：0
分类：TP311[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术]
作者机构：[1]西北工业大学计算机学院,西安710129
相关基金：国家“973”计划资助项目（2012CB316203）; 国家自然科学基金资助项目（61272121）

关键词：时间点连续, 基因共表达, 一致趋势, 双聚类, time-continuous, gene co-expression, coherent evolution, bicluster

中文摘要：

对某种生物而言,在某段连续时间内共表达的基因预示着其在同时完成某一生物过程或其间存在某种调控关系;而目前在基因表达数据上的大多数双聚类算法都是针对非连续样本点的情况提出的,对于连续样本点（样本之间存在顺序关系）的情况很少涉及。因此在考虑连续样本点的情况下,提出了一种在时序基因表达数据上挖掘极大一致趋势共表达基因集的双聚类算法TCBicluster。在每个时间点产生行常量共表达基因集,进而构造以时间点为顶点、以相邻时间点间满足一致性要求的共表达基因集为边的权值图,并采用扩展连续时间点的方式对权值图进行双聚类挖掘,使用有效的剪枝策略提高算法效率。实验证明,TCBicluster算法比RAP及CC-TSB算法更能有效挖掘极大一致趋势共表达双聚类且具有较高的效率和良好的可扩展性。

英文摘要：

For one creature, if some genes on it show co-expressed in a certain continuous time interval, they are very likely to complete a biological process simultaneously or exist some regulation relationships. At present, most of the bicluster algo- rithms in gene expression data were proposed under the discontinuous samples. That is, the bicluster algorithms for samples existing a sequential relationship were very few. For this reason, this paper proposed an efficient time-continuous bicluster al- gorithm TCBieluster to mine the maximal coherent evolution and co-expression gene sets from the time-series microarray gene expression dataset. First, TCBicluster algorithm generated all the constant row co-expression gene sets for every time point. Then, it built the weighted range multigraph which used the time points as its vertexes and the co-expression gene sets with co- herent evolution between two adjacent time points as its edges. Finally, TCBicluster expanded the multigraph with a mode that only considered the behind adjacent vertex as the candidate. In addition, it used some efficient pruning techniques to improve the efficiency. The experimental results show that the maximal coherent evolution and co-expression biclusters mined by TCBi- cluster algorithm are of better quality than RAP and CC-TSB. Simultaneously, TCBicluster algorithm also indicates higher mining efficiency and better extensibility.

同期刊论文项目

挖掘蛋白质网络动态特征的方法研究

期刊论文 4

同项目期刊论文

Constrained query of order-preserving submatrix in gene expression data

分布式大数据多函数依赖冲突检测

基于离散时序基因表达数据的双聚类算法

期刊信息

《计算机应用研究》
北大核心期刊（2011版）

主管单位:四川省科学技术厅
主办单位:四川省计算机研究院
主编：刘营
地址：成都市成科西路3号
邮编：610041
邮箱：arocmag@163.com
电话：028-85210177 85249567

国际标准刊号：ISSN：1001-3695
国内统一刊号：ISSN：51-1196/TP
邮发代号:62-68

获奖情况:
第二届国家期刊奖百种重点科技期刊,国内计算技术类重点核心期刊,国内外著名数据库收录期刊

国内外数据库收录:
俄罗斯文摘杂志,波兰哥白尼索引,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:60049