Dongli Research Big Data Discovery System (DRDS)

A Survey of Causal Discovery from Non-Temporal Observational Data

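ISSN: 0254-4164
Journal: Chinese Journal of Computers (《计算机学报》)
Date: 0
Classification: TP18 [Automation and Computer Technology - Control Science and Engineering; Automation and Computer Technology - Control Theory and Control Engineering]
Author affiliations: [1] School of Computer Science, Guangdong University of Technology, Guangzhou 510006; [2] Department of Philosophy, Carnegie Mellon University, Pittsburgh, USA 15213; [3] School of Mathematics and Big Data, Foshan University, Foshan, Guangdong 528000
Funding: This work was supported by the NSFC-Guangdong Joint Fund (U1501254), the National Natural Science Foundation of China (61572143), and the Guangdong Science Fund for Distinguished Young Scholars (2014A030306004).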

Keywords: causality, causal discovery, observational data, structure learning, additive noise model, artificial intelligence, machine learning

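Chinese abstract:

Exploring and discovering causal relationships is a core problem of data science, one that holds rich opportunities for scientific discovery and considerable commercial value. Causal discovery methods based on non-temporal observational data can discover causal relationships among variables from passively observed data, and are therefore widely applied across many fields. This class of methods has made great progress over the past thirty years and has become an important route to causal discovery. Starting from three active research topics (causal direction inference, false discovery rate control on high-dimensional data, and latent variable detection on incompletely observed data), this paper gives a detailed introduction to and analysis of existing causal models and assumptions; the three major classes of methods, namely constraint-based methods, methods based on causal function models, and hybrid methods; and the datasets and tools used for validation and evaluation. Constraint-based methods consist mainly of two stages, causal skeleton learning and causal direction inference: first, under the causal Markov assumption, conditional independence tests are used to learn the causal skeleton among the variables; then, following Occam's razor, V-structures are used to determine causal directions. Typical algorithms include the Peter-Clark algorithm and Inductive Causation. The main shortcoming of these methods is that some causal directions cannot be determined, which is the Markov equivalence class problem. Methods based on causal function models instead start from assumptions about the causal mechanism that generates the data: after building a causal function model among the variables, they infer the causal direction between variables from causal assumptions such as the non-Gaussianity of the noise, the independence between the cause variable and the noise, and the independence between the distribution of the cause variable and the gradient of the causal function. Typical algorithms include the Linear NonGaussian Acyclic Model algorithm for linear non-Gaussian acyclic data, the Post-NonLinear algorithm for post-nonlinear data, and the Additive Noise Model, which applies to nonlinear or discrete data. The main shortcoming of these methods is that they require fairly strict assumptions about the causal mechanism of the data, and methods such as the Additive Noise Model are mainly suited to low-dimensional data. Hybrid methods aim to fully exploit the strengths of both constraint-based methods and causal-function-based methods, using constraint-based methods for global structure learning and causal function models for local structure learning and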
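For reference, the three functional causal models named in this abstract are usually written in the following forms. This is a sketch in common textbook notation; the symbols (x, y, f, f_1, f_2, n, B, e) are assumptions chosen for illustration and are not taken from this record.

```latex
\begin{align*}
% Linear Non-Gaussian Acyclic Model: linear relations, acyclic graph,
% mutually independent non-Gaussian noise terms e_i
\mathbf{x} &= B\mathbf{x} + \mathbf{e},
  && B \text{ strictly lower triangular under some causal ordering} \\
% Additive Noise Model: nonlinear function plus noise independent of the cause
y &= f(x) + n,
  && n \perp\!\!\!\perp x \\
% Post-NonLinear model: an additional invertible distortion f_2 applied after the noise is added
y &= f_2\bigl(f_1(x) + n\bigr),
  && n \perp\!\!\!\perp x,\ f_2 \text{ invertible}
\end{align*}
```

In each case the direction is inferred by asking in which direction the fitted noise term is independent of the putative cause.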

English abstract:

Exploring and detecting causal relations among variables is widely regarded as a core problem of data science; it has shown great practical value in recent years and offers numerous opportunities for scientific discovery. Among causal discovery methods, approaches based on non-temporal observational data can recover causal structures from passively observed data in general settings, and have broad application prospects in real-world problems. After 30 years of rapid progress, causal discovery from non-temporal observational data has come to be regarded as an important research direction in causal discovery. In this survey, we discuss three active research topics: causal direction inference, false discovery rate control on high-dimensional data, and latent variable detection in partially observed data. Around these topics, we review and analyze recent achievements in several aspects of causal discovery, focusing on causal models and their basic assumptions, constraint-based approaches, causal function based approaches, hybrid approaches, and the related benchmarks and tools. A typical constraint-based approach is a two-phase method: it first uses conditional independence tests to learn the causal skeleton under the Causal Markov Assumption, and then uses V-structures to determine the causal directions following Occam's razor. Typical constraint-based algorithms include the Peter-Clark (PC) algorithm and the Inductive Causation (IC) algorithm. The main limitation of this class of methods is that they cannot distinguish the underlying causal structure from its statistically equivalent structures, i.e., the algorithms leave some causal directions undetermined. This limitation is known as the Markov equivalence class problem. The causal function based approaches rely on assumptions about the data generating process. After fitting the causal function model among the variables, the causal funct
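The skeleton-learning phase described in the abstract can be illustrated with a minimal sketch. It is not the PC or IC algorithm from the survey: it assumes exactly three variables, linear Gaussian data, and a partial-correlation test with a Fisher z statistic as the conditional independence test. All function names (`pearson`, `partial_corr`, `fisher_z`, `learn_skeleton`, `simulate_chain`) and the threshold `z_crit` are hypothetical choices for this example.

```python
import math
import random
from itertools import combinations


def pearson(a, b):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)


def partial_corr(a, b, c):
    """First-order partial correlation of a and b given c."""
    r_ab, r_ac, r_bc = pearson(a, b), pearson(a, c), pearson(b, c)
    return (r_ab - r_ac * r_bc) / math.sqrt((1 - r_ac ** 2) * (1 - r_bc ** 2))


def fisher_z(r, n, k):
    """Absolute Fisher z statistic for a correlation with k conditioning variables."""
    return abs(math.atanh(r)) * math.sqrt(n - k - 3)


def learn_skeleton(data, z_crit=3.29):
    """Skeleton phase for exactly three variables (illustrative only).

    Start from the complete undirected graph and drop an edge when the pair
    looks independent either marginally or given the remaining variable.
    z_crit=3.29 roughly corresponds to a two-sided significance level of 0.001.
    """
    names = sorted(data)
    n = len(data[names[0]])
    edges = set()
    for u, v in combinations(names, 2):
        (w,) = [x for x in names if x not in (u, v)]
        z_marginal = fisher_z(pearson(data[u], data[v]), n, 0)
        z_conditional = fisher_z(partial_corr(data[u], data[v], data[w]), n, 1)
        if z_marginal > z_crit and z_conditional > z_crit:
            edges.add((u, v))
    return edges


def simulate_chain(n=5000, seed=0):
    """Synthetic chain X -> Y -> Z with unit-variance Gaussian noise."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [xi + rng.gauss(0, 1) for xi in x]
    z = [yi + rng.gauss(0, 1) for yi in y]
    return {"X": x, "Y": y, "Z": z}


if __name__ == "__main__":
    print(sorted(learn_skeleton(simulate_chain())))
```

On the simulated chain, X and Z are correlated marginally but nearly uncorrelated given Y, so the X-Z edge should be dropped while X-Y and Y-Z remain. The chain contains no V-structure, so an orientation phase would leave both edges undirected, which is an instance of the Markov equivalence class problem mentioned above.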

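Projects associated with this paper

Theory and Technology of Large-Scale Information Processing and Intelligent Computing for Smart Cities

Journal papers: 10

Causal Inference on High-Dimensional Incompletely Observed Data and Its Applications

Journal papers: 9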

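Journal papers from the same projects

A Temporal Causal Discovery Algorithm Based on Causal Strength

Web Review Information Extraction Based on Weighted Frequent Subtree Similarity

A Study of the Flipped Classroom Teaching Method for Discrete Mathematics

A Storm-Based Method for Collecting Online Product Review Information

Research on Fine-Grained Sentiment Analysis Methods for Automobile Reviews

A Sentiment Analysis Method Based on Pseudo-Sample Generation with Gaussian Mixture Distributions

Clothing Recognition Based on Clothing Co-occurrence Information and Multi-Task Learning

A Study of Example-Based Teaching of the K-means Clustering Algorithm

A Single-Image Dehazing Algorithm Based on Feature Learning

Minimizing Resource Cost for Camera Stream Scheduling in Video Data Center

SBV: SVG-Based Bioinformatics Visualization Software

KECVS: A Type Annotation and Visualization System for Knowledge Entities in Professional Literature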

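Journal information

Chinese Journal of Computers (《计算机学报》)
Peking University Core Journal (2011 edition)

Supervising body: Chinese Academy of Sciences
Sponsors: China Computer Federation; Institute of Computing Technology, Chinese Academy of Sciences
Editor-in-chief: Sun Ninghui
Address: No. 6 Kexueyuan South Road, Zhongguancun, Beijing
Postal code: 100190
Email: cjc@ict.ac.cn
Telephone: 010-62620695

International standard serial number: ISSN 0254-4164
Domestic unified serial number: CN 11-1826/TP
Postal distribution code: 2-833

Awards:
"Double Benefit" journal of the China Periodical Phalanx (中国期刊方阵)

Indexed in domestic and international databases:
Mathematical Reviews, online edition (USA); Scopus abstract and citation database (Netherlands); Engineering Index (USA); Cambridge Scientific Abstracts (USA); Japan Science and Technology Agency database (Japan); China Science and Technology Core Journals (China); Peking University Core Journals (China), 2000, 2004, 2008, 2011, and 2014 editions

Times cited: 48433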