东篱科研大数据发现系统（DRDS）

欢迎您！东篱公司退出

申报数据库
1. 申报指南
立项数据库
成果数据库
1. 期刊论文
2. 会议论文
3. 著作
4. 专利
项目获奖数据库

位置：成果数据库 > 期刊 > 期刊详情页

非负费用折扣半马氏决策过程

ISSN号：0583-1431
期刊名称：《数学学报》
时间：0
分类：O211.62[理学—概率论与数理统计;理学—数学] O231.3[理学—运筹学与控制论;理学—数学]
作者机构：[1]中山大学数学与计算科学学院,广州510275
相关基金：国家自然科学基金资助项目（60874004 10925107）

作者：黄永辉[1], 郭先平[1]

关键词：半马氏决策过程, 折扣费用, 最优策略, Semi-Markov decision processes, discounted cost, optimal policy

中文摘要：

本文考虑可数状态非负费用的折扣半马氏决策过程.首先在给定半马氏决策核和策略下构造一个连续时间半马氏决策过程,然后用最小非负解方法证明值函数满足最优方程和存在ε-最优平稳策略,并进一步给出最优策略的存在性条件及其一些性质.最后,给出了值迭代算法和一个数值算例.

英文摘要：

This paper deals with discounted semi-Markov decision processes with countable states and nonnegative costs.We first construct a continuous-time semi-Markov decision process under a given semi-Markov decision kernel and each policy. Then,we prove that the value function satisfies the optimality equation and there exists an e-optimal stationary policy by using a minimum nonnegative solution approach,and further give conditions for the existence of optimal policies as well as some properties of optimal policies.Finally,a value iteration algorithm for computing the value function is developed and a numerical example is given.

同期刊论文项目

马氏过程与随机最优化

期刊论文 26

随机动态系统高级最优控制的研究

期刊论文 23 著作 1

同项目期刊论文

Optimal control for probabilistic Boolean networks

NEW DISCOUNT AND AVERAGE OPTIMALITY CONDITIONS FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES

Bias optimality for multichain continuous-time Markov decision processes

Zero-sum stochastic games with average payoffs: New optimality conditions

DISCOUNTED CONTINUOUS-TIME CONSTRAINED MARKOV DECISION PROCESSES IN POLISH SPACES

Total reward criteria for unconstrained/constrained continuous-time Markov decision processes

Discounted Continuous-Time Markov Decision Processes with Constraints: Unbounded Transition and Loss

受控排队系统的平均最优与约束平均最优

截断前马氏过程与截断后马氏过程

Moments of the maximum of normed partial sums of ρ^--mixing random variables

Total reward criteria for unconstrained/constrained continuous-time Markov decision processes

Performance analysis for controlled semi-Markov systems with application to maintenance

New optimality conditions for average-payoff continuous-time Markov games in Polish spaces

Finite horizon semi-Markov decision processes with application to maintenance systems

First passage models for denumerable semi-Markov decision processes with nonnegative discounted cost

New sufficient conditions for average optimality in continuous-time Markov decision processes

Denumerable continuous-time Markov decision processes with multiconstraints on average costs

A mean-variance optimization problem for discounted Markov decision processes

Comparative effectiveness research on patients with acute ischemic stroke using Markov decision proc

New discounted and average optimality conditions for continuous-time Markov decision processes

Nonzero-sum games for continuous-time Markov chains with unbounded transition and average payoff rat

Linear programming and constrained average optimality for general continuous-time Markov decision pr

Minimum risk probability for finite horizon semi-Markov decision processes

Absorbing continuous-time Markov decision processes with total cost criteria

First passage optimality for continuous-time Markov decision processes with varying discount factors

Markov decision processes with state-dependent discount factors and unbounded rewards/costs

Nonstationary discrete-time deterministic and stochastic control systems with infinite horizon

Nonstationary discrete-time deterministic and stochastic control systems: Bounded and unbounded case

期刊信息

《数学学报》
北大核心期刊（2011版）

主管单位:中国科学院
主办单位:中国科学院数学与系统科学研究院数学研究院
主编：李炳仁
地址：北京市海淀区中关村东路55号
邮编：100080
邮箱：Actamath@amss.ac.cn
电话：010-62551910

国际标准刊号：ISSN：0583-1431
国内统一刊号：ISSN：11-2038/O1
邮发代号:2-502

获奖情况:
1996年中科院优秀科技期刊二等奖,1997年全国优秀科技期刊二等奖,2000年中科院优秀科技期刊二等奖

国内外数据库收录:
美国数学评论（网络版）,德国数学文摘,荷兰文摘与引文数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:9981