东篱科研大数据发现系统（DRDS）

欢迎您！东篱公司退出

申报数据库
1. 申报指南
立项数据库
成果数据库
1. 期刊论文
2. 会议论文
3. 著作
4. 专利
项目获奖数据库

位置：成果数据库 > 期刊 > 期刊详情页

基于性能势的Markov控制过程双时间尺度仿真算法

ISSN号：1004-731X
期刊名称：《系统仿真学报》
时间：0
分类：TP391.9[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]中国科学技术大学自动化系,合肥230027
相关基金：国家自然科学基金（60574065,60774038）

作者：鲍秉坤[1], 殷保群[1], 奚宏生[1]

关键词： Markov控制过程, 性能势, 双时间尺度, 随机逼近, Markov decision process, performance potential, two time-scale, stochastic approximation

中文摘要：

在基于性能势的随机逼近方法中引入双时间尺度的概念，提出了离散时间Markov控制过程的基于性能势的双时间尺度仿真梯度算法，弥补了传统算法中每步更新算法更新频率过快和更新环更新算法更新频率过慢的不足，并利用三个数值例子来说明双时间尺度更新算法在计算复杂度、收敛速度和收敛精度上的优势。

英文摘要：

A novel two time-scale simulation-based gradient algorithm based on performance potential for discrete time Markov decision process was proposed, by introducing the concept of two time-scale into the performance potential based stochastic approximation. This algorithm tackles the limitations in classical approaches that the every-update simulation- based gradient algorithm updates too frequently, and the regenerative-update gradient algorithm updates too infrequently. Three numerical examples illustrate the superiority of two time-scale simulation-based gradient algorithm in computational complexity, convergence speed and convergence precision.

同期刊论文项目

面向信息服务系统的半Markov切换空间控制过程

期刊论文 22 会议论文 9

隐Markov过程的性能灵敏度分析与优化

期刊论文 20

同项目期刊论文

A Markov decision model for low-layer readaheads

Adaptive optimisation of timeout policy for dynamic power management based on semi-Markov control pr

Two time-scale gradient approximation algorithm for adaptive Markov reward processes

一种支持并发访问流的文件预取算法

A relaxing bandwidth smoothing schedule for transmitting prerecorded VBR video in periodic network

具有马氏跳跃参数的切削加工系统控制问题研究

Dynamic File Grouping for Load Balancing in Streaming Media Clustered Server Systems

Generalized PCRTT offline bandwidth smoothing based on SVM and systematic video segmentation

The equivalent relation between timeout and stochastic policies for dynamic power management

Partially Observalbe Markov Decision Processes and Performance Sensitivity Analysis

动态电源管理超时策略自适应优化算法

Estimating the Delay-time for the Stability of Markovian Jump Bilinear Systems with Saturating Actuators

基于Chord网络动态数据的Skyline计算

一种基于访问概率预分配的流媒体集群动态副本更新算法

多业务流媒体服务系统的自适应服务组合算法

动态电源管理超时策略与随机型策略的等效关系

基于用户感受质量的流媒体服务器VCR功能实现方法

Sensitivity analysis and estimates of the performance for M/G/1 queueing systems

Performance optimization of semi-Markov decision processes with discounted-cost criteria

A state aggregation approach to singularly perturbed Markov reward processes

连续时间POMDP的策略梯度估计

基于观测的POMDP优化算法及其仿真

基于POMDP模型的机器人行动的仿真优化

动态电源管理超时策略自适应优化算法

Partially observable Markov decision processes and performance sensitivity analysis

Error bounds of optimization algorithms for semi-Markov decision processes

一类分层非结构化P2P系统的随机切换模型

基于POMDP的VOD接入控制建模与仿真

动态电源管理的随机切换模型与策略优化

半Markov控制过程基于性能势仿真的并行优化算法

动态电源管理超时策略与随机型策略的等效关系

非线性采样观测器的误差分析

基于耦合技术计算Markov链性能势的仿真算法

基于双层P2P架构的VoD系统

基于CDN和P2P的分布式网络存储系统

无线多媒体通信网适应带宽配置在线优化算法

期刊信息

《系统仿真学报》
北大核心期刊（2011版）

主管单位:中国航天科工集团公司
主办单位:北京仿真中心中国仿真学会
主编：李伯虎
地址：北京市海淀区永定路50号院
邮编：100039
邮箱：simu-xb@vip.sina.com
电话：010-88527147

国际标准刊号：ISSN：1004-731X
国内统一刊号：ISSN：11-3092/V
邮发代号:82-9

获奖情况:

国内外数据库收录:
美国化学文摘（网络版）,荷兰文摘与引文数据库,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）

被引量:51729