东篱科研大数据发现系统（DRDS）

欢迎您！东篱公司退出

申报数据库
1. 申报指南
立项数据库
成果数据库
1. 期刊论文
2. 会议论文
3. 著作
4. 专利
项目获奖数据库

位置：成果数据库 > 期刊 > 期刊详情页

基于聚类状态隶属度的动态调度Q-学习

ISSN号：1002-0470
期刊名称：《高技术通讯》
时间：0
分类：O24[理学—计算数学;理学—数学]
作者机构：[1]哈尔滨工业大学机电工程学院,哈尔滨150001
相关基金：国家自然科学基金（60572174）和863计划（2008AA042401）资助项目.

作者：王国磊[1], 钟诗胜[1], 林琳[1]

关键词：动态调度, Q-学习, 调度规则选择, 状态聚类, 隶属度, dynamic scheduling, Q-learning, dispatching rule selection, state clustering, membership

中文摘要：

提出了一种利用Q-学习解决动态单机调度环境下的自适应调度规则选择的方法。该方法针对动态调度环境中系统状态空间大，Q-学习不易收敛的特点，首先提取系统状态特征，对系统状态进行合理聚类，有效地降低系统状态空间维数，然后在学习过程中令设备Agent根据瞬时状态向量对各聚类状态的隶属度做出综合判断，选择合适规则，并在每次迭代后根据隶属度将动作奖惩分配给各聚类状态的动作值函数。仿真结果表明，所提Q-学习算法较之传统Q-学习具有更快的收敛速度，提高了设备Agent的动态调度规则选择能力。

英文摘要：

Q-learning was applied to resolution of the adaptive dispatching rule selection problem under dynamic single-machine scheduling environment. Considering that Q-learning is hard to converge due to the large scale of the system state space during dynamic scheduling, the method extracts several state features of the system firstly, so that the dimension of the system state space can be reduced through the fuzzy clustering method. Then the machine agent can choose proper rules based on the transient system state membership of all the clustering system states. Each time after machine agent performs an action, the reward is assigned to all the value functions of the same rule in different clustering system states according to the fuzzy membership. The simulation results demonstrate that the proposed algorithm has a faster convergence rate, compared with the traditional Q-learning algorithm, and can improve the dynamic dispatching rule selection ability of machine agent.

同期刊论文项目

基于过程神经网络的飞机发动机性能衰退预测理论与应用研究

期刊论文 32 会议论文 1

同项目期刊论文

基于双隐层过程神经网络的飞机发动机故障检测

基于确定时间连续Petri网的航空发动机总装资源调度

Convergence analysis of the learning algorithm for parallel feedforward process neural network

Time series prediction using wavelet process neural network

面向航空发动机全寿命管理的航线数据处理系统

基于元模型的模糊Petri网反向传播学习算法

基于过程神经网络的航空发动机排气温度预测

基于对传过程神经网络的航空发动机转子仿真故障诊断研究

多分辨小波过程神经网络及其应用研究

连续小波过程神经网络及其仿真研究

Continuous wavelet process neural network and its application

基于小波过程神经网络的飞机发动机状态监视

基于过程神经网络的时间序列预测及其应用研究

Time series prediction based on Elman process neural network and its application

Approximation capability analysis of parallel process neural network with application to aircraft en

Elman-style Process Neural Network with Time-varying Output Functions and Its Application

具有时变输入输出函数的反馈过程神经网络及应用

基于离散时间最优控制的航空发动机装配序列规划

基于递归过程神经网络的航空发动机滑油系统状态监测

民用航空发动机滑油金属含量预测及其控制研究

基于过程神经网络与气动热力参数的航空发动机状态监视

改进BP算法在过程神经网络中的应用

Hybrid Recurrent Process Neural Network for Aero Engine Condition Monitoring

基于时变阈值过程神经网络的太阳黑子数预测

一种基于过程神经元网络的非线性动态系统辨识模型及应用

一种反馈过程神经元网络模型及在动态信号分类中的应用

基于神经网络的平面连杆机构MATLAB仿真

一种改进的量子粒子群优化算法及其应用

平面连杆机构等效力矩和转动惯量数学模型

Time series prediction using wavelet process neural network

一种过程支持向量机模型及其若干理论性质

期刊信息

《高技术通讯》
北大核心期刊（2011版）

主管单位:中华人民共和国科学科技部
主办单位:中国科学技术信息研究所
主编：赵志耘
地址：北京市三里河路54号
邮编：100045
邮箱：hitech@istic.ac.cn
电话：010-68514060 68598272

国际标准刊号：ISSN：1002-0470
国内统一刊号：ISSN：11-2770/N
邮发代号:82-516

获奖情况:
《中国科学引文数据》刊源,《中国科技论文统计与分析》刊源

国内外数据库收录:
美国化学文摘（网络版）,荷兰文摘与引文数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,英国英国皇家化学学会文摘

被引量:12178