文件名称:MDP-model-of-MPNP
介绍说明--下载内容均来自于网络,请自行研究使用
在matlab平台上,针对多周期报童问题,采用值迭代算法、策略迭代算法和强化学习算法求解MDP模型的实例-This is an example presentting how to apply value-iteration algorithm,policy-iteration algorithm and reinforcement learning algorithm to MDP model, which aims to solve the multi-period newsboy problem.
(系统自动生成,下载前可以参看下载内容)
下载文件列表
多周期报童问题的MDP建模及求解\draw.asv
.............................\draw.m
.............................\drawFigure.asv
.............................\drawFigure.m
.............................\initial.asv
.............................\initial.m
.............................\initial2.m
.............................\main.asv
.............................\main.m
.............................\policyIteration.asv
.............................\policyIteration.m
.............................\QLearning.asv
.............................\QLearning.m
.............................\revenueMDP.asv
.............................\revenueMDP.m
.............................\revenuesS.m
.............................\reward.asv
.............................\reward.m
.............................\transitionMatrix.asv
.............................\transitionMatrix.m
.............................\valueIteration.asv
.............................\valueIteration.m
多周期报童问题的MDP建模及求解
.............................\draw.m
.............................\drawFigure.asv
.............................\drawFigure.m
.............................\initial.asv
.............................\initial.m
.............................\initial2.m
.............................\main.asv
.............................\main.m
.............................\policyIteration.asv
.............................\policyIteration.m
.............................\QLearning.asv
.............................\QLearning.m
.............................\revenueMDP.asv
.............................\revenueMDP.m
.............................\revenuesS.m
.............................\reward.asv
.............................\reward.m
.............................\transitionMatrix.asv
.............................\transitionMatrix.m
.............................\valueIteration.asv
.............................\valueIteration.m
多周期报童问题的MDP建模及求解