文件名称:suntton-RL-book-demo
- 所属分类:
- 人工智能/神经网络/遗传算法
- 资源属性:
- [Matlab] [源码]
- 上传时间:
- 2012-11-26
- 文件大小:
- 164kb
- 下载次数:
- 2次
- 提 供 者:
- 朱**
- 相关连接:
- 无
- 下载说明:
- 别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容均来自于网络,请自行研究使用
sutton强化学习书籍的所有matlab例子,学习很有用,不易找到-all matlab demo about sutton s book for reinforcement learning
(系统自动生成,下载前可以参看下载内容)
下载文件列表
suntton强化学习书籍所有代码\suntton非matlab代码说明.doc
...........................\~$内容说明.doc
...........................\内容说明.doc
...........................\Chapter 9 (Planning and Learning)\blocking_mz_Script.m.m
...........................\.................................\do_ex_9_1_exps.m.m
...........................\.................................\dynaQplus_maze.m.m
...........................\.................................\dynaQplus_maze_Script.m.m
...........................\.................................\dynaQ_maze.m.m
...........................\.................................\dynaQ_maze_Script.m.m
...........................\.................................\ex_9_4_dynaQplus.m.m
...........................\.................................\ex_9_4_dynaQplus_Script.m.m
...........................\.................................\mk_ex_9_1_mz.m.m
...........................\.................................\mk_ex_9_2_mz.m.m
...........................\.................................\mk_ex_9_3_mz.m.m
...........................\.................................\plot_mz_policy.m.m
...........................\........8 (Generailzation and Function Approximation)\do_mnt_car_Exps.m
...........................\.....................................................\GetTiles_Mex.C
...........................\.....................................................\GetTiles_Mex_Script.m
...........................\.....................................................\get_ctg.m
...........................\.....................................................\linAppFn.m
...........................\.....................................................\mnt_car_learn.m
...........................\.....................................................\next_state.m
...........................\.....................................................\ret_q_in_st.m
...........................\.....................................................\stp_fn_approx_Script.m
...........................\.....................................................\targetF.m
...........................\.....................................................\tiles.C
...........................\.....................................................\tiles.h
...........................\........7 (Eligibility Traces)\eg_7_5_episode.m
...........................\..............................\eg_7_5_learn_at.m
...........................\..............................\eg_7_5_learn_rt.m
...........................\..............................\eg_7_5_Script.m
...........................\..............................\gw_w_et.m
...........................\..............................\gw_w_et_Script.m
...........................\..............................\rw_accumulating_vs_replacing_Script.m
...........................\..............................\rw_episode.m
...........................\..............................\rw_offline_ntd_learn.m
...........................\..............................\rw_offline_ntd_learn_Script.m
...........................\..............................\rw_offline_tdl_learn.m
...........................\..............................\rw_offline_tdl_learn_Script.m
...........................\..............................\rw_online_ntd_learn.m
...........................\..............................\rw_online_ntd_learn_Script.m
...........................\..............................\rw_online_tdl_learn.m
...........................\..............................\rw_online_tdl_learn_Script.m
...........................\..............................\rw_online_w_et.m
...........................\..............................\rw_online_w_et_Script.m
...........................\..............................\rw_online_w_replacing_traces.m
...........................\........6 (Temporal Difference Learning)\cmpt_arms_err.m
...........................\........................................\eg_6_2_learn.m
...........................\........................................\eg_rw_batch_learn.m
...........
...........................\~$内容说明.doc
...........................\内容说明.doc
...........................\Chapter 9 (Planning and Learning)\blocking_mz_Script.m.m
...........................\.................................\do_ex_9_1_exps.m.m
...........................\.................................\dynaQplus_maze.m.m
...........................\.................................\dynaQplus_maze_Script.m.m
...........................\.................................\dynaQ_maze.m.m
...........................\.................................\dynaQ_maze_Script.m.m
...........................\.................................\ex_9_4_dynaQplus.m.m
...........................\.................................\ex_9_4_dynaQplus_Script.m.m
...........................\.................................\mk_ex_9_1_mz.m.m
...........................\.................................\mk_ex_9_2_mz.m.m
...........................\.................................\mk_ex_9_3_mz.m.m
...........................\.................................\plot_mz_policy.m.m
...........................\........8 (Generailzation and Function Approximation)\do_mnt_car_Exps.m
...........................\.....................................................\GetTiles_Mex.C
...........................\.....................................................\GetTiles_Mex_Script.m
...........................\.....................................................\get_ctg.m
...........................\.....................................................\linAppFn.m
...........................\.....................................................\mnt_car_learn.m
...........................\.....................................................\next_state.m
...........................\.....................................................\ret_q_in_st.m
...........................\.....................................................\stp_fn_approx_Script.m
...........................\.....................................................\targetF.m
...........................\.....................................................\tiles.C
...........................\.....................................................\tiles.h
...........................\........7 (Eligibility Traces)\eg_7_5_episode.m
...........................\..............................\eg_7_5_learn_at.m
...........................\..............................\eg_7_5_learn_rt.m
...........................\..............................\eg_7_5_Script.m
...........................\..............................\gw_w_et.m
...........................\..............................\gw_w_et_Script.m
...........................\..............................\rw_accumulating_vs_replacing_Script.m
...........................\..............................\rw_episode.m
...........................\..............................\rw_offline_ntd_learn.m
...........................\..............................\rw_offline_ntd_learn_Script.m
...........................\..............................\rw_offline_tdl_learn.m
...........................\..............................\rw_offline_tdl_learn_Script.m
...........................\..............................\rw_online_ntd_learn.m
...........................\..............................\rw_online_ntd_learn_Script.m
...........................\..............................\rw_online_tdl_learn.m
...........................\..............................\rw_online_tdl_learn_Script.m
...........................\..............................\rw_online_w_et.m
...........................\..............................\rw_online_w_et_Script.m
...........................\..............................\rw_online_w_replacing_traces.m
...........................\........6 (Temporal Difference Learning)\cmpt_arms_err.m
...........................\........................................\eg_6_2_learn.m
...........................\........................................\eg_rw_batch_learn.m
...........