强化学习精要核心算法与Tensorflow实现及其源代码和个人注解.zip
下载

所属分类:人工智能 > 深度学习
文件大小:96.31 MB
上传日期:2019/7/9 23:11:19
MD5:1eab48bc7e************697cf89e4a
资源说明:主要是强化学习精要核心算法与tensorflow实现的源代码,以及baselines中的代码,还包括个人的一些笔记。
移动页面: MIP AMP

[资源合计] 文件夹:80,文件:409

# 文件名称 大小 最后修改时间
1 《强化学习精要 核心算法与TensorFlow实现》PDF-DESKTOP-1FJU2B5.pdf 88.2 MB 2019/5/23 22:17:38
2 《强化学习精要》源代码\ch8\123123\model.data-00000-of-00001 23.6 MB 2019/5/30 22:15:30
3 《强化学习精要》源代码\ch8\123123\model.meta 138.6 KB 2019/5/30 22:15:31
4 baselines\acktr\kfac.py 44.43 KB 2019/7/3 19:20:19
5 《强化学习精要》源代码\.idea\workspace.xml 39.94 KB 2019/6/3 22:17:58
6 baselines\acktr\__pycache__\kfac.cpython-36.pyc 23.61 KB 2019/7/3 19:04:59
7 baselines\deepq\build_graph.py 20.13 KB 2019/5/24 17:48:40
8 baselines\acer\acer_simple.py 17.2 KB 2019/7/2 20:47:29
9 baselines\ddpg\ddpg.py 17 KB 2019/7/3 16:19:24
10 baselines\deepq\__pycache__\build_graph.cpython-36.pyc 16.53 KB 2019/7/3 19:27:38
11 baselines\deepq\__pycache__\build_graph.cpython-37.pyc 16.5 KB 2019/5/24 17:48:47
12 baselines\__pycache__\logger.cpython-36.pyc 15.39 KB 2019/7/3 16:09:43
13 baselines\__pycache__\logger.cpython-37.pyc 15.35 KB 2019/5/24 17:48:47
14 baselines\common\__pycache__\distributions.cpython-36.pyc 15.21 KB 2019/7/4 11:02:49
15 baselines\common\__pycache__\distributions.cpython-37.pyc 15.04 KB 2019/5/24 17:48:47
16 baselines\her\ddpg.py 14.82 KB 2019/5/24 17:48:40
17 baselines\gail\trpo_mpi.py 14.32 KB 2019/5/24 17:48:40
18 baselines\.idea\workspace.xml 14.21 KB 2019/6/25 22:06:21
19 baselines\her\__pycache__\ddpg.cpython-37.pyc 13.9 KB 2019/5/24 17:48:47
20 baselines\logger.py 13.44 KB 2019/6/27 11:19:25
21 baselines\ddpg\__pycache__\ddpg.cpython-36.pyc 12.22 KB 2019/7/3 16:28:43
22 baselines\ddpg\__pycache__\ddpg.cpython-37.pyc 12.18 KB 2019/5/24 17:48:47
23 baselines\deepq\simple.py 11.44 KB 2019/5/24 17:48:40
24 baselines\acer\__pycache__\acer_simple.cpython-36.pyc 11.14 KB 2019/7/3 17:19:16
25 baselines\trpo_mpi\trpo_mpi.py 11.09 KB 2019/5/24 17:48:40
26 baselines\common\distributions.py 10.82 KB 2019/7/4 11:02:46
27 baselines\common\__pycache__\tf_util.cpython-36.pyc 10.71 KB 2019/7/3 16:45:27
28 baselines\ppo2\ppo2.py 10.68 KB 2019/5/24 17:48:40
29 baselines\common\__pycache__\tf_util.cpython-37.pyc 10.67 KB 2019/5/24 17:48:47
30 《强化学习精要》源代码\ch8\123123\123123.npy 10.58 KB 2019/5/30 22:15:31
31 baselines\a2c\__pycache__\utils.cpython-36.pyc 10.49 KB 2019/7/3 17:18:50
32 baselines\a2c\__pycache__\utils.cpython-37.pyc 10.43 KB 2019/5/24 17:48:47
33 baselines\common\tf_util.py 10.41 KB 2019/5/24 17:48:40
34 baselines\acer\__pycache__\acer_simple.cpython-37.pyc 10.37 KB 2019/5/24 17:48:47
35 baselines\gail\__pycache__\trpo_mpi.cpython-37.pyc 10.33 KB 2019/5/24 17:48:47
36 baselines\a2c\utils.py 9.6 KB 2019/6/27 14:01:54
37 baselines\ppo1\pposgd_simple.py 9.27 KB 2019/5/24 17:48:40
38 baselines\common\__pycache__\atari_wrappers.cpython-36.pyc 9.25 KB 2019/7/3 17:17:12
39 baselines\common\__pycache__\atari_wrappers.cpython-37.pyc 9.23 KB 2019/5/24 17:48:47
40 baselines\gail\run_mujoco.py 9.14 KB 2019/5/24 17:48:40
41 baselines\trpo_mpi\__pycache__\trpo_mpi.cpython-36.pyc 8.85 KB 2019/7/4 10:57:36
42 baselines\trpo_mpi\__pycache__\trpo_mpi.cpython-37.pyc 8.84 KB 2019/5/24 17:48:47
43 baselines\ppo2\__pycache__\ppo2.cpython-36.pyc 8.63 KB 2019/7/3 19:30:58
44 baselines\ppo2\__pycache__\ppo2.cpython-37.pyc 8.61 KB 2019/5/24 17:48:47
45 baselines\ddpg\training.py 8.49 KB 2019/7/3 16:20:57
46 baselines\deepq\__pycache__\simple.cpython-36.pyc 8.45 KB 2019/7/3 19:27:38
47 baselines\deepq\__pycache__\simple.cpython-37.pyc 8.44 KB 2019/5/24 17:48:47
48 baselines\common\__pycache__\misc_util.cpython-36.pyc 8.09 KB 2019/7/3 16:09:58
49 baselines\common\__pycache__\misc_util.cpython-37.pyc 8.08 KB 2019/5/24 17:48:47
50 《强化学习精要》源代码\.DS_Store 8 KB 2018/4/30 16:41:44
51 《强化学习精要》源代码\ch8\DQN.py 7.99 KB 2019/5/23 16:12:30
52 baselines\common\atari_wrappers.py 7.9 KB 2019/5/24 17:48:40
53 baselines\her\rollout.py 7.6 KB 2019/5/24 17:48:40
54 baselines\common\misc_util.py 7.42 KB 2019/5/24 17:48:40
55 baselines\a2c\a2c.py 7.39 KB 2019/5/24 17:48:40
56 《强化学习精要》源代码\ch8\__pycache__\util.cpython-37.pyc 7.22 KB 2019/5/23 16:13:24
57 《强化学习精要》源代码\ch8\__pycache__\util.cpython-36.pyc 7.22 KB 2019/5/24 9:14:50
58 baselines\her\__pycache__\rollout.cpython-37.pyc 6.99 KB 2019/5/24 17:48:47
59 baselines\acktr\acktr_disc.py 6.67 KB 2019/5/24 17:48:40
60 baselines\deepq\__pycache__\replay_buffer.cpython-36.pyc 6.65 KB 2019/7/3 19:27:39
61 baselines\deepq\__pycache__\replay_buffer.cpython-37.pyc 6.63 KB 2019/5/24 17:48:47
62 《强化学习精要》源代码\ch4\logs\events.out.tfevents.1525065168.localhost 6.62 KB 2018/4/30 13:12:48
63 baselines\her\experiment\train.py 6.58 KB 2019/5/24 17:48:40
64 baselines\gail\__pycache__\run_mujoco.cpython-36.pyc 6.58 KB 2019/7/3 19:29:15
65 baselines\gail\__pycache__\run_mujoco.cpython-37.pyc 6.56 KB 2019/5/24 17:48:47
66 《强化学习精要》源代码\ch8\util.py 6.46 KB 2019/5/23 16:06:35
67 baselines\ppo2\__pycache__\policies.cpython-36.pyc 6.3 KB 2019/7/3 17:18:50
68 baselines\ppo2\__pycache__\policies.cpython-37.pyc 6.29 KB 2019/5/24 17:48:47
69 baselines\a2c\__pycache__\policies.cpython-37.pyc 6.29 KB 2019/5/24 17:48:46
70 baselines\deepq\replay_buffer.py 6.28 KB 2019/5/24 17:48:40
71 baselines\ppo1\__pycache__\pposgd_simple.cpython-36.pyc 6.18 KB 2019/7/3 19:30:21
72 baselines\ppo1\__pycache__\pposgd_simple.cpython-37.pyc 6.17 KB 2019/5/24 17:48:47
73 baselines\a2c\__pycache__\a2c.cpython-36.pyc 6.06 KB 2019/7/3 17:18:48
74 baselines\a2c\__pycache__\a2c.cpython-37.pyc 6.04 KB 2019/5/24 17:48:46
75 《强化学习精要》源代码\ch6\.DS_Store 6 KB 2018/4/30 16:30:14
76 《强化学习精要》源代码\ch7\.DS_Store 6 KB 2018/4/30 16:31:28
77 baselines\a2c\policies.py 5.8 KB 2019/5/24 17:48:40
78 baselines\ppo2\policies.py 5.8 KB 2019/5/24 17:48:40
79 baselines\her\experiment\config.py 5.79 KB 2019/5/24 17:48:40
80 《强化学习精要》源代码\ch6\snake.py 5.75 KB 2019/5/22 15:17:04
81 baselines\gail\gail-eval.py 5.75 KB 2019/5/24 17:48:40
82 baselines\acktr\__pycache__\acktr_disc.cpython-36.pyc 5.67 KB 2019/7/3 19:08:59
83 baselines\bench\monitor.py 5.65 KB 2019/7/3 19:59:13
84 baselines\acktr\__pycache__\filters.cpython-36.pyc 5.43 KB 2019/7/3 19:01:18
85 baselines\acktr\__pycache__\filters.cpython-37.pyc 5.36 KB 2019/5/24 17:48:47
86 baselines\common\__pycache__\filters.cpython-36.pyc 5.35 KB 2019/7/3 18:55:54
87 baselines\acktr\acktr_cont.py 5.33 KB 2019/7/3 19:00:29
88 baselines\bench\__pycache__\monitor.cpython-36.pyc 5.33 KB 2019/7/4 8:52:29
89 baselines\bench\__pycache__\monitor.cpython-37.pyc 5.31 KB 2019/5/24 17:48:47
90 baselines\bench\benchmarks.py 5.3 KB 2019/5/24 17:48:40
91 baselines\common\__pycache__\segment_tree.cpython-36.pyc 5.24 KB 2019/7/3 19:27:39
92 baselines\common\__pycache__\segment_tree.cpython-37.pyc 5.22 KB 2019/5/24 17:48:47
93 baselines\ddpg\main.py 5.22 KB 2019/7/3 16:46:21
94 baselines\her\normalizer.py 5.18 KB 2019/5/24 17:48:40
95 baselines\gail\behavior_clone.py 5.07 KB 2019/5/24 17:48:40
96 baselines\ddpg\__pycache__\training.cpython-36.pyc 5 KB 2019/7/3 16:28:43
97 baselines\ddpg\__pycache__\training.cpython-37.pyc 4.99 KB 2019/5/24 17:48:47
98 baselines\bench\__pycache__\benchmarks.cpython-36.pyc 4.99 KB 2019/7/3 16:09:43
99 baselines\bench\__pycache__\benchmarks.cpython-37.pyc 4.91 KB 2019/5/24 17:48:47
100 baselines\her\experiment\__pycache__\train.cpython-37.pyc 4.89 KB 2019/5/24 17:48:47
101 baselines\her\__pycache__\normalizer.cpython-36.pyc 4.78 KB 2019/7/3 19:29:46
102 baselines\her\__pycache__\normalizer.cpython-37.pyc 4.76 KB 2019/5/24 17:48:47
103 《强化学习精要》源代码\ch6\__pycache__\snake.cpython-37.pyc 4.75 KB 2019/5/21 10:49:47
104 baselines\common\segment_tree.py 4.75 KB 2019/5/24 17:48:40
105 baselines\common\vec_env\__pycache__\__init__.cpython-36.pyc 4.75 KB 2019/7/3 17:18:48
106 baselines\common\vec_env\__pycache__\__init__.cpython-37.pyc 4.73 KB 2019/5/24 17:48:47
107 baselines\gail\__pycache__\gail-eval.cpython-37.pyc 4.72 KB 2019/5/24 17:48:47
108 《强化学习精要》源代码\ch6\policy_iter.py 4.63 KB 2019/5/21 22:25:41
109 baselines\acktr\__pycache__\acktr_cont.cpython-36.pyc 4.62 KB 2019/7/3 19:00:32
110 baselines\common\__pycache__\schedules.cpython-36.pyc 4.59 KB 2019/7/3 19:27:38
111 baselines\common\__pycache__\schedules.cpython-37.pyc 4.57 KB 2019/5/24 17:48:47
112 baselines\gail\adversary.py 4.56 KB 2019/5/24 17:48:40
113 《强化学习精要》源代码\ch7\__pycache__\snake.cpython-37.pyc 4.42 KB 2019/5/22 14:51:27
114 baselines\gail\dataset\mujoco_dset.py 4.37 KB 2019/5/24 17:48:40
115 baselines\gail\__pycache__\behavior_clone.cpython-37.pyc 4.33 KB 2019/5/24 17:48:47
116 baselines\acer\buffer.py 4.28 KB 2019/5/24 17:48:40
117 baselines\gail\dataset\__pycache__\mujoco_dset.cpython-36.pyc 4.26 KB 2019/7/3 19:28:46
118 baselines\gail\dataset\__pycache__\mujoco_dset.cpython-37.pyc 4.24 KB 2019/5/24 17:48:47
119 baselines\her\__pycache__\util.cpython-36.pyc 4.18 KB 2019/7/3 19:29:46
120 baselines\her\__pycache__\util.cpython-37.pyc 4.16 KB 2019/5/24 17:48:47
121 baselines\her\experiment\__pycache__\config.cpython-37.pyc 4.05 KB 2019/5/24 17:48:47
122 baselines\her\util.py 3.97 KB 2019/5/24 17:48:40
123 baselines\gail\__pycache__\adversary.cpython-36.pyc 3.93 KB 2019/7/3 19:28:46
124 baselines\gail\__pycache__\adversary.cpython-37.pyc 3.9 KB 2019/5/24 17:48:47
125 baselines\her\__pycache__\replay_buffer.cpython-36.pyc 3.8 KB 2019/7/3 19:29:46
126 baselines\her\__pycache__\replay_buffer.cpython-37.pyc 3.78 KB 2019/5/24 17:48:47
127 baselines\deepq\__pycache__\utils.cpython-36.pyc 3.76 KB 2019/7/3 19:27:39
128 baselines\deepq\__pycache__\utils.cpython-37.pyc 3.75 KB 2019/5/24 17:48:47
129 baselines\her\experiment\__pycache__\plot.cpython-37.pyc 3.65 KB 2019/5/24 17:48:47
130 baselines\ddpg\__pycache__\main.cpython-37.pyc 3.63 KB 2019/5/24 17:48:47
131 baselines\common\schedules.py 3.62 KB 2019/5/24 17:48:40
132 baselines\common\mpi_running_mean_std.py 3.59 KB 2019/7/3 17:11:54
133 baselines\common\__pycache__\cmd_util.cpython-36.pyc 3.59 KB 2019/7/3 17:17:12
134 baselines\her\replay_buffer.py 3.58 KB 2019/5/24 17:48:40
135 baselines\common\__pycache__\cmd_util.cpython-37.pyc 3.57 KB 2019/5/24 17:48:47
136 baselines\acktr\policies.py 3.54 KB 2019/5/24 17:48:40
137 《强化学习精要》源代码\ch7\monte_carlo.py 3.54 KB 2019/5/22 18:42:31
138 《强化学习精要》源代码\ch7\snake.py 3.52 KB 2019/5/22 14:44:09
139 baselines\her\experiment\plot.py 3.51 KB 2019/5/24 17:48:40
140 baselines\deepq\models.py 3.48 KB 2019/5/24 17:48:40
141 baselines\common\vec_env\__pycache__\subproc_vec_env.cpython-36.pyc 3.47 KB 2019/7/3 17:18:48
142 baselines\common\vec_env\__pycache__\subproc_vec_env.cpython-37.pyc 3.43 KB 2019/5/24 17:48:47
143 《强化学习精要》源代码\ch4\logs\events.out.tfevents.1525065162.localhost 3.43 KB 2018/4/30 13:12:42
144 baselines\__pycache__\results_plotter.cpython-37.pyc 3.42 KB 2019/5/24 17:48:47
145 baselines\ppo2\run_atari.py 3.42 KB 2019/7/2 21:14:55
146 baselines\acktr\kfac_utils.py 3.31 KB 2019/5/24 17:48:40
147 baselines\deepq\experiments\custom_cartpole.py 3.27 KB 2019/5/24 17:48:40
148 baselines\common\__pycache__\mpi_running_mean_std.cpython-36.pyc 3.26 KB 2019/7/3 17:11:58
149 baselines\common\__pycache__\mpi_running_mean_std.cpython-37.pyc 3.23 KB 2019/5/24 17:48:47
150 baselines\ddpg\__pycache__\models.cpython-36.pyc 3.22 KB 2019/7/3 17:14:49
151 baselines\ddpg\__pycache__\models.cpython-37.pyc 3.2 KB 2019/5/24 17:48:47
152 baselines\ddpg\__pycache__\noise.cpython-36.pyc 3.19 KB 2019/7/3 16:45:27
153 《强化学习精要》源代码\ch7\policy_iter.py 3.18 KB 2019/5/22 14:56:10
154 baselines\ddpg\__pycache__\noise.cpython-37.pyc 3.17 KB 2019/5/24 17:48:47
155 baselines\acer\__pycache__\buffer.cpython-36.pyc 3.11 KB 2019/7/3 17:19:18
156 baselines\gail\__pycache__\mlp_policy.cpython-36.pyc 3.1 KB 2019/7/3 19:28:44
157 baselines\gail\__pycache__\mlp_policy.cpython-37.pyc 3.1 KB 2019/5/24 17:48:47
158 baselines\acer\__pycache__\buffer.cpython-37.pyc 3.08 KB 2019/5/24 17:48:47
159 《强化学习精要》源代码\ch7\__pycache__\monte_carlo.cpython-37.pyc 3.08 KB 2019/5/22 20:37:09
160 baselines\common\vec_env\__init__.py 3.05 KB 2019/5/24 17:48:40
161 baselines\acktr\__pycache__\value_functions.cpython-36.pyc 3.02 KB 2019/7/3 18:55:54
162 baselines\results_plotter.py 3.01 KB 2019/5/24 17:48:40
163 baselines\common\__pycache__\mpi_adam.cpython-36.pyc 3 KB 2019/7/3 16:11:36
164 baselines\common\__pycache__\mpi_adam.cpython-37.pyc 2.99 KB 2019/5/24 17:48:47
165 《强化学习精要》源代码\ch6\__pycache__\policy_iter.cpython-37.pyc 2.98 KB 2019/5/22 9:42:11
166 baselines\common\__pycache__\math_util.cpython-36.pyc 2.96 KB 2019/7/3 16:09:44
167 baselines\ppo1\__pycache__\mlp_policy.cpython-36.pyc 2.95 KB 2019/7/3 19:30:33
168 baselines\ppo1\__pycache__\mlp_policy.cpython-37.pyc 2.95 KB 2019/5/24 17:48:47
169 baselines\common\__pycache__\math_util.cpython-37.pyc 2.93 KB 2019/5/24 17:48:47
170 baselines\acer\__pycache__\policies.cpython-36.pyc 2.93 KB 2019/7/3 17:19:18
171 baselines\acer\__pycache__\policies.cpython-37.pyc 2.91 KB 2019/5/24 17:48:47
172 baselines\common\cmd_util.py 2.89 KB 2019/5/24 17:48:40
173 baselines\gail\mlp_policy.py 2.86 KB 2019/5/24 17:48:40
174 baselines\deepq\__pycache__\models.cpython-36.pyc 2.84 KB 2019/7/3 19:27:38
175 baselines\deepq\__pycache__\models.cpython-37.pyc 2.83 KB 2019/5/24 17:48:47
176 《强化学习精要》源代码\ch6\__pycache__\value_iter.cpython-37.pyc 2.83 KB 2019/5/22 10:41:36
177 baselines\ddpg\__pycache__\memory.cpython-36.pyc 2.81 KB 2019/7/3 16:45:27
178 baselines\common\vec_env\subproc_vec_env.py 2.8 KB 2019/5/24 17:48:40
179 《强化学习精要》源代码\ch6\value_iter.py 2.78 KB 2019/5/21 20:39:58
180 baselines\ddpg\__pycache__\memory.cpython-37.pyc 2.78 KB 2019/5/24 17:48:47
181 baselines\ppo1\mlp_policy.py 2.78 KB 2019/5/24 17:48:40
182 baselines\her\her.py 2.76 KB 2019/5/24 17:48:40
183 baselines\common\mpi_adam.py 2.72 KB 2019/5/24 17:48:40
184 baselines\acktr\filters.py 2.71 KB 2019/7/3 19:01:15
185 baselines\deepq\utils.py 2.71 KB 2019/5/24 17:48:40
186 baselines\ppo1\__pycache__\cnn_policy.cpython-36.pyc 2.69 KB 2019/7/3 19:30:22
187 baselines\ppo1\__pycache__\cnn_policy.cpython-37.pyc 2.69 KB 2019/5/24 17:48:47
188 baselines\ddpg\memory.py 2.68 KB 2019/7/3 16:19:51
189 baselines\filters.py 2.68 KB 2019/7/3 18:58:13
190 baselines\acer\policies.py 2.67 KB 2019/5/24 17:48:40
191 baselines\trpo_mpi\__pycache__\nosharing_cnn_policy.cpython-36.pyc 2.64 KB 2019/7/4 10:57:48
192 baselines\acktr\value_functions.py 2.64 KB 2019/5/24 17:48:40
193 baselines\trpo_mpi\__pycache__\nosharing_cnn_policy.cpython-37.pyc 2.64 KB 2019/5/24 17:48:47
194 《强化学习精要》源代码\ch7\__pycache__\policy_iter.cpython-37.pyc 2.57 KB 2019/5/22 17:14:47
195 baselines\acktr\__pycache__\kfac_utils.cpython-36.pyc 2.52 KB 2019/7/3 17:19:48
196 baselines\acktr\__pycache__\kfac_utils.cpython-37.pyc 2.49 KB 2019/5/24 17:48:47
197 baselines\common\__pycache__\dataset.cpython-36.pyc 2.47 KB 2019/7/3 16:09:44
198 baselines\common\__pycache__\dataset.cpython-37.pyc 2.46 KB 2019/5/24 17:48:47
199 baselines\ddpg\models.py 2.43 KB 2019/7/3 17:14:46
200 baselines\deepq\experiments\__pycache__\custom_cartpole.cpython-37.pyc 2.4 KB 2019/5/24 17:48:47

请留下有营养的评论,广告灌水一律拉黑处理,谢谢合作!