Commit 0abb10a
Version 0.9.6: Notebooks / Model Performance Evaluation (#10)
* Added:
- Interpreter (Cleaner) with cleaner code / closer to fastai
- To and From Pickle
- to_csv
Notes:
- I tried implementing from_csv, but it looks like this might not be
possible without using system stuff. Not sure when I will get to this.
I have some ideas about saving images / states as files with file paths...
Maybe to_csv should also generate a directory of files?
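The idea in the note above (a CSV that points at state files by path) could look roughly like the sketch below. This is not the fast_rl implementation: the function signatures, the `index.csv` name, the `states/` folder, and the record layout are all assumptions made for illustration, and only the standard library is used.

```python
import csv
import pickle
from pathlib import Path


def to_csv(records, out_dir):
    """Write scalar fields to index.csv and each state to its own pickle file."""
    out_dir = Path(out_dir)
    (out_dir / "states").mkdir(parents=True, exist_ok=True)
    with open(out_dir / "index.csv", "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["step", "reward", "state_path"])
        for rec in records:
            # The CSV stores only a relative path; the payload lives beside it.
            rel = Path("states") / f"{rec['step']:06d}.pkl"
            with open(out_dir / rel, "wb") as sf:
                pickle.dump(rec["state"], sf)
            writer.writerow([rec["step"], rec["reward"], rel.as_posix()])


def from_csv(out_dir):
    """Rebuild the records by following the paths stored in index.csv."""
    out_dir = Path(out_dir)
    records = []
    with open(out_dir / "index.csv", newline="") as f:
        for row in csv.DictReader(f):
            with open(out_dir / row["state_path"], "rb") as sf:
                state = pickle.load(sf)
            records.append({"step": int(row["step"]),
                            "reward": float(row["reward"]),
                            "state": state})
    return records
```

With this layout the CSV stays small and readable, while image-sized states never pass through the CSV itself.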
* Added:
- Group Interpreter for combining model runs
- Initial fixed dqn notebook (sort of)
Fixed:
- recorder callback ordering
- renaming. It seems that fastai has some cool in-notebook test widgets
that we might want to use in the future
* Added:
- Group Interpreter merging
- DQN base notebook
- Interpreters now close envs by default
Fixed:
- env closing <- might be a continuous issue due to different physics engines
* Fixed:
- setup.py fastai needs to be min 1.0.59
* Fixed:
- cpu / device issues.
* Added:
- DQN Group Results
- Reward Metric
Notes:
- I am realizing that we need sum reward smoothing. The graphs are way
too messy.
* Added:
- Analysis property to the group interpretation
* Fixed:
- PER crashing due to containing 0 items
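The commit does not show how the empty-buffer crash was fixed. A minimal sketch of one possible guard is given below, assuming a proportional prioritized replay buffer. `PrioritizedBuffer` and its methods are hypothetical names, not the fast_rl API.

```python
import random


class PrioritizedBuffer:
    """Tiny proportional-priority buffer (illustrative, not the fast_rl class)."""

    def __init__(self, capacity, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha
        self.items = []
        self.priorities = []

    def __len__(self):
        return len(self.items)

    def add(self, item, priority=1.0):
        # Drop the oldest entry once the buffer is full.
        if len(self.items) >= self.capacity:
            self.items.pop(0)
            self.priorities.pop(0)
        self.items.append(item)
        self.priorities.append(priority)

    def sample(self, batch_size):
        # Guard: nothing stored yet, so return an empty batch instead of
        # letting the weighted draw fail on an empty population.
        if not self.items:
            return []
        weights = [p ** self.alpha for p in self.priorities]
        k = min(batch_size, len(self.items))
        return random.choices(self.items, weights=weights, k=k)
```

A caller can then skip the optimization step whenever `sample` returns an empty list.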
* Added:
- Group Interpretation value smoothing
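The smoothing method used by the group interpretation is not stated in the message. The sketch below assumes a trailing moving average; `smooth` and `window` are illustrative names. Dividing by the number of points in the window keeps the output on the same scale as the raw rewards.

```python
def smooth(values, window=5):
    """Trailing moving average; output has the same length and scale as input."""
    if window < 1:
        raise ValueError("window must be >= 1")
    smoothed = []
    running = 0.0
    for i, v in enumerate(values):
        running += v
        if i >= window:
            # Drop the value that just left the window.
            running -= values[i - window]
        # Divide by the number of points actually in the window so early
        # entries are averages too, not partial sums.
        smoothed.append(running / min(i + 1, window))
    return smoothed
```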
* Fixed:
- Value smoothing making the reward values way too big
- Tests take too long. If Image input, just do a shorter fit cycle
- PER batch size not updating
- cuda issues
- Bounds n_possible_values is only calculated when used.
Should make iteration faster.
Added:
- Smoothing for the scalar plotting
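Computing `n_possible_values` only when it is used, as described in the Bounds fix above, can be sketched with `functools.cached_property`. The `Bounds` class here is a hypothetical stand-in with integer `low` / `high`, and the `computations` counter exists only to make the laziness visible.

```python
from functools import cached_property


class Bounds:
    """Hypothetical stand-in for a discrete bounds object."""

    def __init__(self, low, high):
        self.low = low
        self.high = high
        self.computations = 0  # only here to make the laziness observable

    @cached_property
    def n_possible_values(self):
        # Runs on first access only; the result is then stored on the instance.
        self.computations += 1
        return self.high - self.low + 1
```

Creating many `Bounds` objects during iteration then costs nothing extra unless something actually reads the property.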
* More test fixing
* Fixed:
- cuda issues
* Added:
- Lunar Lander performance test
* Added:
- minigrid compat
- normalization module for dqns using Bounds object
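The message does not say which normalization the module applies. As a rough illustration only, the sketch below assumes min-max scaling of each state dimension using lower and upper bounds; `normalize`, `low`, and `high` are hypothetical names.

```python
def normalize(state, low, high):
    """Scale each component of `state` into [0, 1] using per-dimension bounds."""
    out = []
    for x, lo, hi in zip(state, low, high):
        span = hi - lo
        # A zero-width dimension carries no information; map it to 0.
        out.append((x - lo) / span if span else 0.0)
    return out
```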
* Fixed:
- Normalizing cuda error
* Fixed:
- DDPG cuda error
* Fixed:
- pybullet human rendering. Pybullet renders differently from regular
openai envs. Basically, if you want to see what is happening, the render
method needs to be executed prior to reset.
Added:
- DDPG testing
- ddpg env runs
- more results
- more ddpg tests
- walker2d data
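The pybullet rendering fix above is about call order. The sketch below shows one way to enforce that order with a thin wrapper. `RenderBeforeReset` is a hypothetical class, not the wrapper in fast_rl, and it assumes an env whose `render` accepts a `mode` keyword.

```python
class RenderBeforeReset:
    """Wrap an env so that render() is called once before the first reset()."""

    def __init__(self, env, mode="human"):
        self.env = env
        self.mode = mode
        self._render_requested = False

    def reset(self):
        if not self._render_requested:
            # pybullet-style envs need the render request before reset.
            self.env.render(mode=self.mode)
            self._render_requested = True
        return self.env.reset()

    def step(self, action):
        return self.env.step(action)

    def render(self):
        return self.env.render(mode=self.mode)

    def close(self):
        return self.env.close()
```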
* Fixed:
- Possibly fixed pybullet envs crashing. There was an issue where the pybullet wrapper was not being added :(
* Version 0.9.5 mass refactor (#12)
* Added:
- Refactored DQN code
- DQN learner basic
Fixed:
- DQN model crashing
* Added:
- All DQNs pass tests
* Fixed:
- Some dqn / gym_maze / embedding related crashes
- DQN test code and actual DQN tests
* Added:
- Maze heat map interpreter
- Started q value interpreter
* Fixed:
- DDPG GPU issue. Sampling, action, and state objects now support to-device calls.
- DQN GPU issue.
- azure pipeline test
* Updated:
- jupyter notebooks
* Removed:
- old code files
* Fixed:
- metrics, ddpg tests
* Added:
- basic q value plotting
- basic q value plotting for ddpg
* Updated Version
* Changed:
- Setup.py excludes some third-party packages due to a pypi restriction. Need to find a way around this.
* Removed:
- old code from README. Revisions coming.
* Added:
- batch norm toggling. For now / forever defaulted to false
* Version 0.9.5 mass refactor (#13)
* Added:
- revised test script
- Slowly adding tests.
* Fixed:
- somehow trained_learner method in test was completely broken
* Added:
- Interpreter edge control. can also show average line
* Fixed:
- models being all shitty. Apparently, batch norm reaaally screws them up. If you use batch norm, the batch size needs to be massive (128 wasn't large enough). By default, you can kind of turn off batch_norm in the Tabular models, but when given a continuous input they will still have an entry batch norm. I overwrote it and now they work significantly better :)
* Updated:
- gitignore
1 parent 6364d54 commit 0abb10a
File tree
83 files changed (+4726, -2385 lines)
- build
- docs_src
- data
- cartpole_dddqn
- cartpole_ddqn
- cartpole_dqn fixed targeting
- cartpole_dqn
- cartpole_dueling dqn
- halfcheetah_ddpg
- lunarlander_dddqn
- lunarlander_ddqn
- lunarlander_dqn fixed targeting
- lunarlander_dqn
- lunarlander_dueling dqn
- mountaincarcontinuous_ddpg
- mujocoreach_ddpg
- pendulum_ddpg
- walker2d_ddpg
- fast_rl
- agents
- core
- notebooks
- tests
- util
- res
- tests
- data
- cartpole_dqn
- cat
- dog