This repository was archived by the owner on May 6, 2021. It is now read-only.
Commit 2f28cbc
Fix bug in multi action ppo (#169)
* Remove dimension in log_pa, fix entropy for multi
* Update src/algorithms/policy_gradient/ppo.jl
Co-authored-by: Jun Tian <[email protected]>
1 parent bc64e42 commit 2f28cbc
1 file changed
Lines changed: 2 additions & 3 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 267-269 | 267-269 | unchanged |
| 270 | 270 | modified (1 deletion, 1 addition) |
| 271-273 | 271-273 | unchanged |
| 274 | 274 | modified (1 deletion, 1 addition) |
| 275-277 | 275-277 | unchanged |
| 280-282 | 280-282 | unchanged |
| 283 | | deleted |
| 284-286 | 283-285 | unchanged |