sp
|
a4d360f474
|
do not allowed movement when shield is empty
|
10 months ago |
sp
|
1a3ad544b7
|
added info picked_up
|
10 months ago |
sp
|
59f0011614
|
add info about opened_door
|
11 months ago |
sp
|
775341c3af
|
fixed typo in info callback
|
11 months ago |
sp
|
62fddc1e27
|
fixed bug in info callback
|
11 months ago |
sp
|
b9ed2ac234
|
changed some callbacks
|
11 months ago |
sp
|
24dec631aa
|
record gifs when evaluating
This currently does not use tensorboards feature, we have to leave this
as a TBD. The gifs will be left in the experiments log_dir
|
11 months ago |
sp
|
29799bc52f
|
resynthesize on reset is default False
|
11 months ago |
sp
|
2bcb38f6af
|
refactored training without shield
|
11 months ago |
sp
|
315b0c8e7d
|
added useful sb3 callbacks
|
11 months ago |
sp
|
62c1198f25
|
removed observation changes from shielding wrapper
|
11 months ago |
sp
|
372006a1da
|
major refactor in utils
- introduced common_parser for arguments
- the shield dict uses minigrid.core.State instead of strings
- switched shield query to minigrid get_symbolic_state
|
11 months ago |
Thomas Knoll
|
1cbaac75cb
|
cleanups
|
11 months ago |
Thomas Knoll
|
5dcabef8e0
|
added utils classes
|
11 months ago |