sp
|
2bcb38f6af
|
refactored training without shield
|
1 year ago |
sp
|
36c04f1b81
|
set sb3 device to auto
This automatically detects whether a GPU can be used for training.
|
1 year ago |
sp
|
71854bae01
|
init evalCallback for training with sb3
|
1 year ago |
sp
|
59c795348e
|
changes in sb3 rl training
- included callbacks for initial image and info plotting
- switched to CnnPolicy
- changed GRID_TO_PRISM_BINARY to environment var M2P_BINARY
|
1 year ago |
sp
|
7ccbe8f9bc
|
changes according to refactoring of utils
|
1 year ago |
Thomas Knoll
|
5dcabef8e0
|
added utils classes
|
1 year ago |
Thomas Knoll
|
1528173f58
|
changed iterations to evaluations
|
2 years ago |
Thomas Knoll
|
3dee543e24
|
renaming and notebooks
|
2 years ago |
Thomas Knoll
|
138d917fd6
|
added tune example
refactored and evaluation logging
|
2 years ago |
Thomas Knoll
|
f3747a1479
|
renaming / shield handling changes
|
2 years ago |
Thomas Knoll
|
757fbbcc0d
|
fixed shield generation
worker handling
|
2 years ago |
Thomas Knoll
|
1c2dbf706e
|
changed shield creation to create shield on reset
|
2 years ago |
Thomas Knoll
|
97f7d23cda
|
added rudimental key / door masking
|
2 years ago |
Thomas Knoll
|
b1b014dbd6
|
some refactoring as preparation for sb3 example
added sb3 example
|
2 years ago |