Thomas Knoll
|
bacacdda7d
|
changes to testall
|
11 months ago |
Thomas Knoll
|
d891211c5b
|
added configs and multiple test scripts
|
11 months ago |
Thomas Knoll
|
7ec988da12
|
log dir change
|
11 months ago |
Thomas Knoll
|
45d110b199
|
added probability arguments
|
11 months ago |
Thomas Knoll
|
af0c4e2f21
|
shield value script 15
|
11 months ago |
Thomas Knoll
|
65ea61dd18
|
added config file
|
11 months ago |
Thomas Knoll
|
5c9064ecbe
|
randomize start
|
11 months ago |
Thomas Knoll
|
80cbbe5a3a
|
minor changes
|
12 months ago |
Thomas Knoll
|
71acf4e2cc
|
added checkpoint & sandbox
|
1 year ago |
Thomas Knoll
|
afc9f5bc4d
|
added config / some adversary fixes
|
1 year ago |
Thomas Knoll
|
5745113179
|
changed one hot wrapping
|
1 year ago |
Thomas Knoll
|
d41ba6258f
|
adversary handling
|
1 year ago |
Thomas Knoll
|
906f251401
|
some fixes to key and door handling
|
1 year ago |
Thomas Knoll
|
ddc0a048b2
|
more args and door / key handling
|
1 year ago |
Thomas Knoll
|
bd09223fad
|
added shieldhandler key handling
|
1 year ago |
Thomas Knoll
|
c0fc671870
|
probability evaluation in jupyter notebooks
|
1 year ago |
Thomas Knoll
|
41f94bf92e
|
added expname to grid path
|
1 year ago |
Thomas Knoll
|
6e5fca3644
|
changed exp name handling
|
1 year ago |
Thomas Knoll
|
7c993cbac4
|
flattened tune logging directory structure
|
1 year ago |
Thomas Knoll
|
bcc19ec9ca
|
added trial name
|
1 year ago |
Thomas Knoll
|
37a9d79051
|
more changes to logging
|
1 year ago |
Thomas Knoll
|
442fff1344
|
chnaged logdir handling
|
1 year ago |
Thomas Knoll
|
1812afee59
|
changed shield
|
1 year ago |
Thomas Knoll
|
604d2c2b76
|
changed action handling for probabilities
|
1 year ago |
Thomas Knoll
|
1528173f58
|
changed iterations to evaluations
|
1 year ago |
Thomas Knoll
|
d64f569499
|
commented out shield export call
|
1 year ago |
Thomas Knoll
|
b05029eff0
|
added init call
|
1 year ago |
Thomas Knoll
|
7a6496cfee
|
removed some type annotations in callback
|
1 year ago |
Thomas Knoll
|
f0df936716
|
added steps argument and stop criteria
|
1 year ago |
Thomas Knoll
|
b238f5c1a7
|
changed jupyter notebooks to tune
|
1 year ago |
Thomas Knoll
|
8650b7c91f
|
added tune example
changes to algorithm parsing
|
1 year ago |
Thomas Knoll
|
717c644aad
|
changed ray tune example
|
1 year ago |
Thomas Knoll
|
3dee543e24
|
renaming and notebooks
|
1 year ago |
Thomas Knoll
|
4e182a8e5b
|
removed basic training
|
1 year ago |
Thomas Knoll
|
138d917fd6
|
added tune example
refactored and evaluation logging
|
1 year ago |
Thomas Knoll
|
e2c855dc6a
|
added more shielding options
(training, evaluation, none, both)
|
1 year ago |
Thomas Knoll
|
f3747a1479
|
renaming / shield handling changes
|
1 year ago |
Thomas Knoll
|
757fbbcc0d
|
fixed shield generation
worker handling
|
1 year ago |
Thomas Knoll
|
1c2dbf706e
|
changed shield creation to create shield on reset
|
1 year ago |
Thomas Knoll
|
97f7d23cda
|
added rudimental key / door masking
|
1 year ago |
Thomas Knoll
|
b1b014dbd6
|
some refactoring as preparation for sb3 example
added sb3 example
|
1 year ago |
Thomas Knoll
|
fab1e8f23f
|
added logdir handling
chnages to action index handling
|
1 year ago |
Thomas Knoll
|
7f20c3f909
|
arguments and log dir
|
1 year ago |
Thomas Knoll
|
fe96a6a0b6
|
added dqn algorithm
|
1 year ago |
Thomas Knoll
|
e42becef88
|
added dqn handling skeleton
|
1 year ago |
Thomas Knoll
|
f52262ad11
|
simple masking (only turn left allowed)
|
1 year ago |
Thomas Knoll
|
cf18349819
|
basic action embedding
|
1 year ago |
Thomas Knoll
|
dd9dd43036
|
initial layout for rl test
|
1 year ago |
Thomas Knoll
|
6b8ceedccb
|
fixed unit tests
changed shield specification in tests
|
1 year ago |
Thomas Knoll
|
a89c9711bf
|
changes after shield filename removal
|
1 year ago |