807 Commits (65ea61dd183aa859330f34dcc3b64b2c410a1ae4)
 

Author SHA1 Message Date
Thomas Knoll 65ea61dd18 added config file 1 year ago
Thomas Knoll 5c9064ecbe randomize start 1 year ago
Thomas Knoll 80cbbe5a3a minor changes 1 year ago
Thomas Knoll 71acf4e2cc added checkpoint & sandbox 1 year ago
Thomas Knoll afc9f5bc4d added config / some adversary fixes 1 year ago
Thomas Knoll 5745113179 changed one hot wrapping 1 year ago
Thomas Knoll d41ba6258f adversary handling 1 year ago
Thomas Knoll 906f251401 some fixes to key and door handling 1 year ago
Thomas Knoll ddc0a048b2 more args and door / key handling 1 year ago
Thomas Knoll bd09223fad added shieldhandler key handling 1 year ago
Thomas Knoll c0fc671870 probability evaluation in jupyter notebooks 1 year ago
Thomas Knoll 41f94bf92e added expname to grid path 1 year ago
Thomas Knoll 6e5fca3644 changed exp name handling 1 year ago
Thomas Knoll 7c993cbac4 flattened tune logging directory structure 1 year ago
Thomas Knoll bcc19ec9ca added trial name 1 year ago
Thomas Knoll 37a9d79051 more changes to logging 1 year ago
Thomas Knoll 442fff1344 chnaged logdir handling 1 year ago
Thomas Knoll 1812afee59 changed shield 1 year ago
Thomas Knoll 604d2c2b76 changed action handling for probabilities 1 year ago
Thomas Knoll 1528173f58 changed iterations to evaluations 1 year ago
Thomas Knoll d64f569499 commented out shield export call 1 year ago
Thomas Knoll b05029eff0 added init call 1 year ago
Thomas Knoll 7a6496cfee removed some type annotations in callback 1 year ago
Thomas Knoll f0df936716 added steps argument and stop criteria 1 year ago
Thomas Knoll b238f5c1a7 changed jupyter notebooks to tune 1 year ago
Thomas Knoll 8650b7c91f added tune example 1 year ago
Thomas Knoll 717c644aad changed ray tune example 1 year ago
Thomas Knoll 3dee543e24 renaming and notebooks 1 year ago
Thomas Knoll 4e182a8e5b removed basic training 1 year ago
Thomas Knoll 138d917fd6 added tune example 1 year ago
Thomas Knoll e2c855dc6a added more shielding options 1 year ago
Thomas Knoll f3747a1479 renaming / shield handling changes 1 year ago
Thomas Knoll 757fbbcc0d fixed shield generation 1 year ago
Thomas Knoll 1c2dbf706e changed shield creation to create shield on reset 1 year ago
Thomas Knoll 97f7d23cda added rudimental key / door masking 1 year ago
Thomas Knoll b1b014dbd6 some refactoring as preparation for sb3 example 1 year ago
Thomas Knoll fab1e8f23f added logdir handling 1 year ago
Thomas Knoll 7f20c3f909 arguments and log dir 1 year ago
Thomas Knoll fe96a6a0b6 added dqn algorithm 1 year ago
Thomas Knoll e42becef88 added dqn handling skeleton 1 year ago
Thomas Knoll f52262ad11 simple masking (only turn left allowed) 1 year ago
Thomas Knoll cf18349819 basic action embedding 1 year ago
Thomas Knoll dd9dd43036 initial layout for rl test 1 year ago
Thomas Knoll 6b8ceedccb fixed unit tests 1 year ago
Thomas Knoll a89c9711bf changes after shield filename removal 1 year ago
Thomas Knoll 6adfa0cde1 changed shield export function 1 year ago
Thomas Knoll 08e389a9da added shield_expression parameter for 1 year ago
Thomas Knoll 5d84d94028 removed useless calls in pre shield dt example 1 year ago
Thomas Knoll 7744b5e3dc changed pre shield decision tree export example 1 year ago
Thomas Knoll f2695b54d8 added dtcontrol dependency 1 year ago