7 Commits (5c9064ecbeb2c60b634054fb195fc185b616c991)

Author SHA1 Message Date
Thomas Knoll 5c9064ecbe randomize start 1 year ago
Thomas Knoll 80cbbe5a3a minor changes 1 year ago
Thomas Knoll afc9f5bc4d added config / some adversary fixes 2 years ago
Thomas Knoll 5745113179 changed one hot wrapping 2 years ago
Thomas Knoll 906f251401 some fixes to key and door handling 2 years ago
Thomas Knoll 604d2c2b76 changed action handling for probabilities 2 years ago
Thomas Knoll 3dee543e24 renaming and notebooks 2 years ago
Thomas Knoll 138d917fd6 added tune example 2 years ago
Thomas Knoll e2c855dc6a added more shielding options 2 years ago
Thomas Knoll f3747a1479 renaming / shield handling changes 2 years ago
Thomas Knoll 757fbbcc0d fixed shield generation 2 years ago
Thomas Knoll 1c2dbf706e changed shield creation to create shield on reset 2 years ago
Thomas Knoll 97f7d23cda added rudimental key / door masking 2 years ago
Thomas Knoll b1b014dbd6 some refactoring as preparation for sb3 example 2 years ago
Thomas Knoll fab1e8f23f added logdir handling 2 years ago
Thomas Knoll 7f20c3f909 arguments and log dir 2 years ago
Thomas Knoll fe96a6a0b6 added dqn algorithm 2 years ago
Thomas Knoll e42becef88 added dqn handling skeleton 2 years ago
Thomas Knoll f52262ad11 simple masking (only turn left allowed) 2 years ago
Thomas Knoll cf18349819 basic action embedding 2 years ago