26 Commits (5c6e69ccafe2f18618949377db37e1e6ff2e42cf)

Author SHA1 Message Date
sp 5c6e69ccaf disable learning 10 months ago
sp 3f032dd8b3 re-init evalCallback and save final model 10 months ago
sp b9ed2ac234 changed some callbacks 10 months ago
sp 24dec631aa record gifs when evaluating 10 months ago
sp d7e7a2411b use shield in evaluation when full shielding 11 months ago
sp 028c942625 evaluate sb3 training 11 months ago
sp a69dab422c log to stdout 11 months ago
sp b696dac5f6 configure logger manually to change csv filename 11 months ago
sp 703b213248 log to csv and tensorboard only 11 months ago
sp 95a06bd0f0 pass nocleanup flag correctly 11 months ago
sp 5ab83b7460 only create ShieldHandler when necessary 11 months ago
sp 7c39ab3b87 pass cleanup flag to handler 11 months ago
sp 2bcb38f6af refactored training without shield 11 months ago
sp 36c04f1b81 set sb3 device to auto 11 months ago
sp 71854bae01 init evalCallback for training with sb3 11 months ago
sp 59c795348e changes in sb3 rl training 11 months ago
sp 7ccbe8f9bc changes according to refactoring of utils 11 months ago
Thomas Knoll 5dcabef8e0 added utils classes 11 months ago
Thomas Knoll 1528173f58 changed iterations to evaluations 1 year ago
Thomas Knoll 3dee543e24 renaming and notebooks 1 year ago
Thomas Knoll 138d917fd6 added tune example 1 year ago
Thomas Knoll f3747a1479 renaming / shield handling changes 1 year ago
Thomas Knoll 757fbbcc0d fixed shield generation 1 year ago
Thomas Knoll 1c2dbf706e changed shield creation to create shield on reset 1 year ago
Thomas Knoll 97f7d23cda added rudimental key / door masking 1 year ago
Thomas Knoll b1b014dbd6 some refactoring as preparation for sb3 example 1 year ago