196 Commits (5c6e69ccafe2f18618949377db37e1e6ff2e42cf)

Author SHA1 Message Date
sp 5c6e69ccaf disable learning 10 months ago
sp 1a3ad544b7 added info picked_up 10 months ago
sp 59f0011614 add info about opened_door 10 months ago
sp 3f032dd8b3 re-init evalCallback and save final model 10 months ago
sp 27004e0916 check all actions for mask 10 months ago
sp 775341c3af fixed typo in info callback 10 months ago
sp 62fddc1e27 fixed bug in info callback 10 months ago
sp b9ed2ac234 changed some callbacks 10 months ago
sp e21a578ad8 fixed some issues with start parsing regex 10 months ago
sp 24dec631aa record gifs when evaluating 10 months ago
sp d7e7a2411b use shield in evaluation when full shielding 10 months ago
sp 2a0b75bb15 added helper for ShieldingConfig 10 months ago
sp 028c942625 evaluate sb3 training 10 months ago
sp a69dab422c log to stdout 10 months ago
sp f68a4052f9 add expname suffix 10 months ago
sp c387d99a6c include unique id in experiment name 10 months ago
sp b696dac5f6 configure logger manually to change csv filename 10 months ago
sp d2aa224bc8 move isodate to expname 10 months ago
sp 8cbbef4006 ensure that exp log directory exists 10 months ago
sp 703b213248 log to csv and tensorboard only 10 months ago
sp 95a06bd0f0 pass nocleanup flag correctly 10 months ago
sp d194537f56 shortened expname 10 months ago
sp 09ef1aac10 fixed nocleanup argument 10 months ago
sp efbad0cc27 log to subdirectory 10 months ago
sp 5ab83b7460 only create ShieldHandler when necessary 10 months ago
sp 7c39ab3b87 pass cleanup flag to handler 10 months ago
sp 11d5c3c811 store shield files in local tmp dir 10 months ago
sp 29799bc52f resynthesize on reset is default False 10 months ago
sp 7627645e63 changed default expname 10 months ago
sp 2bcb38f6af refactored training without shield 10 months ago
sp 36c04f1b81 set sb3 device to auto 10 months ago
sp 71854bae01 init evalCallback for training with sb3 10 months ago
sp 59c795348e changes in sb3 rl training 10 months ago
sp 315b0c8e7d added useful sb3 callbacks 10 months ago
sp 62c1198f25 removed observation changes from shielding wrapper 10 months ago
sp 16490a74f1 init Miniwrapper to switch to WxHxC observations 10 months ago
sp 7ccbe8f9bc changes according to refactoring of utils 10 months ago
sp 372006a1da major refactor in utils 10 months ago
Thomas Knoll 175171c035 added jpg remove on gif create 11 months ago
sp 521c71eba4 removed erroneous file write 11 months ago
sp 4486dba2c9 disabled shield creation at reset WIP 11 months ago
sp aa2cec0e0f disabled default shield creation on reset 11 months ago
sp 7eeb816013 changed default env 11 months ago
sp c7c2296c71 changed early stopping criterion in tuner 11 months ago
Thomas Knoll 555511bd34 args for turn prob 11 months ago
Thomas Knoll f8a3c52b9c fixed shielding 11 months ago
Thomas Knoll 1cbaac75cb cleanups 11 months ago
Thomas Knoll 5dcabef8e0 added utils classes 11 months ago
Thomas Knoll ae94b57876 changed iteration handling 11 months ago
Stefan Pranger f3b12f4caa removed callbacks for shield info 11 months ago