912 Commits (refactoring)
 

Author SHA1 Message Date
sp 3521dbe471 disabled evalcallback 4 months ago
sp 83076beb95 enabled evaluation 4 months ago
sp f891eea7b8 set agent to deadlock when no shield action 4 months ago
sp 8cbb9c4073 debugging prism file passing 4 months ago
sp 493f66a950 pass prism file to handler 4 months ago
sp 70c1e63107 refactored has_key for python3 4 months ago
sp 89675dfe80 added argument for predefined prism file 4 months ago
sp eec87050eb log if shield cannot provide mask 7 months ago
sp 5587144815 removed the eval callback 7 months ago
sp 5e824757d7 disabled debug reenabled training 7 months ago
sp 4ae274047f more debugging 7 months ago
sp 31d833bc57 disable learning 7 months ago
sp f589b10692 more debugging 7 months ago
sp b5dd43ca23 debug printout for len of shield 7 months ago
sp 8a201a1c3c enable export of shield 7 months ago
sp a4d360f474 do not allowed movement when shield is empty 8 months ago
sp dc8e4f320d reintroduced learning 9 months ago
sp 5c6e69ccaf disable learning 9 months ago
sp 1a3ad544b7 added info picked_up 9 months ago
sp 59f0011614 add info about opened_door 9 months ago
sp 3f032dd8b3 re-init evalCallback and save final model 9 months ago
sp 27004e0916 check all actions for mask 9 months ago
sp 775341c3af fixed typo in info callback 9 months ago
sp 62fddc1e27 fixed bug in info callback 9 months ago
sp b9ed2ac234 changed some callbacks 9 months ago
sp e21a578ad8 fixed some issues with start parsing regex 9 months ago
sp 24dec631aa record gifs when evaluating 9 months ago
sp d7e7a2411b use shield in evaluation when full shielding 9 months ago
sp 2a0b75bb15 added helper for ShieldingConfig 9 months ago
sp 028c942625 evaluate sb3 training 9 months ago
sp a69dab422c log to stdout 9 months ago
sp f68a4052f9 add expname suffix 9 months ago
sp c387d99a6c include unique id in experiment name 9 months ago
sp b696dac5f6 configure logger manually to change csv filename 9 months ago
sp d2aa224bc8 move isodate to expname 9 months ago
sp 8cbbef4006 ensure that exp log directory exists 9 months ago
sp 703b213248 log to csv and tensorboard only 9 months ago
sp 95a06bd0f0 pass nocleanup flag correctly 9 months ago
sp d194537f56 shortened expname 9 months ago
sp 09ef1aac10 fixed nocleanup argument 9 months ago
sp efbad0cc27 log to subdirectory 9 months ago
sp 5ab83b7460 only create ShieldHandler when necessary 9 months ago
sp 7c39ab3b87 pass cleanup flag to handler 9 months ago
sp 11d5c3c811 store shield files in local tmp dir 9 months ago
sp 29799bc52f resynthesize on reset is default False 9 months ago
sp 7627645e63 changed default expname 9 months ago
sp 2bcb38f6af refactored training without shield 9 months ago
sp 36c04f1b81 set sb3 device to auto 9 months ago
sp 71854bae01 init evalCallback for training with sb3 9 months ago
sp 59c795348e changes in sb3 rl training 9 months ago