910 Commits (f891eea7b8a80ad0a6159d7002772cd1d15b3598)
 

Author SHA1 Message Date
sp f891eea7b8 set agent to deadlock when no shield action 6 months ago
sp 8cbb9c4073 debugging prism file passing 6 months ago
sp 493f66a950 pass prism file to handler 6 months ago
sp 70c1e63107 refactored has_key for python3 6 months ago
sp 89675dfe80 added argument for predefined prism file 6 months ago
sp eec87050eb log if shield cannot provide mask 9 months ago
sp 5587144815 removed the eval callback 9 months ago
sp 5e824757d7 disabled debug reenabled training 9 months ago
sp 4ae274047f more debugging 9 months ago
sp 31d833bc57 disable learning 9 months ago
sp f589b10692 more debugging 9 months ago
sp b5dd43ca23 debug printout for len of shield 9 months ago
sp 8a201a1c3c enable export of shield 10 months ago
sp a4d360f474 do not allow movement when shield is empty 11 months ago
sp dc8e4f320d reintroduced learning 11 months ago
sp 5c6e69ccaf disable learning 11 months ago
sp 1a3ad544b7 added info picked_up 11 months ago
sp 59f0011614 add info about opened_door 11 months ago
sp 3f032dd8b3 re-init evalCallback and save final model 11 months ago
sp 27004e0916 check all actions for mask 11 months ago
sp 775341c3af fixed typo in info callback 11 months ago
sp 62fddc1e27 fixed bug in info callback 11 months ago
sp b9ed2ac234 changed some callbacks 11 months ago
sp e21a578ad8 fixed some issues with start parsing regex 11 months ago
sp 24dec631aa record gifs when evaluating 11 months ago
sp d7e7a2411b use shield in evaluation when full shielding 11 months ago
sp 2a0b75bb15 added helper for ShieldingConfig 11 months ago
sp 028c942625 evaluate sb3 training 11 months ago
sp a69dab422c log to stdout 11 months ago
sp f68a4052f9 add expname suffix 11 months ago
sp c387d99a6c include unique id in experiment name 11 months ago
sp b696dac5f6 configure logger manually to change csv filename 11 months ago
sp d2aa224bc8 move isodate to expname 11 months ago
sp 8cbbef4006 ensure that exp log directory exists 11 months ago
sp 703b213248 log to csv and tensorboard only 11 months ago
sp 95a06bd0f0 pass nocleanup flag correctly 11 months ago
sp d194537f56 shortened expname 11 months ago
sp 09ef1aac10 fixed nocleanup argument 11 months ago
sp efbad0cc27 log to subdirectory 11 months ago
sp 5ab83b7460 only create ShieldHandler when necessary 11 months ago
sp 7c39ab3b87 pass cleanup flag to handler 11 months ago
sp 11d5c3c811 store shield files in local tmp dir 11 months ago
sp 29799bc52f resynthesize on reset defaults to False 11 months ago
sp 7627645e63 changed default expname 11 months ago
sp 2bcb38f6af refactored training without shield 11 months ago
sp 36c04f1b81 set sb3 device to auto 11 months ago
sp 71854bae01 init evalCallback for training with sb3 11 months ago
sp 59c795348e changes in sb3 rl training 11 months ago
sp 315b0c8e7d added useful sb3 callbacks 11 months ago
sp 62c1198f25 removed observation changes from shielding wrapper 11 months ago