|  sp | dc8e4f320d | reintroduced learning | 2 years ago | 
				
					
						|  sp | 5c6e69ccaf | disable learning This will likely be reverted or squashed away, since I am doing this to
get the synthesis times on the server... | 2 years ago | 
				
					
						|  sp | 1a3ad544b7 | added info picked_up | 2 years ago | 
				
					
						|  sp | 59f0011614 | add info about opened_door | 2 years ago | 
				
					
						|  sp | 3f032dd8b3 | re-init evalCallback and save final model Disabled eval logging to csv explicitely in the library, as this is
causing issues with the final progess*.csv | 2 years ago | 
				
					
						|  sp | 27004e0916 | check all actions for mask | 2 years ago | 
				
					
						|  sp | 775341c3af | fixed typo in info callback | 2 years ago | 
				
					
						|  sp | 62fddc1e27 | fixed bug in info callback | 2 years ago | 
				
					
						|  sp | b9ed2ac234 | changed some callbacks | 2 years ago | 
				
					
						|  sp | e21a578ad8 | fixed some issues with start parsing regex also keep track of time to postprocess shield | 2 years ago | 
				
					
						|  sp | 24dec631aa | record gifs when evaluating This currently does not use tensorboards feature, we have to leave this
as a TBD. The gifs will be left in the experiments log_dir | 2 years ago | 
				
					
						|  sp | d7e7a2411b | use shield in evaluation when full shielding | 2 years ago | 
				
					
						|  sp | 2a0b75bb15 | added helper for ShieldingConfig | 2 years ago | 
				
					
						|  sp | 028c942625 | evaluate sb3 training WIP: Not a 100% sure whether the masking will be used in the evaluation | 2 years ago | 
				
					
						|  sp | a69dab422c | log to stdout | 2 years ago | 
				
					
						|  sp | f68a4052f9 | add expname suffix | 2 years ago | 
				
					
						|  sp | c387d99a6c | include unique id in experiment name | 2 years ago | 
				
					
						|  sp | b696dac5f6 | configure logger manually to change csv filename | 2 years ago | 
				
					
						|  sp | d2aa224bc8 | move isodate to expname | 2 years ago | 
				
					
						|  sp | 8cbbef4006 | ensure that exp log directory exists | 2 years ago | 
				
					
						|  sp | 703b213248 | log to csv and tensorboard only | 2 years ago | 
				
					
						|  sp | 95a06bd0f0 | pass nocleanup flag correctly | 2 years ago | 
				
					
						|  sp | d194537f56 | shortened expname | 2 years ago | 
				
					
						|  sp | 09ef1aac10 | fixed nocleanup argument | 2 years ago | 
				
					
						|  sp | efbad0cc27 | log to subdirectory | 2 years ago | 
				
					
						|  sp | 5ab83b7460 | only create ShieldHandler when necessary this also renames camelcase variable logDir | 2 years ago | 
				
					
						|  sp | 7c39ab3b87 | pass cleanup flag to handler | 2 years ago | 
				
					
						|  sp | 11d5c3c811 | store shield files in local tmp dir | 2 years ago | 
				
					
						|  sp | 29799bc52f | resynthesize on reset is default False | 2 years ago | 
				
					
						|  sp | 7627645e63 | changed default expname | 2 years ago | 
				
					
						|  sp | 2bcb38f6af | refactored training without shield | 2 years ago | 
				
					
						|  sp | 36c04f1b81 | set sb3 device to auto This automatically detects whether a GPU can be used for training. | 2 years ago | 
				
					
						|  sp | 71854bae01 | init evalCallback for training with sb3 | 2 years ago | 
				
					
						|  sp | 59c795348e | changes in sb3 rl training - included callbacks for initial image and info plotting
- switched to CnnPolicy
- changed GRID_TO_PRISM_BINARY to environment var M2P_BINARY | 2 years ago | 
				
					
						|  sp | 315b0c8e7d | added useful sb3 callbacks | 2 years ago | 
				
					
						|  sp | 62c1198f25 | removed observation changes from shielding wrapper | 2 years ago | 
				
					
						|  sp | 16490a74f1 | init Miniwrapper to switch to WxHxC observations | 2 years ago | 
				
					
						|  sp | 7ccbe8f9bc | changes according to refactoring of utils | 2 years ago | 
				
					
						|  sp | 372006a1da | major refactor in utils - introduced common_parser for arguments
- the shield dict uses minigrid.core.State instead of strings
- switched shield query to minigrid get_symbolic_state | 2 years ago | 
				
					
						|  Thomas Knoll | 175171c035 | added jpg remove on gif create | 2 years ago | 
				
					
						|  sp | 521c71eba4 | removed erroneous file write | 2 years ago | 
				
					
						|  sp | 4486dba2c9 | disabled shield creation at reset WIP | 2 years ago | 
				
					
						|  sp | aa2cec0e0f | disabled default shield creation on reset | 2 years ago | 
				
					
						|  sp | 7eeb816013 | changed default env | 2 years ago | 
				
					
						|  sp | c7c2296c71 | changed early stopping criterion in tuner | 2 years ago | 
				
					
						|  Thomas Knoll | 555511bd34 | args for turn prob | 2 years ago | 
				
					
						|  Thomas Knoll | f8a3c52b9c | fixed shielding | 2 years ago | 
				
					
						|  Thomas Knoll | 1cbaac75cb | cleanups | 2 years ago | 
				
					
						|  Thomas Knoll | 5dcabef8e0 | added utils classes | 2 years ago | 
				
					
						|  Thomas Knoll | ae94b57876 | changed iteration handling | 2 years ago |