changed early stopping criterion in tuner

refactoring
sp 11 months ago
parent
commit
c7c2296c71
examples/shields/rl/15_train_eval_tune.py (2 lines changed)

@@ -74,7 +74,7 @@ def ppo(args):
         ),
         run_config=air.RunConfig(
-            stop = {"episode_reward_mean": 94,
+            stop = {"episode_reward_mean": 1,
                     "timesteps_total": args.steps,},
             checkpoint_config=air.CheckpointConfig(checkpoint_at_end=True,
                                                    num_to_keep=1,
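
Note on the change: a rough sketch of how a dict-valued `stop` criterion behaves, assuming Ray Tune's documented semantics that a trial ends as soon as any listed metric reaches its threshold. The helper `should_stop` and the sample numbers (including `20000` standing in for `args.steps`) are hypothetical and not part of this commit; the sketch does not call Ray itself.

```python
def should_stop(result, stop):
    """Return True once any metric named in `stop` reaches its threshold.

    Hypothetical stand-in for the dict-based stopping check; it is not
    Ray Tune code, only an illustration of the assumed behaviour.
    """
    return any(
        metric in result and result[metric] >= threshold
        for metric, threshold in stop.items()
    )


# 20000 is a made-up placeholder for args.steps.
old_stop = {"episode_reward_mean": 94, "timesteps_total": 20000}
new_stop = {"episode_reward_mean": 1, "timesteps_total": 20000}

# A made-up intermediate training result.
result = {"episode_reward_mean": 1.5, "timesteps_total": 4000}

print(should_stop(result, old_stop))  # old threshold (94) not reached
print(should_stop(result, new_stop))  # new threshold (1) already reached
```

Under these assumptions, lowering the reward threshold from 94 to 1 means a trial can end much earlier on reward alone, while `timesteps_total` still caps the run length in both versions.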
