framework.middleware.ckpt_handler
ckpt_handler
CkptSaver
- class ding.framework.middleware.ckpt_handler.CkptSaver(*args, **kwargs)
- Overview:
The class used to save checkpoint data.
- __call__(ctx: OnlineRLContext | OfflineRLContext) -> None
- Overview:
The method used to save checkpoint data. The checkpoint data is saved to a file in the following three cases:
- When a multiple of self.train_freq iterations has elapsed since the beginning of training;
- When the evaluation episode return is the best so far;
- When task.finish is True.
- Input of ctx:
- train_iter (int): Number of training iterations, i.e. the number of updates to the policy-related networks.
- eval_value (float): The episode return of the current iteration.
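The three save conditions described for `__call__` can be illustrated with a minimal, self-contained sketch. It is not the library's implementation: the `SaveTracker` class, its `reasons` method, and the reason labels are hypothetical names introduced here, and treating `train_freq=None` as "no periodic saving" is an assumption based on the default value in the signature.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SaveTracker:
    """Hypothetical helper mirroring the three documented save conditions."""
    train_freq: Optional[int] = None   # assumption: None disables periodic saving
    save_finish: bool = True
    last_save_iter: int = 0
    best_eval_value: float = float("-inf")

    def reasons(self, train_iter: int, eval_value: Optional[float], finished: bool) -> List[str]:
        """Return the list of reasons a checkpoint would be written for this call."""
        out: List[str] = []
        # Case 1: at least train_freq iterations have passed since the last periodic save.
        if self.train_freq and train_iter - self.last_save_iter >= self.train_freq:
            out.append("iteration")
            self.last_save_iter = train_iter
        # Case 2: the evaluation return is strictly better than any seen so far.
        if eval_value is not None and eval_value > self.best_eval_value:
            out.append("best_eval")
            self.best_eval_value = eval_value
        # Case 3: the task has finished and a final checkpoint was requested.
        if finished and self.save_finish:
            out.append("final")
        return out


tracker = SaveTracker(train_freq=100)
print(tracker.reasons(train_iter=50, eval_value=None, finished=False))
print(tracker.reasons(train_iter=100, eval_value=1.0, finished=False))
```

Under these assumptions, a single call can trigger more than one save, for example when a periodic save coincides with a new best evaluation return.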
- __init__(policy: Policy, save_dir: str, train_freq: int | None = None, save_finish: bool = True)
- Overview:
Initialize the CkptSaver.
- Arguments:
- policy (Policy): The policy whose checkpoint is saved.
- save_dir (str): The directory path in which to save checkpoints.
- train_freq (int): Number of training iterations between checkpoint saves.
- save_finish (bool): Whether to save the final checkpoint when task.finish = True.
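To show how the constructor arguments might fit together, here is a small stand-in written against the documented signature. It is a sketch only, not ding's `CkptSaver`: the class name `SketchCkptSaver`, the `finished` flag (standing in for `task.finish`), the pickle-based storage, and the file names are all hypothetical, and the policy is replaced by a plain dictionary.

```python
import os
import pickle
import tempfile
from types import SimpleNamespace
from typing import Any, Optional


class SketchCkptSaver:
    """Hypothetical stand-in that only mirrors the documented arguments."""

    def __init__(self, policy: Any, save_dir: str,
                 train_freq: Optional[int] = None, save_finish: bool = True):
        self.policy = policy
        self.save_dir = save_dir
        self.train_freq = train_freq    # assumption: None turns periodic saving off
        self.save_finish = save_finish
        self.last_save_iter = 0
        os.makedirs(save_dir, exist_ok=True)  # make sure the target directory exists

    def _dump(self, name: str) -> str:
        # Serialize the stand-in policy; a real saver would store model weights.
        path = os.path.join(self.save_dir, name)
        with open(path, "wb") as f:
            pickle.dump(self.policy, f)
        return path

    def __call__(self, ctx: Any, finished: bool = False) -> None:
        # Periodic save every train_freq iterations (file name is hypothetical).
        if self.train_freq and ctx.train_iter - self.last_save_iter >= self.train_freq:
            self._dump(f"iteration_{ctx.train_iter}.pkl")
            self.last_save_iter = ctx.train_iter
        # Final save, controlled by save_finish.
        if finished and self.save_finish:
            self._dump("final.pkl")


save_dir = tempfile.mkdtemp()
saver = SketchCkptSaver(policy={"weights": [0.1, 0.2]}, save_dir=save_dir, train_freq=2)
for it in range(1, 5):
    ctx = SimpleNamespace(train_iter=it, eval_value=None)
    saver(ctx, finished=(it == 4))
saved_files = sorted(os.listdir(save_dir))
print(saved_files)
```

With `train_freq=2` and four iterations, this stand-in writes two periodic files and one final file; passing `save_finish=False` would skip the last one.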