framework.middleware.collector¶

collector¶

StepCollector¶

class ding.framework.middleware.collector.StepCollector(*args, **kwargs)[source]¶

Overview:: The class of the collector running by steps, including model inference and transition process. Use the __call__ method to execute the whole collection process.

__call__(ctx: OnlineRLContext) → None[source]¶

Overview:

An encapsulation of inference and rollout middleware. Stop when completing the target number of steps.

Input of ctx:

env_step (int): The env steps which will increase during collection.

__init__(cfg: EasyDict, policy, env: BaseEnvManager, random_collect_size: int = 0) → None[source]¶

Arguments:

cfg (EasyDict): Config.
policy (Policy): The policy to be collected.
env (BaseEnvManager): The env for the collection, the BaseEnvManager object or its derivatives are supported.
random_collect_size (int): The count of samples that will be collected randomly, typically used in initial runs.

EpisodeCollector¶

class ding.framework.middleware.collector.EpisodeCollector(cfg: EasyDict, policy, env: BaseEnvManager, random_collect_size: int = 0)[source]¶

Overview:: The class of the collector running by episodes, including model inference and transition process. Use the __call__ method to execute the whole collection process.

__call__(ctx: OnlineRLContext) → None[source]¶

Overview:

An encapsulation of inference and rollout middleware. Stop when completing the target number of episodes.

Input of ctx:

env_episode (int): The env env_episode which will increase during collection.

__init__(cfg: EasyDict, policy, env: BaseEnvManager, random_collect_size: int = 0) → None[source]¶

Arguments:

cfg (EasyDict): Config.
policy (Policy): The policy to be collected.
env (BaseEnvManager): The env for the collection, the BaseEnvManager object or its derivatives are supported.
random_collect_size (int): The count of samples that will be collected randomly, typically used in initial runs.