Gail
gail_irl_model
GailRewardModel
- class ding.reward_model.gail_irl_model.BaseRewardModel
- Overview:
The base class for reward models.
- Interface:
default_config, estimate, train, clear_data, collect_data, load_expert_data
- __init__()
- abstract clear_data() -> None
- Overview:
Clears the training data. This may be a side-effect function that clears the data attribute in self.
- abstract collect_data(data) -> None
- Overview:
Collects training data in the designated format or with the designated transition.
- Arguments:
data (Any): Raw training data (e.g. some form of states, actions, obs, etc.)
- Returns / Effects:
This may be a side-effect function that updates the data attribute in self.
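The side-effect pattern described for collect_data and clear_data can be sketched as follows. This is a minimal illustration, not DI-engine's implementation: RewardModelSketch is a hypothetical stand-in for the abstract interface, BufferedRewardModel is an invented subclass, and the attribute name train_data is an assumption (the documentation only refers to "the data attribute in self").

```python
from abc import ABC, abstractmethod
from typing import Any, List


class RewardModelSketch(ABC):
    """Hypothetical stand-in mirroring two of the abstract methods documented above."""

    @abstractmethod
    def collect_data(self, data: Any) -> None:
        ...

    @abstractmethod
    def clear_data(self) -> None:
        ...


class BufferedRewardModel(RewardModelSketch):
    def __init__(self) -> None:
        # Assumed attribute name; the docs only say "the data attribute in self".
        self.train_data: List[Any] = []

    def collect_data(self, data: Any) -> None:
        # Side effect: append the new transitions to the internal buffer.
        self.train_data.extend(data)

    def clear_data(self) -> None:
        # Side effect: drop everything collected so far.
        self.train_data.clear()


model = BufferedRewardModel()
model.collect_data([{"obs": 0, "action": 1}, {"obs": 1, "action": 0}])
n_collected = len(model.train_data)

model.clear_data()
n_after_clear = len(model.train_data)
```

Both methods return None and communicate only through the object's state, which is why the interface describes them in terms of effects rather than return values.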
- abstract estimate(data: list) -> Any
- Overview:
Estimates the reward.
- Arguments:
data (List): The list of data used for estimation.
- Returns / Effects:
This may be a side-effect function that updates the reward value. If the function returns a value, an example of the returned object is reward (Any): the estimated reward.
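The two behaviours the estimate entry allows (updating the reward as a side effect, and optionally returning a result) might look like the sketch below. Everything here is an assumption made for illustration: the dictionary keys obs, action and reward, the free function estimate, and the toy_reward scoring rule are all hypothetical and do not reflect how GAIL or DI-engine actually computes rewards.

```python
from typing import Any, Dict, List


def toy_reward(obs: float, action: float) -> float:
    # Hypothetical scoring rule, for illustration only.
    # A real GAIL model would score the pair with a learned discriminator.
    return -abs(obs - action)


def estimate(data: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Overwrite each item's 'reward' in place and also return the same list."""
    for item in data:
        item["reward"] = toy_reward(item["obs"], item["action"])
    return data


batch = [
    {"obs": 1.0, "action": 0.5, "reward": 0.0},
    {"obs": 0.0, "action": 2.0, "reward": 0.0},
]
result = estimate(batch)
rewards = [item["reward"] for item in batch]
```

Because the list is modified in place, a caller that ignores the return value still sees the updated rewards in batch.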