Gail
gail_irl_model
GailRewardModel
- class ding.reward_model.gail_irl_model.BaseRewardModel
- Overview:
The base class for reward models.
- Interface:
default_config, estimate, train, clear_data, collect_data, load_expert_data
- __init__()
- abstract clear_data() → None
- Overview:
Clear the training data. This can be a side-effect function that clears the data attribute in self.
- abstract collect_data(data) → None
- Overview:
Collect training data in the designated format or with the designated transition.
- Arguments:
data (Any): Raw training data (e.g. some form of states, actions, obs, etc.)
- Returns / Effects:
This can be a side-effect function that updates the data attribute in self.
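The side-effect contract of collect_data and clear_data can be illustrated with a minimal sketch. It does not import ding: RewardModelSketch is a hypothetical stand-in that mirrors only the two abstract methods described here, and the train_data attribute name and the dict-shaped transitions are assumptions made for illustration.

```python
from abc import ABC, abstractmethod
from typing import Any, List


class RewardModelSketch(ABC):
    """Hypothetical stand-in for the data-handling part of the interface."""

    @abstractmethod
    def collect_data(self, data: Any) -> None:
        ...

    @abstractmethod
    def clear_data(self) -> None:
        ...


class ListBufferRewardModel(RewardModelSketch):
    def __init__(self) -> None:
        # Assumed attribute name; the real class may store data differently.
        self.train_data: List[Any] = []

    def collect_data(self, data: Any) -> None:
        # Side effect: append the new transitions to the buffer held on self.
        self.train_data.extend(data)

    def clear_data(self) -> None:
        # Side effect: empty the buffer in place; nothing is returned.
        self.train_data.clear()


model = ListBufferRewardModel()
model.collect_data([{"obs": 0, "action": 1}, {"obs": 1, "action": 0}])
n_collected = len(model.train_data)
model.clear_data()
n_after_clear = len(model.train_data)
```

Both methods return None and communicate only through the state stored on the instance, which is the pattern the descriptions above allow for.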
- abstract estimate(data: list) → Any
- Overview:
Estimate the reward.
- Arguments:
data (List): The list of data used for estimation.
- Returns / Effects:
This can be a side-effect function that updates the reward value in place. If the function returns a value, an example of the returned object is reward (Any): the estimated reward.
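As a rough sketch of the two behaviours estimate may have (updating the reward in place, returning something, or both), consider the standalone function below. It is not the library's implementation: estimate_sketch, the dict-shaped items, and the scoring rule are all hypothetical. A GAIL-style model would normally derive the reward from a trained discriminator, which is omitted here.

```python
from typing import Any, Dict, List


def estimate_sketch(data: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Hypothetical estimate: writes a made-up reward into every item."""
    for item in data:
        # Illustrative rule only: penalise large actions.
        item["reward"] = -abs(item["action"])
    # Returning the same list is optional; the in-place update is the side effect.
    return data


batch = [{"action": 0.5}, {"action": -2.0}]
rewards = [item["reward"] for item in estimate_sketch(batch)]
```

Because the items are mutated in place, a caller that ignores the return value still sees the updated reward field on batch.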