ding.rl_utils¶
a2c¶
Please refer to ding/rl_utils/a2c
for more details.
a2c_error¶
a2c_error_continuous¶
acer¶
Please refer to ding/rl_utils/acer
for more details.
acer_policy_error¶
acer_value_error¶
acer_trust_region_update¶
adder¶
Please refer to ding/rl_utils/adder
for more details.
Adder¶
get_gae¶
get_gae_with_default_last_value¶
get_nstep_return_data¶
get_train_sample¶
beta_function¶
Please refer to ding/rl_utils/beta_function
for more details.
cpw¶
CVaR¶
beta_function_map¶
coma¶
Please refer to ding/rl_utils/coma
for more details.
coma_error¶
exploration¶
Please refer to ding/rl_utils/exploration
for more details.
get_epsilon_greedy_fn¶
BaseNoise¶
GaussianNoise¶
OUNoise¶
create_noise_generator¶
gae¶
Please refer to ding/rl_utils/gae
for more details.
gae_data¶
shape_fn_gae¶
gae¶
isw¶
Please refer to ding/rl_utils/isw
for more details.
compute_importance_weights¶
ppg¶
Please refer to ding/rl_utils/ppg
for more details.
ppg_data¶
ppg_joint_loss¶
ppg_joint_error¶
ppo¶
Please refer to ding/rl_utils/ppo
for more details.
ppo_data¶
ppo_policy_data¶
ppo_value_data¶
ppo_loss¶
ppo_policy_loss¶
ppo_info¶
shape_fn_ppo¶
ppo_error¶
ppo_policy_error¶
ppo_value_error¶
ppo_error_continuous¶
ppo_policy_error_continuous¶
retrace¶
Please refer to ding/rl_utils/retrace
for more details.
compute_q_retraces¶
sampler¶
Please refer to ding/rl_utils/sampler
for more details.
ArgmaxSampler¶
MultinomialSampler¶
MuSampler¶
ReparameterizationSampler¶
HybridStochasticSampler¶
HybridDeterminsticSampler¶
td¶
Please refer to ding/rl_utils/td
for more details.
q_1step_td_data¶
q_1step_td_error¶
m_q_1step_td_data¶
m_q_1step_td_error¶
q_v_1step_td_data¶
q_v_1step_td_error¶
nstep_return_data¶
nstep_return¶
dist_1step_td_data¶
dist_1step_td_error¶
dist_nstep_td_data¶
shape_fn_dntd¶
dist_nstep_td_error¶
v_1step_td_data¶
v_1step_td_error¶
v_nstep_td_data¶
v_nstep_td_error¶
q_nstep_td_data¶
dqfd_nstep_td_data¶
shape_fn_qntd¶
q_nstep_td_error¶
bdq_nstep_td_error¶
shape_fn_qntd_rescale¶
q_nstep_td_error_with_rescale¶
dqfd_nstep_td_error¶
dqfd_nstep_td_error_with_rescale¶
qrdqn_nstep_td_data¶
qrdqn_nstep_td_error¶
q_nstep_sql_td_error¶
iqn_nstep_td_data¶
iqn_nstep_td_error¶
fqf_nstep_td_data¶
fqf_nstep_td_error¶
evaluate_quantile_at_action¶
fqf_calculate_fraction_loss¶
td_lambda_data¶
shape_fn_td_lambda¶
td_lambda_error¶
generalized_lambda_returns¶
multistep_forward_view¶
upgo¶
Please refer to ding/rl_utils/upgo
for more details.
upgo_returns¶
upgo_loss¶
value_rescale¶
Please refer to ding/rl_utils/value_rescale
for more details.
value_transform¶
value_inv_transform¶
symlog¶
inv_symlog¶
vtrace¶
Please refer to ding/rl_utils/vtrace
for more details.