Shortcuts

ding.rl_utils

a2c

Please refer to ding/rl_utils/a2c for more details.

a2c_error

a2c_error_continuous

acer

Please refer to ding/rl_utils/acer for more details.

acer_policy_error

acer_value_error

acer_trust_region_update

adder

Please refer to ding/rl_utils/adder for more details.

Adder

get_gae

get_gae_with_default_last_value

get_nstep_return_data

get_train_sample

beta_function

Please refer to ding/rl_utils/beta_function for more details.

cpw

CVaR

beta_function_map

coma

Please refer to ding/rl_utils/coma for more details.

coma_error

exploration

Please refer to ding/rl_utils/exploration for more details.

get_epsilon_greedy_fn

BaseNoise

GaussianNoise

OUNoise

create_noise_generator

gae

Please refer to ding/rl_utils/gae for more details.

gae_data

shape_fn_gae

gae

isw

Please refer to ding/rl_utils/isw for more details.

compute_importance_weights

ppg

Please refer to ding/rl_utils/ppg for more details.

ppg_data

ppg_joint_loss

ppg_joint_error

ppo

Please refer to ding/rl_utils/ppo for more details.

ppo_data

ppo_policy_data

ppo_value_data

ppo_loss

ppo_policy_loss

ppo_info

shape_fn_ppo

ppo_error

ppo_policy_error

ppo_value_error

ppo_error_continuous

ppo_policy_error_continuous

retrace

Please refer to ding/rl_utils/retrace for more details.

compute_q_retraces

sampler

Please refer to ding/rl_utils/sampler for more details.

ArgmaxSampler

MultinomialSampler

MuSampler

ReparameterizationSampler

HybridStochasticSampler

HybridDeterminsticSampler

td

Please refer to ding/rl_utils/td for more details.

q_1step_td_data

q_1step_td_error

m_q_1step_td_data

m_q_1step_td_error

q_v_1step_td_data

q_v_1step_td_error

nstep_return_data

nstep_return

dist_1step_td_data

dist_1step_td_error

dist_nstep_td_data

shape_fn_dntd

dist_nstep_td_error

v_1step_td_data

v_1step_td_error

v_nstep_td_data

v_nstep_td_error

q_nstep_td_data

dqfd_nstep_td_data

shape_fn_qntd

q_nstep_td_error

bdq_nstep_td_error

shape_fn_qntd_rescale

q_nstep_td_error_with_rescale

dqfd_nstep_td_error

dqfd_nstep_td_error_with_rescale

qrdqn_nstep_td_data

qrdqn_nstep_td_error

q_nstep_sql_td_error

iqn_nstep_td_data

iqn_nstep_td_error

fqf_nstep_td_data

fqf_nstep_td_error

evaluate_quantile_at_action

fqf_calculate_fraction_loss

td_lambda_data

shape_fn_td_lambda

td_lambda_error

generalized_lambda_returns

multistep_forward_view

upgo

Please refer to ding/rl_utils/upgo for more details.

upgo_returns

upgo_loss

value_rescale

Please refer to ding/rl_utils/value_rescale for more details.

value_transform

value_inv_transform

symlog

inv_symlog

vtrace

Please refer to ding/rl_utils/vtrace for more details.

vtrace_nstep_return

vtrace_advantage

vtrace_data

vtrace_loss

vtrace_error_discrete_action

vtrace_error_continuous_action