How to train and deploy reinforcement learning agents¶

In GenerativeRL, the RL algorithms are implemented as a class under the grl.algorithms module, while the agents are implemented as a class under the grl.agents module.

Every algorithm class has a train method that takes the environment, dataset, and other hyperparameters as input and returns the trained model. Every algorithm class also has a deploy method that copys the trained model and returns the trained agent.

For training a specific RL algorithm, you need to follow these steps:

Create an instance of the RL algorithm class.

from grl.algorithms.qgpo import QGPOAlgorithm

Define the hyperparameters for the algorithm in a configurations dictionary. You can use the default configurations provided under the grl_pipelines module.

from grl_pipelines.diffusion_model.configurations.d4rl_halfcheetah_qgpo import config

Create an instance of algorithm class with the configurations dictionary.

algorithm = QGPOAlgorithm(config)

Train the algorithm using the train method.

trained_model = algorithm.train()

Deploy the trained model using the deploy method.

agent = algorithm.deploy()

Use the trained agent to interact with the environment and evaluate its performance.

import gym
env = gym.make(config.deploy.env.env_id)
observation = env.reset()
for _ in range(config.deploy.num_deploy_steps):
    env.render()
    observation, reward, done, _ = env.step(agent.act(observation))

For more information on how to train and deploy reinforcement learning agents, please refer to the API documentation and other sections of the GenerativeRL documentation.