LightZero

Tutorials

  • Installation and Quick Start Guide
  • How to Customize Your Algorithms in LightZero?
  • How to Customize Your Environments in LightZero?
  • How to Set Configuration Files in LightZero
  • LightZero’s Logging and Monitoring System

API Documentation

  • Agent
  • Config
  • Entry
  • Envs
  • MCTS
  • Model
  • Policy
  • Worker
LightZero
  • <no title>
  • View page source

  • Agent
  • Config
    • lzero.config.meta
      • __TITLE__
      • __VERSION__
      • __DESCRIPTION__
      • __AUTHOR__
      • __AUTHOR_EMAIL__
  • Entry
    • train_alphazero
      • train_alphazero.__init__()
    • eval_alphazero
      • eval_alphazero.__init__()
    • train_muzero
      • train_muzero.__init__()
    • eval_muzero
      • eval_muzero.__init__()
    • train_muzero_with_gym_env
      • train_muzero_with_gym_env.__init__()
    • eval_muzero_with_gym_env
      • eval_muzero_with_gym_env.__init__()
    • train_muzero_with_reward_model
      • train_muzero_with_reward_model.__init__()
  • Envs
    • LightZeroEnvWrapper
      • LightZeroEnvWrapper.__init__()
      • LightZeroEnvWrapper._is_protocol
      • LightZeroEnvWrapper._np_random
      • LightZeroEnvWrapper.action_space
      • LightZeroEnvWrapper.class_name()
      • LightZeroEnvWrapper.close()
      • LightZeroEnvWrapper.metadata
      • LightZeroEnvWrapper.np_random
      • LightZeroEnvWrapper.observation_space
      • LightZeroEnvWrapper.render()
      • LightZeroEnvWrapper.render_mode
      • LightZeroEnvWrapper.reset()
      • LightZeroEnvWrapper.reward_range
      • LightZeroEnvWrapper.seed()
      • LightZeroEnvWrapper.spec
      • LightZeroEnvWrapper.step()
      • LightZeroEnvWrapper.unwrapped
    • ActionDiscretizationEnvWrapper
      • ActionDiscretizationEnvWrapper.__init__()
      • ActionDiscretizationEnvWrapper._is_protocol
      • ActionDiscretizationEnvWrapper._np_random
      • ActionDiscretizationEnvWrapper.action_space
      • ActionDiscretizationEnvWrapper.class_name()
      • ActionDiscretizationEnvWrapper.close()
      • ActionDiscretizationEnvWrapper.metadata
      • ActionDiscretizationEnvWrapper.np_random
      • ActionDiscretizationEnvWrapper.observation_space
      • ActionDiscretizationEnvWrapper.render()
      • ActionDiscretizationEnvWrapper.render_mode
      • ActionDiscretizationEnvWrapper.reset()
      • ActionDiscretizationEnvWrapper.reward_range
      • ActionDiscretizationEnvWrapper.seed()
      • ActionDiscretizationEnvWrapper.spec
      • ActionDiscretizationEnvWrapper.step()
      • ActionDiscretizationEnvWrapper.unwrapped
  • MCTS
    • Buffer
      • GameBuffer
      • MuZeroBuffer
      • EfficientZeroBuffer
    • Tree Search
      • MuZeroMCTSCtree
      • EfficientZeroMCTSCtree
      • GumbelMuZeroMCTSCtree
  • Model
    • Common
    • MuZeroModel
    • MuZeroModelMLP
    • EfficientZeroModel
    • EfficientZeroModelMLP
    • AlphaZeroModel
    • SampledEfficientZeroModel
    • SampledEfficientZeroModelMLP
    • StochasticMuZeroModel
    • StochasticMuZeroModelMLP
  • Policy
    • AlphaZeroPolicy
      • AlphaZeroPolicy
    • MuZeroPolicy
    • EfficientZeroPolicy
    • Gumbel AlphaZeroPolicy
      • GumbelAlphaZeroPolicy
    • Gumbel MuZeroPolicy
    • Sampled AlphaZeroPolicy
      • SampledAlphaZeroPolicy
    • Sampled MuZeroPolicy
    • Sampled EfficientZeroPolicy
    • Stochastic MuZeroPolicy
    • UniZeroPolicy
  • Worker
    • MuZeroCollector
      • MuZeroCollector
    • MuZeroEvaluator
      • MuZeroEvaluator
Previous Next

© Copyright 2023, OpenDILab Contributors.

Built with Sphinx using a theme provided by Read the Docs.