DI-engine

RL Utils

rl_utils.td
- Temporal Difference
rl_utils.gae
- gae
  - gae
rl_utils.ppo
- ppo
  - ppo_error
rl_utils.adder
- adder
  - Adder
rl_utils.exploration
- exploration
rl_utils.a2c
- a2c
  - a2c_error
rl_utils.isw
- isw
  - compute_importance_weights
rl_utils.vtrace
- vtrace
  - vtrace_error
rl_utils.value_rescale
- value_rescale
  - value_transform
  - value_inv_transform
rl_utils.coma
- coma
  - coma_error
rl_utils.upgo
- UPGO
  - upgo_returns
  - upgo_loss


© Copyright 2021, OpenDILab Contributors. Revision ce42f771.
