Stochastic Policies

Types for representing randomized policies:

StochasticPolicy{D, RNG <: AbstractRNG}

Represents a stochastic policy. Actions are sampled from an arbitrary distribution.

Constructor:

`StochasticPolicy(distribution; rng=Random.GLOBAL_RNG)`

Fields

  • distribution::D
  • rng::RNG a random number generator
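As a rough illustration of the idea (a distribution paired with its own random number generator), the following minimal sketch uses only the Random standard library. The names `SketchStochasticPolicy` and `sample_action` are hypothetical stand-ins and are not part of the package. The sketch also assumes that anything `rand(rng, x)` can sample from, such as a vector of actions, counts as a "distribution".

```julia
using Random

# Hypothetical stand-in for illustration only; not the package's StochasticPolicy.
struct SketchStochasticPolicy{D, RNG <: AbstractRNG}
    distribution::D   # anything that rand(rng, distribution) can sample from
    rng::RNG          # generator used for every draw
end

# Keyword constructor with the same shape as the documented one.
# Random.default_rng() is used here as the default generator (an assumption).
SketchStochasticPolicy(distribution; rng=Random.default_rng()) =
    SketchStochasticPolicy(distribution, rng)

# Draw one action from the stored distribution using the stored generator.
sample_action(p::SketchStochasticPolicy) = rand(p.rng, p.distribution)

actions = [:left, :right, :stay]
policy = SketchStochasticPolicy(actions; rng=MersenneTwister(1))
a = sample_action(policy)
```

Storing the generator in the policy means that two policies built with identically seeded generators produce the same sequence of actions, which helps when reproducing simulations.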
CategoricalTabularPolicy

Represents a stochastic policy that samples an action from a categorical distribution with weights given by a ValuePolicy.

Constructor:

CategoricalTabularPolicy(mdp::Union{POMDP,MDP}; rng=Random.GLOBAL_RNG)

Fields

  • stochastic::StochasticPolicy
  • value::ValuePolicy
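To make "sampling from a categorical distribution with weights" concrete, here is a minimal sketch of weighted sampling by inverting the cumulative sum. The helper `sample_categorical` and the `value_table` layout (one row per state, one column per action, nonnegative entries) are assumptions made for illustration and do not describe how the package stores or samples its values.

```julia
using Random

# Hypothetical helper: return index i with probability weights[i] / sum(weights).
# Assumes all weights are nonnegative and at least one is positive.
function sample_categorical(rng::AbstractRNG, weights::AbstractVector{<:Real})
    total = sum(weights)
    total > 0 || throw(ArgumentError("weights must have a positive sum"))
    threshold = rand(rng) * total      # uniform draw in [0, total)
    cumulative = zero(total)
    for i in eachindex(weights)
        cumulative += weights[i]
        threshold < cumulative && return i
    end
    return lastindex(weights)          # guard against floating-point round-off
end

# Assumed layout: rows are states, columns are actions.
value_table = [1.0 0.0 0.0;
               0.2 0.3 0.5]
rng = MersenneTwister(42)
state = 2
action_index = sample_categorical(rng, value_table[state, :])
```

Because the weights are normalized by their sum inside the helper, the table rows do not need to sum to one.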
EpsGreedyPolicy

Represents an epsilon-greedy policy: with probability eps it samples a random action, and otherwise it samples from a given stochastic policy.

Constructor:

EpsGreedyPolicy(mdp::Union{MDP,POMDP}, eps::Float64; rng=Random.GLOBAL_RNG)
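The selection rule described above can be sketched in a few lines. The function `eps_greedy_action` and its `fallback` argument are hypothetical names used only for this illustration. The sketch assumes the random action is drawn uniformly from a finite collection of actions, and it stands in for the second policy with a plain function of the generator.

```julia
using Random

# Hypothetical helper for illustration; not the package's implementation.
function eps_greedy_action(rng::AbstractRNG, eps::Float64, actions, fallback)
    0.0 <= eps <= 1.0 || throw(ArgumentError("eps must lie in [0, 1]"))
    if rand(rng) < eps
        return rand(rng, actions)   # explore: uniform random action (assumed)
    else
        return fallback(rng)        # otherwise defer to the other policy
    end
end

rng = MersenneTwister(7)
actions = [:left, :right]
fallback = r -> :stay               # stand-in for sampling the other policy
chosen = eps_greedy_action(rng, 0.1, actions, fallback)
```

With `eps = 0.0` the fallback is always used, and with `eps = 1.0` every action is a uniform random draw, so eps directly controls the amount of exploration.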
