Playback Policy
A policy that replays a fixed sequence of actions. When all actions are used, a backup policy is used.
POMDPPolicies.PlaybackPolicy
— TypePlaybackPolicy{A<:AbstractArray, P<:Policy, V<:AbstractArray{<:Real}}
a policy that applies a fixed sequence of actions until they are all used and then falls back onto a backup policy until the end of the episode.
Constructor:
`PlaybackPolicy(actions::AbstractArray, backup_policy::Policy; logpdfs::AbstractArray{Float64, 1} = Float64[])`
Fields
actions::Vector{A}
a vector of actions to play backbackup_policy::Policy
the policy to use when all prescribed actions have been taken but the episode continueslogpdfs::Vector{Float64}
the log probability (density) of actionsi::Int64
the current action index
Missing docstring for PlaybackPolicy
. Check Documenter's build log for details.