Running Simulations
Running a simulation consists of two steps, creating a simulator and calling the simulate
function. For example, given a POMDP or MDP model m
, and a policy p
, one can use the RolloutSimulator
from POMDPTools to find the accumulated discounted reward from a single simulated trajectory as follows:
sim = RolloutSimulator()
r = simulate(sim, m, p)
More inputs, such as a belief updater, initial state, initial belief, etc. may be specified as arguments to simulate
. See the docstring for simulate
and the appropriate "Input" sections in the Simulation Standard page for more information.
More examples can be found in the Simulations Examples section. A variety of simulators that return more information and interact in different ways can be found in POMDPTools.