Home > marl > agent > actfuns > stochact.m

stochact

PURPOSE ^

Implements stochastic action selection for policy indexed on agent's state

SYNOPSIS ^

function [a, action] = stochact(a, state, actions, rewards, params)

DESCRIPTION ^

Implements stochastic action selection for policy indexed on agent's state
  [A, ACTION] = STOCHACT(A, STATE, ACTIONS, REWARDS, PARAMS)
  Implements stochastic action selection. An action is chosen based on the
  current stochastic policy of the agent. The elements of the policy
  corresponding to a given state must form a valid probability
  distribution over the (discrete) actions.

  Supports discrete states and actions, with 1 action variable only.

  This policy must be stored under field 'PI' of the agent, as a flat
  vector representing a matrix with dimensions agent-action-space-size X
  agent-state-space-size. Moreover, this size must be cached in field
  'sizes.pi' of the agent.


  See also agent_act

CROSS-REFERENCE INFORMATION ^

This function calls: This function is called by:
Generated on Wed 04-Aug-2010 16:55:08 by m2html © 2005