Home > marl > agent > actfuns > fullstate_stochact.m

fullstate_stochact

PURPOSE ^

Implements stochastic action selection for policy indexed on full world state

SYNOPSIS ^

function [a, action] = fullstate_stochact(a, state, actions, rewards, params)

DESCRIPTION ^

Implements stochastic action selection for policy indexed on full world state
  [A, ACTION] = FULLSTATE_STOCHACT(A, STATE, ACTIONS, REWARDS, PARAMS)
  Implements stochastic action selection for policy indexed on full world state.
  An action is chosen based on the current stochastic policy of the agent.
  The elements of the policy corresponding to a given state must form a
  valid probability distribution over the (discrete) actions.

  This policy must be stored under field 'PI' of the agent, as a flat
  vector representing a matrix with dimensions agent-action-space-size X
  agent-state-space-size. Moreover, this size must be cached in field
  'sizes.pi' of the agent.

  Supports discrete states and actions, with 1 action variable only.

  See also agent_act

CROSS-REFERENCE INFORMATION ^

This function calls: This function is called by:
Generated on Wed 04-Aug-2010 16:55:08 by m2html © 2005