Home > marl > agent > actfuns > pigreedyact.m

pigreedyact

PURPOSE ^

Implements greedy policy-based action selection for policy indexed on agent's state

SYNOPSIS ^

function [a, action] = pigreedyact(a, state, actions, rewards, params)

DESCRIPTION ^

Implements greedy policy-based action selection for policy indexed on agent's state
  [A, ACTION] = PIGREEDYACT(A, STATE, ACTIONS, REWARDS, PARAMS)
  Implements greedy policy-based action selection. The policy elements are
  interpreted as a measure of the agent's preference towards the
  corresponding actions, and the actions are chosen greedly w.r.t. this
  preference. Chooses the action with the highest policy value (breaking
  ties randomly).

  Supports discrete states and actions, with 1 action variable only.

  The policy must be stored under field 'PI' of the agent, as a flat vector
  representing a matrix with dimensions agent-action-space-size X
  agent-state-space-size. Moreover, this size must be cached in field
  'sizes.pi' of the agent.


  See also agent_act

CROSS-REFERENCE INFORMATION ^

This function calls: This function is called by:
Generated on Wed 04-Aug-2010 16:55:08 by m2html © 2005