2.1 Markov decision -- type 3 gains

Topics in applied probability Page 1 / 2

An electronic store stocks a certain type of VCR. At the end of each week, an order is placed for early delivery the following Monday. A maximum of fourunits is stocked. Let the states be the number of units on hand at the end of the sales week: $E b = {0, 1, 2, 3, 4}$ . Two possible actions:

Order two units, at a cost of $150 each
Order four units, at a cost of $120 each

Units sell for $200. If demand exceeds the stock in hand, the retailer assumes a penalty of $40 per unit (in losses due to customer dissatisfaction, etc.).Because of turnover, return on sales is considered two percent per week, so that discount is $α = 1 / 1.02$ on a weekly basis.

In state 0, there are three possible actions: order 0, 2, or 4. In states 1 and 2 there are two possible actions: order 0 or 2. In states 3 and 4, the only actionis to order 0. Customer demand in week $n + 1$ is represented by a random variable $D_{n + 1}$ . The class is iid, uniformly distributed on the values 0, 1, 2, 3, 4. If X _n is the state at the end of week n , then ${X_{n}, D_{n + 1}}$ is independent for each n .

Analyze the system as a Markov decision process with case 3 gains, depending upon current state, action, and demand. Determine the transition probability matrixPA (properly padded) and the gain matrix (also padded). Sample calculations are as follows:

State 0, action 0: $p_{00} (0) = 1$ (all other $p_{0 k} (0) = 0$ )
State 0, action 2: $p_{00} (2) = P (D \geq 2) = 3 / 5, p_{01} (2) = P (D = 1) = 1 / 5$ , etc.
State 2, action 2: $p_{2 j} (k) = 1 / 5, k = 0, 1, 2, 3, 4$

For state = i , action = a , and demand = k , we seek $g (i, a, k)$

$\begin{matrix} g (0, 0, k) = - 40 k & 0 & -40 & -80 & -120 & -160 \\ g (0, 2, k) = - 300 + 200 min {k, 2} - 40 max {k - 2, 0} & -300 & -100 & 100 & 60 & 20 \\ g (0, 4, k) = - 480 + 200 k & -480 & -280 & -80 & 120 & 320 \end{matrix}$

Complete the transition probability table and the gain table.
Determine an optimum infinite-horizon strategy with no discounting.
Determine an optimum infinite-horizon strateby with discounting (alpha = 1/1.02).
The manager decides to set up a six-week strategy, after which new sales conditions may be established. Determine an optimum strategy for the six-week period.

Data file

% file orderdata.m

    % Version of 4/5/94% Data organized for computation

    type = 3;states = 0:4;

    A   = [0   2   4 ...                   % Actions (padded)0   2  02 ...

           0   2  02 ...0  00  00 ...

           0  00  00];

    C   = [0 -300 -480 ...           % Order costs (padded)0 -300 -300 ...

           0 -300 -300 ...0    0    0 ...

           0    0    0];

    SP = 200;                                   % Selling priceBP = 40;                                     % Backorder penalty

    PD = 0.2*ones(1,5);               % Demand probabilities

Transition probabilities and gains

The procedure

% file reorder.m
% Version of 4/11/94% Calculates PA and GA for reorder policy
states = input('Enter row vector of states   ');A   = input('Enter row vector A of actions (padded)   ');
C   = input('Enter row vector C of order costs (padded)   ');D   = input('Enter row vector D of demand values   ');
PD = input('Enter row vector PD of demand probabilities     ');SP = input('Enter unit selling price SP   ');
BP = input('Enter backorder penalty cost BP   ');m   = length(states');
q   = length(A);na = q/m;
N   = length(D);S   = ones(na,1)*states;
S   = S(:)';[d,s] = meshgrid(D,S);a   = A'*ones(1,N);
ca = C'*ones(1,N);TA = (s + a - d).*(s + a - d>= 0);
for i = 1:qPA(i,:) = tdbn(states,TA(i,:),PD);
endPA
GA = ca + SP*d - (SP + BP)*(d -s -a).*(d>s+a)

<< Chapter < Page Page > Chapter >>

Read also:

Get Jobilize Job Search Mobile App in your pocket Now!

100% Free Mobile Applications
Receive real-time job alerts and never miss the right job again

Source: OpenStax, Topics in applied probability. OpenStax CNX. Sep 04, 2009 Download for free at http://cnx.org/content/col10964/1.2

Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Topics in applied probability' conversation and receive update notifications?

Ask

	21 AP 21 Lymphatic Immune System MCQ By OpenStax Start Quiz
©flickr:	The Minute Man Hose Load By Michael Pitt Start Quiz
	7 Arts Society: Theater 7 By Jonathan Long Start Quiz
	36 Biology 36 Sensory Systems MCQ By OpenStax Start Quiz
	23 AP Key Terms 23 The Digestive System By OpenStax Start Key Terms
	Business fundamentals By OpenStax Read Online Course
©flickr: Eduardo	Nursing Education NU589 Program Outcomes By Karen Gowdey Start Quiz
	10 Neuroanatomy 10 Corticospinal Tract By Stephen Voron Start Quiz
	14 AP 14 Brain Cranial Nerves Essay By OpenStax Start Flashcards
	27 Biology 27 Animal Diversity MCQ By OpenStax Start Quiz