Mathematics, 14.02.2020 17:12 ldestl

In a coin game, you repeatedly toss a biased coin (probability 0.4 for heads, 0.6 for tails). Each head is worth 3 points and each tail is worth 1 point. If your total is no more than 7 points, you can either Toss or Stop; otherwise, you must Stop. When you Stop, your utility equals your total points (up to 7), or 0 if your total is 8 points or higher. When you Toss, you receive no utility. There is no discounting.
a. What are the states and the actions for this MDP? Which states are terminal?
b. What are the transition function and the reward function for this MDP? Hint: The problem may be simpler to formulate using the general version of rewards: R(s, a, s')
c. Run value iteration to find the optimal value function V* for the MDP. Show each V_k step (starting from V_0(s) = 0 for all states s). For a reasonable MDP formulation, this should converge in fewer than 10 steps. If you find it too tedious to do by hand, you may write a program to do this for you; however, there may be some benefit in seeing the calculation unfold in front of you.
d. Using the V* you found, determine the optimal policy for this MDP.
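
For anyone who takes up the suggestion in part (c) to write a program, here is a minimal sketch of one way to set it up. It is only one possible formulation, not the official one. It assumes the states are the totals 0 to 7, plus a lumped "bust" state for any total of 8 or more (where only Stop is allowed) and a terminal "done" state reached by Stop. All names (BUST, DONE, transitions, q_value, value_iteration, greedy_policy) are made up for illustration.

```python
# Sketch of value iteration for the coin game (assumed formulation, see above).
P_HEAD, P_TAIL = 0.4, 0.6          # coin bias given in the question
HEAD_POINTS, TAIL_POINTS = 3, 1    # points per head / tail
MAX_TOTAL = 7                      # highest total that still pays out
BUST, DONE = "bust", "done"        # hypothetical labels for the extra states

STATES = list(range(MAX_TOTAL + 1)) + [BUST, DONE]


def actions(s):
    """Actions available in state s."""
    if s == DONE:
        return []                  # terminal state: nothing to do
    if s == BUST:
        return ["Stop"]            # total of 8 or more: must Stop
    return ["Toss", "Stop"]


def transitions(s, a):
    """Return a list of (probability, next_state, reward) for T and R(s, a, s')."""
    if a == "Stop":
        reward = 0 if s == BUST else s
        return [(1.0, DONE, reward)]

    def land(total):
        return total if total <= MAX_TOTAL else BUST

    return [(P_HEAD, land(s + HEAD_POINTS), 0),
            (P_TAIL, land(s + TAIL_POINTS), 0)]


def q_value(V, s, a):
    """Expected reward plus next-state value (no discounting)."""
    return sum(p * (r + V[s2]) for p, s2, r in transitions(s, a))


def value_iteration(tol=1e-12, max_iters=100):
    """Return [V_0, V_1, ...], stopping once an update changes nothing."""
    V = {s: 0.0 for s in STATES}
    history = [V]
    for _ in range(max_iters):
        new_V = {s: max((q_value(V, s, a) for a in actions(s)), default=0.0)
                 for s in STATES}
        history.append(new_V)
        if all(abs(new_V[s] - V[s]) <= tol for s in STATES):
            break
        V = new_V
    return history


def greedy_policy(V):
    """Pick the action with the highest Q-value in every non-terminal state."""
    return {s: max(actions(s), key=lambda a: q_value(V, s, a))
            for s in STATES if actions(s)}


if __name__ == "__main__":
    history = value_iteration()
    for k, V in enumerate(history):
        print(k, [round(V[s], 5) for s in range(MAX_TOTAL + 1)])
    print(greedy_policy(history[-1]))
```

Running the script prints one row per V_k for the totals 0 to 7, followed by the greedy policy for part (d). If you formulate the states differently (for example, keeping totals 8, 9 and 10 as separate states), the intermediate rows may look different, so treat this as a check on your hand calculation rather than a replacement for it.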

