COMP3270 – Problem 1 and 2 are finished. Problem 3 is doable but I ran out of time. (Solution)

$ 15.99

Category: COMP3270

Description
Reviews (0)

Description

In problem 1, I used a random choice function to do policy evaluation.
In problem 2, I used a policy evaluation function such that it has noise probability to go to the intended direction.
In problem 3, just need to choose the direction that returns maximum rewards. I spent 10 hours in total.

Reviews

There are no reviews yet.

Be the first to review “COMP3270 – Problem 1 and 2 are finished. Problem 3 is doable but I ran out of time. (Solution)”

COMP3270 – Problem 1 and 2 are finished. Problem 3 is doable but I ran out of time. (Solution)

Description

Reviews

Related products

COMP3270 – Objectives of this assignment: Solved

COMP3270 – Objectives of this assignment: (Solution)

COMP3270 – (Solution)

COMP3270 – Objectives of this assignment: (Solution)

COMP3270 – Sarah Pham (slp0042) (Solution)