Utility function for calculating the posterior probability of each machine being "good" in
two armed bandit problem. Calculated result is based on observed win loss data, prior belief about
which machine is good and the probability of the good and bad machine paying out.