Re: [Computer-go] MC in games of imperfect information

Álvaro Begué Thu, 20 May 2010 05:45:42 -0700

On Thu, May 20, 2010 at 7:57 AM, Isaac Deutsch <i...@gmx.ch> wrote:
> [...]
>
> All clear up to here. :) I guess the "P(raise | two_aces)" bit could be
> extracted from professional games, for example?


That would be a very good way to do it, yes. But this is part of how
you create the quick mapping from game situations to probability
distributions over actions, and I didn't get into how to do that.

> Is it possible to update these probabilities incrementally? So, in the
> example, you start with a uniform probability. Then, you see that
> the opponent raises, and you update the probability for the hand "two aces"
> to 0.04. Let's say he does another action that makes two aces more
> (or less) likely, can we use P(two_aces) = 0.04 as a starting point? Does it
> matter if the 2 actions correlate or not?

You can simply take the P(two_aces) = 0.04 as your prior when you find
the second action. This is part of the beauty of Bayes' theorem.

> I see, this scaling method seems easier indeed. But, in tichu, everyone has
> 14 cards at the beginning. So, it is impossible to do this for all possible
> hands. How should this problem be tackled? It seems beneficial to simulate
> the most likely hands first.

Correct: This is not feasible in Tichu the way I described it.
However, you don't have to compute those probabilities in advance.
Instead, deal the cards first and then revisit the history of the game
to figure out the correct weight for this hand.

Alternatively, you could have a pool of hands whose probabilities you
keep track of, and perhaps you generate new ones as they become
impossible (because someone plays a card they wouldn't have, for
example). The tricky part is making sure that you don't introduce
biases doing this. I wouldn't try this at first.

> > Notice that P(raise) = Sum_over_all_possible_hands ( P(raise | hand) ).
>
> Again, we cannot calculate all possible hands most of the time. Should we
> say that all hands except the most likely ones should be neglected because
> their weight would be minimal anyway?

This is potentially a more serious obstacle, I am not sure. Perhaps
you can skip the normalization step, so the weights you'll get are not
really probabilities anymore, but this shouldn't be a problem because
all of them have the same P(action) factors missing. You may have
problems with underflow in long games, but I would deal with those
when you find them.

Álvaro.
_______________________________________________
Computer-go mailing list
Computer-go@dvandva.org
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Re: [Computer-go] MC in games of imperfect information

Reply via email to