Congratulations, Aja and David! Very remarkable win! Pasky,
It's too early to conclude any, I think, because no records of losing games have been published, ie., no weakpoints of AlphaGo are open. I believe that the essential problems which come from current MCTS (bottom-up) framework, such as solving complex semeai's and double-ko's, aren't solved yet. Hideki Petr Baudis: <20160309171109.gu12...@machine.or.cz>: > Hi! > >On Wed, Mar 09, 2016 at 04:43:23PM +0900, Hiroshi Yamashita wrote: >> AlphaGo won 1st game against Lee Sedol! > > Well, I have to eat my past words - of course, there are still four >games to go, but the first round does not look like a lucky win at all! > > Huge congratulations to the AlphaGo team, you have done truly amazing >work, with potential to spearhead a lot of further advances in AI in >general! It does seem to me that you must have made a lot of progress >since the Nature paper though - is that impression correct? > > Do you have some more surprising breakthroughs and techniques in store >for us, or was the progress mainly incremental, furthering the training >etc.? > > > By the way, there is a short snippet in the paper that maybe many >people overlooked (including me on the very first read!): > >> We introduce a new technique that caches all moves from the search >> tree and then plays similar moves during rollouts; a generalisation of >> the last good reply heuristic. At every step of the tree traversal, the >> most probable action is inserted into a hash table, along with the >> 3 × 3 pattern context (colour, liberty and stone counts) around both the >> previous move and the current move. At each step of the rollout, the >> pattern context is matched against the hash table; if a match is found >> then the stored move is played with high probability. > > This looks like it might overcome a lot of weaknesses re semeai etc., >enabling the coveted (by me) information flow from tree to playouts, if >you made this to work well (it's similar to my "liberty maps" attempts, >which always failed though - I tried to encode a larger context, which >maybe wasn't good idea). > > Would you say this improvement is important to AlphaGo's playing >strength (or its scaling), or merely a minor tweak? > > > Thanks, > >-- > Petr Baudis > If you have good ideas, good data and fast computers, > you can do almost anything. -- Geoffrey Hinton >_______________________________________________ >Computer-go mailing list >Computer-go@computer-go.org >http://computer-go.org/mailman/listinfo/computer-go -- Hideki Kato <mailto:hideki_ka...@ybb.ne.jp> _______________________________________________ Computer-go mailing list Computer-go@computer-go.org http://computer-go.org/mailman/listinfo/computer-go