AlphaGo introduce a new approach to computer Go that uses ‘value networks’ to evaluate board positions and ‘policy networks’ to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play.
Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. AlphaGo also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks.
After LAMP (Linux, Apache, MySQL, PHP) stack die in slow death, and over-hyped MEAN stack (MongoDB, ExpressJS, AngularJS, NodeJS) has became so yesterday, now new kid in the block is SMACK stack (Spark, Mesos, Akka, Cassandra, Kafka).