>>642
方策の学習
to maximise the similarity of the neural network move probabilities p to the search probabilities π

論文読めば?