Reinforcement Learning A Tic Tac Toe Example