A reinforcement-learning approach for individual pitch control