Reinforcement Learning from Scarce Experience via Policy Search pocketEngelska, 2008