Parameter-Exploring Policy Gradients

Neural Networks - United Kingdom
doi 10.1016/j.neunet.2009.12.004