Abstract: Policy gradient algorithms have been shown to converge to the optimal controller in a linear quadratic regulator (LQR) design problem. Calculating policy gradients using the true system such ...