Comparing Policy Gradient and Value Function Based Reinforcement Learning Methods in Simulated Electrical Power Trade | IEEE Journals & Magazine | IEEE Xplore