Faulty reward functions in the wild

OpenAI Blog
December 21, 2016
Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.
Verticals
airesearch
Originally published on OpenAI Blog on 12/21/2016