Learning to summarize with human feedback

OpenAI Blog
September 4, 2020
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.
Verticals
airesearch
Originally published on OpenAI Blog on 9/4/2020