Improving Model Safety Behavior with Rule-Based Rewards
OpenAI Blog
July 24, 2024
We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collection.
Originally published on OpenAI Blog on 7/24/2024