Finding GPT-4’s mistakes with GPT-4

OpenAI Blog
June 27, 2024
CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during reinforcement learning from human feedback (RLHF).
Originally published on OpenAI Blog on 6/27/2024