"OpenAI Develops CriticGPT Based on GPT-4 to Correct ChatGPT Errors; Helps Human Trainers Detect Mistakes 60% More Efficiently"

Finding GPT-4’s mistakes with GPT-4

June 27, 2024

CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF

We've trained a model, based on GPT-4, called CriticGPT to catch errors in ChatGPT's code output. We found that when people get help from CriticGPT to review ChatGPT code they outperform those without help 60% of the time. We are beginning the work to integrate CriticGPT-like models into our RLHF labeling pipeline, providing our trainers with explicit AI assistanc...