Learning to reason with LLMs
OpenAI Blog
September 12, 2024
We are introducing OpenAI o1, a new large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers—it can produce a long internal chain of thought before responding to the user.
Verticals
airesearch
Originally published on OpenAI Blog on 9/12/2024