Learning to summarize with human feedback
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.
OpenAI
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.
OpenAI