AI Paper Review: Training Language Models to Follow Instructions with Human Feedback (InstructGPT)
InstructGPT is a system fine-tuned from GPT-3 that demonstrates how human feedback can transform a capable language model into a far more useful and aligned assistant.
Reason for selection: InstructGPT shows how human feedback can transform a capable language model into a far more useful and aligned assistant.



