Edge 293: Instruction Following Language Models
Instruction following LLs, OpenAI's InstructGPT and the Dust LLM framework.
In this Issue:
The Concept: An overview of instruction following language models.
The Research: A review of OpenAI’s InstructGPT paper.
The Tech: Some details about the Dust LLM framework.
💡 ML Concept of the Day: Instruction Following Language Models
In a previous installment of this series, we discussed some of the ideas behind reinforcement learning with human feedback(RLHF) as a core component of modern large language models(LLMs). Arguably, the most relevant case of RLHF is to train LLMs to follow instructions. The main research in this area was unveiled by OpenAI with a model called InstructGPT.