TheSequence

TheSequence

Share this post

TheSequence
TheSequence
Edge 293: Instruction Following Language Models

Edge 293: Instruction Following Language Models

Instruction following LLs, OpenAI's InstructGPT and the Dust LLM framework.

May 23, 2023
∙ Paid
30

Share this post

TheSequence
TheSequence
Edge 293: Instruction Following Language Models
1
Share
Created Using Midjourney

In this Issue:

  1. The Concept: An overview of instruction following language models.

  2. The Research: A review of OpenAI’s InstructGPT paper.

  3. The Tech: Some details about the Dust LLM framework.

💡 ML Concept of the Day: Instruction Following Language Models   

In a previous installment of this series, we discussed some of the ideas behind reinforcement learning with human feedback(RLHF) as a core component of modern large language models(LLMs). Arguably, the most relevant case of RLHF is to train LLMs to follow instructions. The main research in this area was unveiled by OpenAI with a model called InstructGPT.

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Jesus Rodriguez
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share