TheSequence

TheSequence

Share this post

TheSequence
TheSequence
Edge 347: What is Constitutional AI?

Edge 347: What is Constitutional AI?

Lets dive into fine-tuning paradigm behind the Claude LLM.

Nov 28, 2023
∙ Paid
18

Share this post

TheSequence
TheSequence
Edge 347: What is Constitutional AI?
2
Share
A futuristic artificial intelligence language model, depicted as a sleek humanoid robot with a metallic, chrome-like finish, sitting at a classic wooden desk in a well-lit, modern library. The robot is portrayed with a focused expression, reading a large, antique copy of the constitution, with its finger pointing at a specific section, symbolizing careful consideration of the rules. The setting exudes an aura of wisdom and technology merging, with bookshelves filled with various books in the background.
Created Using DALL-E

💡 ML Concept of the Day: What is Constitutional AI?

In the previous issue of this series we explored the concept of reinforcement learning with human feedback(RLHF) as one of the fundamental techniques to fine-tune the instruction alignment problem with LLMs. RLHF has proven to be incredibly effective but also quite hard to implement as it requires a decent number of people involved and the process is largely inconsistent from one model to another. An interesting alternative to RLHF was proposed by Anthropic as the main technique powering its Claude model.

In the Constitutional AI approach, the AI training process encompasses two key phases:

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Jesus Rodriguez
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share