Edge 347: What is Constitutional AI?

Let's dive into the fine-tuning paradigm behind the Claude LLM.

Nov 28, 2023


A futuristic artificial intelligence language model, depicted as a sleek humanoid robot with a metallic, chrome-like finish, sitting at a classic wooden desk in a well-lit, modern library. The robot is portrayed with a focused expression, reading a large, antique copy of the constitution, with its finger pointing at a specific section, symbolizing careful consideration of the rules. The setting exudes an aura of wisdom and technology merging, with bookshelves filled with various books in the background. — Created Using DALL-E

💡 ML Concept of the Day: What is Constitutional AI?

In the previous issue of this series, we explored reinforcement learning from human feedback (RLHF) as one of the fundamental techniques for aligning LLMs with instructions. RLHF has proven to be highly effective but also hard to implement: it requires a sizable number of human annotators, and the process varies considerably from one model to another. An interesting alternative to RLHF, Constitutional AI, was proposed by Anthropic as the main technique powering its Claude model.

In the Constitutional AI approach, the AI training process encompasses two key phases:

TheSequence
