Next Week in The Sequence:
Edge 399: Our series about autonomous agents continues with an overview of external aid planning. We dive into IBM’s Simplan method for planning in LLMs and review the Langroid framework for autonomous agents.
Edge 340: A must read about AlphaFold 3 which expanded capabilities to predict many of the life’s molecules.
You can subscribe to The Sequence below:
📝 Editorial: Gen AI Unicorn Capitulation
The week in generative AI was dominated by the news from the Microsoft Build conference, where "copilots, copilots, copilots" seems to be the new "developers, developers, developers." We also had another OpenAI drama with allegations from Scarlett Johansson about her voice being copied. And yet, today’s editorial is about none of those topics ;)
Instead, I would like to focus on an emerging phenomenon we are seeing in the market related to the “capitulation” of many high-flying startups that raised billions in venture funding. Just this week, we saw news of Adept and Humane AI both looking for prospective buyers.
Adept was founded by some of the authors of the transformer paper with the promise of reimagining productivity agents using foundation models. Specifically, Adept leveraged a computer-vision-based model to learn from user interactions so they can be automated. The company delivered incredible IP in this area with the Fuyu models but seems to have issues scaling the business model.
Humane is another high-profile AI company that tackled the big problem of creating an AI-first consumer device. The idea that AI will power a new hardware form factor makes perfect sense, and Humane’s pin is a very clever design. However, Humane didn’t have a particularly successful launch and is facing the traditional scaling challenges of hardware companies.
Adept and Humane joined recent stories such as Inflection, which proves that building businesses relying on generalist AI models is capital intensive and hard to scale. These early capitulations are the result of a heavily funded market evolving at a lightning-fast speed, making it really hard to identify successful business models this early. Regardless of this outcome, Adept and Humane have brought a tremendous level of innovation to generative AI, and likely a lot of their products will continue to evolve in their next phase.
🔎 ML Research
Inside the Mind of an LLM
Anthropic published a research paper outlining major advancements in foundation model interpretability. The research highlights how to identify interpretable features in mega large models like Claude Sonnet —> Read more.
Key-Value in Cross Layer Attention
Researchers from MIT published a paper outlining an optimization technique for key-value(KV) caching. The method, called l Cross-Layer Attention (CLA), extends paradigms such as Multi-Query Attention (MQA) to share values between adjacent layers —> Read more.
Modular LLM
Microsoft Research published a paper detailing module LLM architecture based on a library of LoRA adapters. The paper also introduce Arrow, a zero-shot routing mechanism for selecting the right adapter for a given task —> Read more.
LANISTR
Google Research published a paper introducing LANguage, Image, and STRuctured data(LANISTR), a multimodal model that is able to learn from unstructured and structured data. LANISTR works with both text, image as well as time-series and tabular datasets —> Read more.
KV-Runahead
Apple Research published a paper proposing a technique for parallelizing key-value cache generation. The key principle of this technique is to minimize the time to first token which optimizes the computation efficiency of the attention layer —> Read more.
🤖 Cool AI Tech Releases
Phi-3
Microsoft’s Phi-3 family of models is now available on Microsoft Azure —> Read more.
More Microsoft Announcements
Microsoft has many AI announcements at its Build conference —> Read more.
Aya 23
Cohere released Aya 23, an 8 and 35 billion parameter LLM supporting 23 languages —> Read more.
PaliGemma
Google’s small language-vision model PaliGemma is now open source —> Read more.
Llama3-ChatQA
NVIDIA released a RAG optimized version of Llama3 —> Read more.
SynthID
Google DeepMind expanded SynthID’s capabilities to insert watermark in AI generated text, images and videos —> Read more.
🛠 Real World AI
Data Management at Meta
Meta engineering discusses the composable data infrastructure powering part of its AI infrastructure —> Read more.
📡AI Radar
This week’s OpenAI drama has to do with Scarlett Johansson.
Scale AI raised a monster $1 billion at a $13.8 billion valuation.
Speculation around Elon Musk’s xAI valuation continues and now its rumoured to be $24 billion.
Generative audio startup Suno raised $125 million.
Nvidia delivered another monster earnings report.
French AI startup H raised $220 million to build action models.
Ideogram released Tiles, a new feature for generating patterns in images.
Inflection AI put together a management team to focus on emotional intelligence for business bots.
Snowflake and Reka AI ended their acquisition conversations.
Ai Pin creator Humane AI is seeking for a strategic buyer.
Adept AI has also been looking for prospective buyers.
LMSys introduced a Hard Prompt category in Chatbot Arena.
EnCharge AI, an AI chip company linked to the US Department of Defense, is raising $70 million.
Orca AI, an autonomous ship company, raised $23 million Series A.