📃🪄🖼 Edge#204: Inside Imagen. Google’s Impressive Text-to-Image Alternative to OpenAI’s DALL-E 2
Imagen provides a simpler architecture able to generate photorealistic images from language inputs
On Thursdays, we dive deep into one of the freshest research papers or technology frameworks worth your attention. Our goal is to keep you up to date with new developments in AI and to complement the concepts we discuss in other editions of our newsletter.
💥 What’s New in AI: Inside Imagen. Google’s Impressive Text-to-Image Alternative to OpenAI’s DALL-E 2
Text-to-image (TTI) generation is one of the most innovative areas in multimodal learning today. Transformer architectures played a significant role in the rapid development of natural language understanding (NLU) and computer vision, and they have catalyzed research in the TTI space. In the last few months, OpenAI has made headlines by publishing two papers on its DALL-E model, which can generate photorealistic and artistic images from language inputs. Recently,