Inside image generation’s Renaissance moment - Episode 19

May 14
29 mins

View Transcript

Episode Description

People are generating over 1.5 billion images a week in ChatGPT. In this episode, Product lead Adele Li and researcher Kenji Hata share some of the new use cases and trends since the launch of Images 2.0. Together with host Andrew Mayne, they trace the progress from the early DALL-E days and dive into the latest capabilities, including better text rendering, photorealism, multilingual support, world knowledge, aspect ratios, and character consistency. They also explore what comes next as image generation models evolve into more capable creative assistants.


Chapters

00:36 How Adele and Kenji came to work on Images

02:27 Images 2.0 launch reception

05:25 Productivity use cases and and 360 images

09:34: Viral trends, authenticity, and imperfection

10:51 Training breakthroughs and photorealism

14:06 Evals, prompting, and creative control

22:16 Creative agents and what comes next

22:27 Images + Codex

28:08 Prompt tips

Hosted on Acast. See acast.com/privacy for more information.

See all episodes