Navigated to How Fal.ai Went From Inference Optimization to Hosting Image and Video Models

How Fal.ai Went From Inference Optimization to Hosting Image and Video Models

July 25
52 mins

View Transcript

Episode Description

Fal.ai, once focused on machine learning infrastructure, has evolved into a major player in generative media. In this episode of The New Stack Agents, hosts speak with Fal.ai CEO Burkay Gur and investor Glenn Solomon of Notable Capital. Originally aiming to optimize Python runtimes, Fal.ai shifted direction as generative AI exploded, driven by tools like DALL·E and ChatGPT. Today, Fal.ai hosts hundreds of models—from image to audio and video—and emphasizes fast, optimized inference to meet growing demand.

Speed became Fal.ai’s competitive edge, especially as newer generative models require GPU power not just for training but also for inference. Solomon noted that while optimization alone isn't a sustainable business model, Fal’s value lies in speed and developer experience. Fal.ai offers both an easy-to-use web interface and developer-focused APIs, appealing to both technical and non-technical users.

Gur also addressed generative AI’s impact on creatives, arguing that while the cost of creation has plummeted, the cost of creativity remains—and may even increase as content becomes easier to produce.

Learn more from The New Stack about AI’s impact on creatives:

AI Will Steal Developer Jobs (But Not How You Think) 

How AI Agents Will Change the Web for Users and Developers 

Join our community of newsletter subscribers to stay on top of the news and at the top of your game. 

See all episodes

Never lose your place, on any device

Create a free account to sync, back up, and get personal recommendations.