General Partner Anjney Midha explores the cutting-edge world of text-to-video AI with AI researchers Andreas Blattman and Robin Rombach.
Released in November, Stable Video Diffusion is their latest open-source generative video model, overcoming challenges in size and dynamic representation.
In this episode Robin and Andreas share why translating text to video is complex, the key role of datasets, current applications, and the future of video editing.
Topics Covered:
00:00 - Text to Video: The Next Leap in AI Generation
02:41 - The Stable Diffusion backstory
04:25 - Diffusion vs autoregressive models
06:09 - The benefits of single step sampling
09:15 - Why generative video?
11:19 - Understanding physics through AI video
12:20 - The challenge of creating generative video
15:36 - Data set selection and training
17:50 - Structural consistency and 3D objects
19:50 - Incorporating LoRAs
21:24 - How should creators think about these tools?
23:46 - Open challenges in video generation
25:42 - Infrastructure challenges and future research
Resources:
Find Robin on Twitter: https://twitter.com/robrombach
Find Andreas on Twitter: https://twitter.com/andi_blatt
Find Anjney on Twitter: https://twitter.com/anjneymidha
Stay Updated:
Find a16z on Twitter: https://twitter.com/a16z
Find a16z on LinkedIn: https://www.linkedin.com/company/a16z
Subscribe on your favorite podcast app: https://a16z.simplecast.com/
Follow our host: https://twitter.com/stephsmithio
Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.
The GenAI 100: The Apps that Stick
Finding a Single Source of AI Truth With Marty Chavez From Sixth Street
A Big Week in AI: GPT-4o & Gemini Find Their Voice
Remaking the UI for AI
How Discord Became a Developer Platform
Securing the Black Box: OpenAI, Anthropic, and GDM Discuss
Can AI Advance Science? DeepMind's VP of Science Weighs In
Next-Gen Gaming: AI Souls, Real-time Culture, Personalized Avatars
Why America Must Lead in AI Investment with Senator Young (R-IN)
Game On: Marc Andreessen & Andrew Chen Talk Creative Computers
Inside the Department of Defense and its Vision for the Future
Politics & the Future of Tech with Marc Andreessen and Ben Horowitz
The Real Price of Healthcare with Mark Cuban
Devoting Your Life to Reinventing a Broken System
Bringing AI to the Masses with Adam D’Angelo
A Nuclear Comeback: Are New Reactors the Answer?
Intelligence in the Age of AI with new CTO of the CIA
From Silicon Valley to the Pentagon: The Future of Defense Innovation
What is American Dynamism?
The Quest for True Signal: How Zynga Spotted Mobile
Create your
podcast in
minutes
It is Free
Stuff You Should Know
On Being with Krista Tippett
TED Radio Hour
Planet Money
The Dinner Party Download