Practical AI: Machine Learning, Data Science
a16z recently released a diagram showing the “Emerging Architectures for LLM Applications.” In this episode, we expand on the components in that diagram to build a more general mental model of the new AI app stack, covering everything from model “middleware” for caching and control to app orchestration.
Show Notes:
Emerging Architectures for LLM Applications
Timestamps:
(00:07) - Welcome to Practical AI
(00:43) - Deep dive into LLMs
(02:25) - Emerging LLM app stack
(04:35) - Playgrounds
(08:07) - App Hosting
(10:46) - Stack orchestration
(15:50) - Maintenance breakdown
(19:08) - Sponsor: Changelog News
(20:43) - Vector databases
(22:36) - Embedding models
(24:27) - Benchmarks and measurements
(26:59) - Data & poor architecture
(29:42) - LLM logging
(33:01) - Middleware Caching
(37:32) - Validation
(40:53) - Key takeaways
(42:36) - Closing thoughts
(44:23) - Outro