Today we’re joined by Jay Emery, director of technical sales & architecture at Microsoft Azure. In our conversation with Jay, we discuss the challenges faced by organizations when building LLM-based applications, and we explore some of the techniques they are using to overcome them. We dive into the concerns around security, data privacy, cost management, and performance as well as the ability and effectiveness of prompting to achieve the desired results versus fine-tuning, and when each approach should be applied. We cover methods such as prompt tuning and prompt chaining, prompt variance, fine-tuning, and RAG to enhance LLM output along with ways to speed up inference performance such as choosing the right model, parallelization, and provisioned throughput units (PTUs). In addition to that, Jay also shared several intriguing use cases describing how businesses use tools like Azure Machine Learning prompt flow and Azure ML AI Studio to tailor LLMs to their unique needs and processes.
The complete show notes for this episode can be found at twimlai.com/go/657.
Learning Transformer Programs with Dan Friedman - #667
AI Trends 2024: Machine Learning & Deep Learning with Thomas Dietterich - #666
AI Trends 2024: Computer Vision with Naila Murray - #665
Are Vector DBs the Future Data Platform for AI? with Ed Anuff - #664
Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663
Responsible AI in the Generative Era with Michael Kearns - #662
Edutainment for AI and AWS PartyRock with Mike Miller - #661
Data, Systems and ML for Visual Understanding with Cody Coleman - #660
Patterns and Middleware for LLM Applications with Kyle Roche - #659
AI Access and Inclusivity as a Technical Challenge with Prem Natarajan - #658
Visual Generative AI Ecosystem Challenges with Richard Zhang - #656
Deploying Edge and Embedded AI Systems with Heather Gorr - #655
AI Sentience, Agency and Catastrophic Risk with Yoshua Bengio - #654
Delivering AI Systems in Highly Regulated Environments with Miriam Friedel - #653
Mental Models for Advanced ChatGPT Prompting with Riley Goodside - #652
Multilingual LLMs and the Values Divide in AI with Sara Hooker - #651
Scaling Multi-Modal Generative AI with Luke Zettlemoyer - #650
Pushing Back on AI Hype with Alex Hanna - #649
Personalization for Text-to-Image Generative AI with Nataniel Ruiz - #648
Create your
podcast in
minutes
It is Free
20/20
The Dropout
Ten Percent Happier with Dan Harris
World News Tonight with David Muir
NEJM This Week