Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval-augmented generation (RAG). We explore the trade-offs between relying solely on LLMs for retrieval tasks and combining vector database retrieval with LLMs, the advantages and complexities of RAG with vector databases, the key considerations for building and deploying real-world RAG-based applications, and Pinecone’s new serverless offering, which we look at in depth. Currently in public preview, Pinecone Serverless is a vector database that enables on-demand data loading, flexible scaling, and cost-effective query processing. Ram discusses how the serverless paradigm shapes the vector database’s core architecture and key features, along with other design considerations. Lastly, Ram shares his perspective on the future of vector databases in helping enterprises deliver RAG systems.
The complete show notes for this episode can be found at twimlai.com/go/669.
Optimization, Machine Learning and Intelligent Experimentation with Michael McCourt - #545
Jupyter and the Evolution of ML Tooling with Brian Granger - #544
Creating a Data-Driven Culture at ADP with Jack Berkowitz - #543
re:Invent Roundup 2021 with Bratin Saha - #542
Multi-modal Deep Learning for Complex Document Understanding with Doug Burdick - #541
Predictive Maintenance Using Deep Learning and Reliability Engineering with Shayan Mortazavi - #540
Building a Deep Tech Startup in NLP with Nasrin Mostafazadeh - #539
Models for Human-Robot Collaboration with Julie Shah - #538
Four Key Tools for Robust Enterprise NLP with Yunyao Li - #537
Machine Learning at GSK with Kim Branson - #536
The Benefit of Bottlenecks in Evolving Artificial Intelligence with David Ha - #535
Facebook Abandons Facial Recognition. Should Everyone Else Follow Suit? With Luke Stark - #534
Building Blocks of Machine Learning at LEGO with Francesc Joan Riera - #533
Exploring the FastAI Tooling Ecosystem with Hamel Husain - #532
Multi-task Learning for Melanoma Detection with Julianna Ianni - #531
House Hunters: Machine Learning at Redfin with Akshat Kaul - #530
Attacking Malware with Adversarial Machine Learning, w/ Edward Raff - #529
Learning to Ponder: Memory in Deep Neural Networks with Andrea Banino - #528
Advancing Deep Reinforcement Learning with NetHack, w/ Tim Rocktäschel - #527
Building Technical Communities at Stack Overflow with Prashanth Chandrasekar - #526