Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval augmented generation (RAG). We explore the trade-offs between relying solely on LLMs for retrieval tasks versus combining retrieval in vector databases and LLMs, the advantages and complexities of RAG with vector databases, the key considerations for building and deploying real-world RAG-based applications, and an in-depth look at Pinecone's new serverless offering. Currently in public preview, Pinecone Serverless is a vector database that enables on-demand data loading, flexible scaling, and cost-effective query processing. Ram discusses how the serverless paradigm impacts the vector database’s core architecture, key features, and other considerations. Lastly, Ram shares his perspective on the future of vector databases in helping enterprises deliver RAG systems.
The complete show notes for this episode can be found at twimlai.com/go/669.
Scaling Enterprise ML in 2020: Still Hard! with Sushil Thomas - #429
Enabling Clinical Automation: From Research to Deployment with Devin Singh - #428
Pixels to Concepts with Backpropagation w/ Roland Memisevic - #427
Fighting Global Health Disparities with AI w/ Jon Wang - #426
Accessibility and Computer Vision - #425
NLP for Equity Investing with Frank Zhao - #424
The Future of Education and AI with Salman Khan - #423
Why AI Innovation and Social Impact Go Hand in Hand with Milind Tambe - #422
What's Next for Fast.ai? w/ Jeremy Howard - #421
Feature Stores for MLOps with Mike del Balso - #420
Exploring Causality and Community with Suzana Ilić - #419
Decolonizing AI with Shakir Mohamed - #418
Spatial Analysis for Real-Time Video Processing with Adina Trufinescu
How Deep Learning has Revolutionized OCR with Cha Zhang - #416
Machine Learning for Food Delivery at Global Scale - #415
Open Source at Qualcomm AI Research with Jeff Gehlhaar and Zahra Koochak - #414
Visualizing Climate Impact with GANs w/ Sasha Luccioni - #413
ML-Powered Language Learning at Duolingo with Burr Settles - #412
Bridging The Gap Between Machine Learning and the Life Sciences with Artur Yakimovich - #411
Understanding Cultural Style Trends with Computer Vision w/ Kavita Bala - #410
Create your
podcast in
minutes
It is Free
20/20
The Dropout
Ten Percent Happier with Dan Harris
World News Tonight with David Muir
NEJM This Week