Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval augmented generation (RAG). We explore the trade-offs between relying solely on LLMs for retrieval tasks versus combining retrieval in vector databases and LLMs, the advantages and complexities of RAG with vector databases, the key considerations for building and deploying real-world RAG-based applications, and an in-depth look at Pinecone's new serverless offering. Currently in public preview, Pinecone Serverless is a vector database that enables on-demand data loading, flexible scaling, and cost-effective query processing. Ram discusses how the serverless paradigm impacts the vector database’s core architecture, key features, and other considerations. Lastly, Ram shares his perspective on the future of vector databases in helping enterprises deliver RAG systems.
The complete show notes for this episode can be found at twimlai.com/go/669.
AI for Ecology and Ecosystem Preservation with Bryan Carstens - #449
Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448
A Future of Work for the Invisible Workers in A.I. with Saiph Savage - #447
Trends in Graph Machine Learning with Michael Bronstein - #446
Trends in Natural Language Processing with Sameer Singh - #445
Trends in Computer Vision with Pavan Turaga - #444
Trends in Reinforcement Learning with Pablo Samuel Castro - #443
MOReL: Model-Based Offline Reinforcement Learning with Aravind Rajeswaran - #442
Machine Learning as a Software Engineering Enterprise with Charles Isbell - #441
Natural Graph Networks with Taco Cohen - #440
Productionizing Time-Series Workloads at Siemens Energy with Edgar Bahilo Rodriguez - #439
ML Feature Store at Intuit with Srivathsan Canchi - #438
re:Invent Roundup 2020 with Swami Sivasubramanian - #437
Predictive Disease Risk Modeling at 23andMe with Subarna Sinha - #436
Scaling Video AI at RTL with Daan Odijk - #435
Benchmarking ML with MLCommons w/ Peter Mattson - #434
Deep Learning for NLP: From the Trenches with Charlene Chambliss - #433
Feature Stores for Accelerating AI Development - #432
An Exploration of Coded Bias with Shalini Kantayya, Deb Raji and Meredith Broussard - #431
Common Sense as an Algorithmic Framework with Dileep George - #430
Create your
podcast in
minutes
It is Free
20/20
The Dropout
Ten Percent Happier with Dan Harris
World News Tonight with David Muir
NEJM This Week