Virtually Speaking Podcast

https://www.vspeakingpodcast.com/feed.xml

The Virtually Speaking Podcast is a weekly technical podcast dedicated to discussing VMware topics related to private and hybrid cloud. Each week Pete Flecha and John Nicholson bring in various subject matter experts from VMware, recently acquired by Broadcom, and within the industry to discuss their respective areas of expertise.

Episodes

Nov 25, 2024

Exploring RAG Pipelines with Private AI Foundation and NVIDIA

Nov 25, 2024

19 min

In this episode of the Virtually Speaking Podcast, we delve into the world of AI with Justin Murray, Product Marketing Engineer, and Frank Denneman, Chief Technologist for AI at Broadcom. We discuss retrieval augmented generation (RAG), a powerful approach that combines large language models with real-time, trusted data. Learn how RAG pipelines can be architected using Private AI Foundation with NVIDIA, including insights into key components like LLMs, NVIDIA Inference Microservices, and Vector DB. We also explore best practices for GPU sizing and when to use fractional or multiple GPUs for optimal performance. Join us for this fascinating conversation!

Comment (0)

No comments yet. Be the first to say something!