Scaling Private RAG with Open-Source and Custom Models πŸš€

Scaling Private RAG with Open-Source and Custom Models πŸš€

Tags
https://www.linkedin.com/events/scalingprivateragwithopen-sourc7176316704879239169/

Register here:

Date & Time:Β April 4th, 2024 | 8.00 AM PST | 5.00 PM CET

In this session, Chaoyu Yang, Founder and CEO at BentoML, will cover the practical considerations of building private Retrieval-Augmented Generation (RAG) applications, utilizing a mix of open-source and custom models. πŸš€

Learn seamlessly chaining language models with various components, including text/multi-modal embedding, OCR pipelines, semantic chunking, classification models, and reranking models.

Topics that will be covered:

βœ… Discover the benefits of self-hosting open-source LLMs or embedding models for RAG.

βœ… Uncover effective strategies for selecting the ideal model for your specific requirements.

βœ… Gain insights into AI/ML techniques in advanced RAG stack, beyond the obvious LLM or embedding model.

βœ… Common best practices in optimizing inference performance for RAG.