Ops4AI: Optimizing RAG Architectures

Retrieval-Augmented Generation (RAG) helps enterprises customize off-the-shelf large language models (LLMs) with their own data. Watch our on-demand demo webinar to learn how our AI data center solution works with VAST Data storage to keep latency low and performance high for RAG inference workloads.

38:26

Learn how to optimize RAG performance

This session shows you how to deploy simple RAG network architectures and separate storage I/O traffic from user inference traffic on a shared physical network fabric while maintaining low latency and optimal performance.

Segment traffic with EVPN/VXLAN

Understand how EVPN/VXLAN separates storage I/O from regular inference traffic on the front-end fabric, improving network performance and reliability.

Leverage VAST Data storage

Discover how RAG architectures use VAST Data's ultra-low-latency, high-speed storage to optimize vector database performance.

Simplify network design

Crafting network designs for diverse use cases can be complex, but Juniper Apstra streamlines the process, enabling effortless deployment of GenAI RAG architectures.