Welcome to R2R



R2R, short for RAG to Riches, provides the fastest and most efficient way to deliver high-quality Retrieval-Augmented Generation (RAG) to end users. The framework is built around customizable pipelines and a feature-rich FastAPI implementation.

Key Features

  • 🔧 Build: Use the framework to build arbitrary asynchronous pipelines.
  • 🚀 Deploy: Instantly launch production-ready asynchronous RAG pipelines with streaming capabilities.
  • 🧩 Customize: Tailor your multimodal pipeline with intuitive configuration files.
  • 🔌 Extend: Enhance your pipeline with custom code integrations.
  • 🤖 OSS: Benefit from a framework developed by the open-source community, designed to simplify RAG deployment.

Why did we build this framework?

R2R was conceived to bridge the gap between local LLM experimentation and scalable production solutions. It is built with observability and customization in mind, ensuring that users can seamlessly transition from development to deployment.


The R2R Demo provides a step by step outline to run the default R2R Retrieval-Augmented Generation (RAG) pipeline. The demo ingests the provided documents and then illustrates search and RAG functionality.

Getting Started

To get started with R2R, we recommend setting up the framework and following an initial example.


Join our Discord server (opens in a new tab) to get support and connect with both the R2R team and other developers in the community. Whether you're encountering issues, looking for advice on best practices, or just want to share your experiences, we're here to help.