Walkthrough

Setup

The quickstart guide which follows below assumes that you have installed R2R locally via pip install r2r and properly setup your environment, as described here.

If you would prefer to do the heavy lifting with Docker instead then refer to the Docker installation instructions and then proceed to the client-server cookbook.

Hello R2R

R2R supports configurable vector search and RAG right out of the box, as the example below shows:


from r2r import Document, GenerationConfig, R2R

app = R2R() # You may pass a custom configuration to `R2R`

app.ingest_documents(
    [
        Document(
            type="txt",
            data="John is a person that works at Google.",
            metadata={},
        )
    ]
)

rag_results = app.rag(
    "Who is john", GenerationConfig(model="gpt-3.5-turbo", temperature=0.0)
)
print(f"Search Results:\n{rag_results.search_results}")
print(f"Completion:\n{rag_results.completion}")

# RAG Results:
# Search Results:
# AggregateSearchResult(vector_search_results=[VectorSearchResult(id=2d71e689-0a0e-5491-a50b-4ecb9494c832, score=0.6848798582029441, metadata={'text': 'John is a person that works at Google.', 'version': 'v0', 'chunk_order': 0, 'document_id': 'ed76b6ee-dd80-5172-9263-919d493b439a', 'extraction_id': '1ba494d7-cb2f-5f0e-9f64-76c31da11381', 'associatedQuery': 'Who is john'})], kg_search_results=None)
# Completion:
# ChatCompletion(id='chatcmpl-9g0HnjGjyWDLADe7E2EvLWa35cMkB', choices=[Choice(finish_reason='stop', index=0, logprobs=None, message=ChatCompletionMessage(content='John is a person that works at Google [1].', role='assistant', function_call=None, tool_calls=None))], created=1719797903, model='gpt-3.5-turbo-0125', object='chat.completion', service_tier=None, system_fingerprint=None, usage=CompletionUsage(completion_tokens=11, prompt_tokens=145, total_tokens=156))

This guide will demonstrate document ingestion, management, search, and advanced RAG functionalities. For customization, refer to r2r/examples/quickstart.py.

To reconfigure the quickstart, create a local configuration file called my_config.json and override the defaults like shown:

my_config.json

{
    "vector_database": {
        "provider": "pgvector",
        "collection_name": "my_demo_vecs"
    }
}

When running any of the following commands, point to your custom config:

python -m r2r.examples.quickstart ingest_files --config_path=my_config.json --no-media=true

You can read more about configuration here.

Document Ingestion and Management

R2R efficiently handles diverse document types using PostgreSQL with pgvector, combining relational data management with vector search capabilities. This approach enables seamless ingestion, storage, and retrieval of multimodal data, while supporting flexible document management and user permissions. Expand below to dive deeper:

R2R offers a powerful data ingestion process that handles various file types including html, pdf, png, mp3, and txt. The ingestion pipeline parses, chunks, embeds, and stores documents efficiently with a fully asynchronous pipeline. To demonstrate this functionality:

r2r ingest --all-sample-files

This command initiates the ingestion process, producing output similar to:

r2r.base.providers.vector_db_provider - INFO - Initializing VectorDBProvider with config extra_fields={} provider='pgvector' collection_name='demo_vecs'. - 2024-06-19 15:38:13,151
...
{'results': ["File 'aristotle.txt' processed successfully.", ...]}

Key features of the ingestion process:

Unique document_id generation for each file
Metadata association, including user_id for document management
Efficient parsing, chunking, and embedding of diverse file types

R2R allows retrieval of high-level document information stored in a relational table within the PostgreSQL database. To fetch this information:

r2r documents-overview

This command returns document metadata, including:

[
    DocumentInfo(
        document_id=UUID('c9bdbac7-0ea3-5c9e-b590-018bd09b127b'),
        version='v0',
        size_in_bytes=73353,
        metadata={'title': 'aristotle.txt', 'user_id': '063edaf8-3e63-4cb9-a4d6-a855f36376c3'},
        title='aristotle.txt'
    ),
    ...
]

This overview provides quick access to document versions, sizes, and associated metadata, facilitating efficient document management.

R2R enables retrieval of specific document chunks and associated metadata. To fetch chunks for a particular document by id:

r2r document-chunks c9bdbac7-0ea3-5c9e-b590-018bd09b127b

This command returns detailed chunk information:

[
  {
    'text': 'Aristotle[A] (Greek: Ἀριστοτέλης Aristotélēs, pronounced [aristotélɛːs]; 384–322 BC) was an Ancient Greek philosopher and polymath. His writings cover a broad range of subjects spanning the natural sciences, philosophy, linguistics, economics, politics, psychology, and the arts. As the founder of the Peripatetic school of philosophy in the Lyceum in Athens, he began the wider Aristotelian tradition that followed, which set the groundwork for the development of modern science.', 
    'title': 'aristotle.txt',
    'user_id': '063edaf8-3e63-4cb9-a4d6-a855f36376c3', 
    'version': 'v0', 
    'chunk_order': 0, 
    'document_id': 'c9bdbac7-0ea3-5c9e-b590-018bd09b127b', 
    'extraction_id': 'aeba6400-1bd0-5ee9-8925-04732d675434',
    'fragment_id': 'f48bcdad-4155-52a4-8c9d-8ba06e996ba3',
  },
  ...
]

These features allow for granular access to document content.

R2R supports flexible document deletion based on various metadata fields. To delete a document by its ID:

r2r delete --keys=document_id --values=4a4fb848-fc03-5487-a7e5-33c9fdfb73cc

This command produces output similar to:

r2r.base.providers.vector_db_provider - INFO - Initializing VectorDBProvider with config extra_fields={} provider='pgvector' collection_name='demo_vecs'. - 2024-06-20 09:55:12,632
r2r.base.providers.embedding_provider - INFO - Initializing EmbeddingProvider with config extra_fields={'text_splitter': {'type': 'recursive_character', 'chunk_size': 512, 'chunk_overlap': 20}} provider='openai' base_model='text-embedding-3-small' base_dimension=512 rerank_model=None rerank_dimension=None rerank_transformer_type=None batch_size=128. - 2024-06-20 09:55:13,309
r2r.base.providers.llm_provider - INFO - Initializing LLM provider with config: extra_fields={} provider='litellm' - 2024-06-20 09:55:13,869
r2r.main.services.management_service - INFO - Deleting entries with metadata: document_id=4a4fb848-fc03-5487-a7e5-33c9fdfb73cc - 2024-06-20 09:55:14,104
{'results': "Documents ['4a4fb848-fc03-5487-a7e5-33c9fdfb73cc'] deleted successfully."}

Key features of the deletion process:

Deletion by document ID, extraction ID, or fragment ID, or other.
Cascading deletion of associated chunks and metadata
Confirmation of successful deletion

This flexible deletion mechanism ensures precise control over document management within the R2R system.

R2R provides robust document update capabilities through two main endpoints: update_documents and update_files. These endpoints allow for seamless updating of existing documents while maintaining version control.

Key features of the update process:

Automatic versioning: When updating a document, R2R automatically increments the version (e.g., from “v0” to “v1”).
Metadata preservation: The update process maintains existing metadata while allowing for updates.
Content replacement: The new document content completely replaces the old content in the order shown below
- Ingest the new version of the document
- Delete the old version

Executing the command below will update one of the sample documents ingested earlier.

python -m r2r.examples.quickstart update_files

Expected Output:

r2r.base.providers.vector_db_provider - INFO - Initializing VectorDBProvider with config extra_fields={} provider='pgvector' collection_name='demo_vecs_xyz_123_abc_456_789_111'. - 2024-06-20 11:10:07,774
r2r.base.providers.embedding_provider - INFO - Initializing EmbeddingProvider with config extra_fields={'text_splitter': {'type': 'recursive_character', 'chunk_size': 512, 'chunk_overlap': 20}} provider='openai' base_model='text-embedding-3-small' base_dimension=512 rerank_model=None rerank_dimension=None rerank_transformer_type=None batch_size=128. - 2024-06-20 11:10:08,668
r2r.base.providers.llm_provider - INFO - Initializing LLM provider with config: extra_fields={} provider='litellm' - 2024-06-20 11:10:09,241
r2r.main.services.ingestion_service - INFO - Processing file: /..../SciPhi/demo/R2R-kg/r2r/examples/data/aristotle_v2.txt - 2024-06-20 11:10:09,539
r2r.main.services.ingestion_service - INFO - File read successfully: /.../SciPhi/demo/R2R-kg/r2r/examples/data/aristotle_v2.txt - 2024-06-20 11:10:09,543
r2r.main.services.ingestion_service - INFO - Deleting documents which match on these keys and values: (['document_id', 'version'], ['c9bdbac7-0ea3-5c9e-b590-018bd09b127b', 'v0']) - 2024-06-20 11:10:10,600
Time taken to update files: 1.57 seconds
{'results': 'Files updated successfully.'}

Behind the scenes, this command utilizes the update_files endpoint. The process involves:

Reading the new file content
Incrementing the document version
Ingesting the new version with updated metadata
Deleting the old version of the document

For programmatic updates, you can use the RESTful API endpoint /update_files. This endpoint accepts a R2RUpdateFilesRequest, which includes:

files: List of UploadFile objects containing the new document content
document_ids: UUIDs of the documents to update
metadatas: Optional updated metadata for each document

The update process ensures data integrity and maintains a clear history of document changes through versioning.

R2R’s document ingestion and management system efficiently handles diverse file types, offering customizable parsing, chunking, and embedding processes. The flexible architecture allows for easy integration with existing workflows and supports custom extensions to meet specific project requirements. Moreover, the R2R system provides comprehensive document management, which you can read more about in the cookbook here.

AI Powered Search

R2R offers powerful search capabilities, including vector search, hybrid search, and knowledge graph-enhanced search. These features allow for more accurate and contextually relevant information retrieval.

To perform a basic vector search using RAG, execute the following command:

r2r search --query="What was Uber's profit in 2020?"

Example Output:

{'results': [
    {
        'id': UUID('37993d2c-b61a-58b4-9a89-f167d59b8633'),
        'score': 0.7662125334175588,
        'metadata': {
            'text': "Uber's profit in 2020 was a net loss of $6,768 million.",
            'title': 'uber_2021.pdf',
            'user_id': '063edaf8-3e63-4cb9-a4d6-a855f36376c3',
            'version': 'v0',
            'chunk_order': 15,
            'document_id': 'c996e617-88a4-5c65-ab1e-948344b18d27',
            'extraction_id': 'aeba6400-1bd0-5ee9-8925-04732d675434',
            'associatedQuery': "What was Uber's profit in 2020?"
        }
    },
    // ... more results
]}

This search uses vector embeddings to find the most relevant chunks of text from the ingested documents.

R2R supports hybrid search, which combines traditional keyword-based search with vector search for improved results. To perform a hybrid search:

r2r search --query="What was Uber's profit in 2020?" --do-hybrid-search

Example Output:

{'results': [
    {
        'id': UUID('316a6231-0b7a-50e6-9efb-dfd5e5e28188'),
        'score': 0.933503176778746,
        'metadata': {
            'text': 'Fierce nerds are highly competitive individuals who are socially awkward but extremely intelligent and confident in their abilities...',
            'title': 'pg_essay_2.html',
            'user_id': '063edaf8-3e63-4cb9-a4d6-a855f36376c3',
            'version': 'v0',
            'document_id': 'fab36a79-1d7a-5377-ba40-0c8d3e0a90d2',
            'extraction_id': '2c06a97b-91a9-53c0-bb3d-9831325fcffb',
            'associatedQuery': 'What is a fierce nerd?'
        }
    },
    // ... more results
]}

Hybrid search combines the strengths of keyword matching and semantic understanding, often resulting in more comprehensive and relevant results.

R2R integrates knowledge graph capabilities to enhance search results with structured relationships. To utilize knowledge graph search:

Knowledge Graphs are not constructed by default, refer to the cookbook here before attempting to run the command below!

r2r search --query="What was Uber's profit in 2020?" --use-kg-search

Example Output:

{'results': [
    {
        'id': 'Joe Gebbia',
        'relation': 'FOUNDED',
        'target': 'Airbnb'
    },
    {
        'id': 'Brian Chesky',
        'relation': 'FOUNDED',
        'target': 'Airbnb'
    },
    {
        'id': 'Nathan Blecharczyk',
        'relation': 'FOUNDED',
        'target': 'Airbnb'
    }
]}

Knowledge graph search provides structured information about entities and their relationships, complementing the text-based search results.

Behind the scenes, R2R’s RetrievalService handles these search requests. The search method accepts VectorSearchSettings and KGSearchSettings to customize the search behavior:

async def search(
    self,
    query: str,
    vector_search_settings: VectorSearchSettings = VectorSearchSettings(),
    kg_search_settings: KGSearchSettings = KGSearchSettings(),
):
    # ... implementation details ...

This flexible architecture allows for combining different search strategies to provide the most relevant and comprehensive results for your queries.

Retrieval-Augmented Generation (RAG)

R2R is built around a comprehensive Retrieval-Augmented Generation (RAG) engine, allowing you to generate contextually relevant responses based on your ingested documents. The RAG process combines the search functionality with language model generation to produce more accurate and informative answers.

To generate a response using RAG, use the following command:

r2r rag --query="What was Uber's profit in 2020?"

Example Output:

{'results': [
    ChatCompletion(
        id='chatcmpl-9RCB5xUbDuI1f0vPw3RUO7BWQImBN',
        choices=[
            Choice(
                finish_reason='stop',
                index=0,
                logprobs=None,
                message=ChatCompletionMessage(
                    content="Uber's profit in 2020 was a net loss of $6,768 million [10].",
                    role='assistant',
                    function_call=None,
                    tool_calls=None)
                )
            ],
        created=1716268695,
        model='gpt-3.5-turbo-0125',
        object='chat.completion',
        system_fingerprint=None,
        usage=CompletionUsage(completion_tokens=20, prompt_tokens=1470, total_tokens=1490)
    )
]}

This command performs a search on the ingested documents and uses the retrieved information to generate a response.

R2R also supports hybrid search in RAG, combining the power of vector search and keyword-based search. To use hybrid search in RAG, simply add the --do_hybrid_search flag:

r2r rag --query="Who is John Snow?" --do-hybrid-search

Example Output:

{'results': [
    ChatCompletion(
        id='chatcmpl-9cbRra4MNQGEQb3BDiFujvDXIehud',
        choices=[
            Choice(
                finish_reason='stop',
                index=0,
                logprobs=None,
                message=ChatCompletionMessage(
                    content="John Snow is mentioned in the context as one of Samwell (Sam) Tarly's closest companions at the Wall [5], [6].",
                    role='assistant',
                    function_call=None,
                    tool_calls=None)
                )
            ],
        created=1718987443,
        model='gpt-4o-2024-05-13',
        object='chat.completion',
        system_fingerprint=None,
        usage=CompletionUsage(completion_tokens=20, prompt_tokens=1192, total_tokens=1221)
    )
]}

This example demonstrates how hybrid search can enhance the RAG process by combining semantic understanding with keyword matching, potentially providing more accurate and comprehensive results.

R2R also supports streaming RAG responses, which can be useful for real-time applications. To use streaming RAG:

r2r rag --query="What was Lyft's profit in 2020?" --stream

Example Output:

r2r.main.r2r_config - INFO - Loading configuration from <YOUR_WORKDIR>/config.json - 2024-05-20 22:27:31,890
...
<search>["{\"id\":\"808c47c5-ebef-504a-a230-aa9ddcfbd87 .... </search>
<completion>Lyft reported a net loss of $1,752,857,000 in 2020 according to [2]. Therefore, Lyft did not make a profit in 2020.</completion>

Streaming allows the response to be generated and sent in real-time, chunk by chunk.

R2R offers extensive customization options for its Retrieval-Augmented Generation (RAG) functionality:

Search Settings: Customize vector and knowledge graph search parameters using VectorSearchSettings and KGSearchSettings.
Generation Config: Fine-tune the language model’s behavior with GenerationConfig, including:
- Temperature, top_p, top_k for controlling randomness
- Max tokens, model selection, and streaming options
- Advanced settings like beam search and sampling strategies
Multiple LLM Support: Easily switch between different language models and providers:
- OpenAI models (default)
- Anthropic’s Claude models
- Local models via Ollama
- Any provider supported by LiteLLM

Example of customizing the model:

r2r rag --query="who was aristotle?" --rag-model="claude-3-haiku-20240307" --stream --do-hybrid-search

This flexibility allows you to optimize RAG performance for your specific use case and leverage the strengths of various LLM providers.

Behind the scenes, R2R’s RetrievalService handles RAG requests, combining the power of vector search, optional knowledge graph integration, and language model generation. The flexible architecture allows for easy customization and extension of the RAG pipeline to meet diverse requirements.

User Management

R2R provides powerful user management capabilities, allowing you to track and manage documents on a per-user basis. This section covers key user management features.

To retrieve an overview for your users, run the following command:

streaming RAG responses, which can be useful for real-time applications. To use streaming RAG:

r2r users-overview

This command returns user-specific information, including the number of files, total size of documents, and associated document IDs:

[
    UserStats(
        user_id=UUID('063edaf8-3e63-4cb9-a4d6-a855f36376c3'),
        num_files=3,
        total_size_in_bytes=313137,
        document_ids=[UUID('c9bdbac7-0ea3-5c9e-b590-018bd09b127b'), ...]
    ),
    UserStats(
        user_id=UUID('45c3f5a8-bcbe-43b1-9b20-51c07fd79f14'),
        num_files=3,
        total_size_in_bytes=327742,
        document_ids=[UUID('4a4fb848-fc03-5487-a7e5-33c9fdfb73cc'), ...]
    ),
    ...
]

Note that the quickstart has ingested documents as different users in order to simulate a more rich environment.

To retrieve an overview for your users, run the following command:

r2r documents-overview --user-ids=063edaf8-3e63-4cb9-a4d6-a855f36376c3

This command returns user-specific information, including the number of files, total size of documents, and associated document IDs:

[
    UserStats(
        user_id=UUID('063edaf8-3e63-4cb9-a4d6-a855f36376c3'),
        num_files=3,
        total_size_in_bytes=313137,
        document_ids=[UUID('c9bdbac7-0ea3-5c9e-b590-018bd09b127b'), ...]
    ),
    UserStats(
        user_id=UUID('45c3f5a8-bcbe-43b1-9b20-51c07fd79f14'),
        num_files=3,
        total_size_in_bytes=327742,
        document_ids=[UUID('4a4fb848-fc03-5487-a7e5-33c9fdfb73cc'), ...]
    ),
    ...
]

Note that the quickstart has ingested documents as different users in order to simulate a more rich environment.

To search over documents associated with a specific user, you can use the search command with a user ID filter:

r2r search --query="Who was Aristotle?" --search-filters='{"user_id":"063edaf8-3e63-4cb9-a4d6-a855f36376c3"}'

This command will return search results only for the specified user ID. Here’s an example of the output:

Vector search results:
{'id': UUID('7ed3a01c-88dc-5a58-a68b-6e5d9f292df2'), 'score': 0.774126139387358, 'metadata': {'text': 'Aristotle[A] (Greek: Ἀριστοτέλης Aristotélēs, pronounced [aristotélɛːs]; 384–322 BC) was an Ancient Greek philosopher and polymath. His writings cover a broad range of subjects spanning the natural sciences, philosophy, linguistics, economics, politics, psychology, and the arts. As the founder of the Peripatetic school of philosophy in the Lyceum in Athens, he began the wider Aristotelian tradition that followed, which set the groundwork for the development of modern science.', 'title': 'aristotle.txt', 'user_id': '063edaf8-3e63-4cb9-a4d6-a855f36376c3', 'version': 'v0', 'chunk_order': 0, 'document_id': 'c9bdbac7-0ea3-5c9e-b590-018bd09b127b', 'fragment_id': '7ed3a01c-88dc-5a58-a68b-6e5d9f292df2', 'extraction_id': '472d6921-b4cd-5514-bf62-90b05c9102cb', 'associatedQuery': 'Who was Aristotle?'}}
...

You can use the same search_filters parameter with the rag command to filter RAG results by user ID:

python -m r2r.examples.quickstart rag --query="Who was Aristotle?" --search_filters="{'user_id':'063edaf8-3e63-4cb9-a4d6-a855f36376c3'}"

This allows for user-specific searches and RAG operations, ensuring that results are limited to the specified user’s documents.

To delete all documents associated with a specific user, use the delete command with the user_id as the key:

r2r delete --keys=user_id --values=063edaf8-3e63-4cb9-a4d6-a855f36376c3

This command will erase all ingested data for the specified user. Use with caution!

After deletion, you can confirm that all user documents have been removed by checking the documents_overview:

r2r documents-overview --user-ids=063edaf8-3e63-4cb9-a4d6-a855f36376c3

If the deletion was successful, this should return an empty result:

{'results': []}

This process allows for complete management of user data within the R2R system, including the ability to remove all traces of a user’s documents if necessary.

These user management features provide granular control over user data, allowing for efficient organization and management of documents on a per-user basis within the R2R system. Read more about these features in the document management cookbook here.

Observability and Analytics

R2R provides robust observability and analytics features, allowing you to monitor system performance, track usage patterns, and gain insights into your RAG application’s behavior.

R2R automatically logs various events and metrics during its operation. You can access these logs using the logs command:

r2r logs

This command returns detailed log entries for various operations, including search and RAG requests. Here’s an example of a log entry:

{
    'run_id': UUID('27f124ad-6f70-4641-89ab-f346dc9d1c2f'),
    'run_type': 'rag',
    'entries': [
        {'key': 'search_results', 'value': '["{\\"id\\":\\"7ed3a01c-88dc-5a58-a68b-6e5d9f292df2\\",...}"]'},
        {'key': 'search_query', 'value': 'Who is aristotle?'},
        {'key': 'rag_generation_latency', 'value': '3.79'},
        {'key': 'llm_response', 'value': 'Aristotle (Greek: Ἀριστοτέλης Aristotélēs; 384–322 BC) was...'}
    ]
}

These logs provide detailed information about each operation, including search results, queries, latencies, and LLM responses.

R2R offers an analytics feature that allows you to aggregate and analyze log data. You can use the analytics command to retrieve various statistics:

r2r analytics --filters '{"search_latencies": "search_latency"}' --analysis-types '{"search_latencies": ["basic_statistics", "search_latency"]}'

This command returns aggregated statistics based on the specified filters and analysis types. Here’s an example output:

{
    'results': {
        'filtered_logs': {
            'search_latencies': [
                {
                    'timestamp': '2024-06-20 21:29:06',
                    'log_id': UUID('0f28063c-8b87-4934-90dc-4cd84dda5f5c'),
                    'key': 'search_latency',
                    'value': '0.66',
                    'rn': 3
                }
            ]
        },
        'search_latencies': {
          'Mean': 0.66,
            'Median': 0.66,
            'Mode': 0.66,
            'Standard Deviation': 0,
            'Variance': 0
        }
    }
}

This analytics feature allows you to:

Filter logs based on specific criteria
Perform statistical analysis on various metrics (e.g., search latencies)
Track performance trends over time
Identify potential bottlenecks or areas for optimization

These observability and analytics features provide valuable insights into your R2R application’s performance and usage, enabling data-driven optimization and decision-making.

Command Execution and Output

Here are some example commands along with their outputs:

Quickstart GIF

Get Started

RAG Cookbooks

App Features

Deep Dive

Setup

Hello R2R

Document Ingestion and Management

AI Powered Search

Retrieval-Augmented Generation (RAG)

User Management

Observability and Analytics

Command Execution and Output

Get Started

RAG Cookbooks

App Features

Deep Dive

​Setup

​Hello R2R

​Document Ingestion and Management

​AI Powered Search

​Retrieval-Augmented Generation (RAG)

​User Management

​Observability and Analytics

​Command Execution and Output

Setup

Hello R2R

Document Ingestion and Management

AI Powered Search

Retrieval-Augmented Generation (RAG)

User Management

Observability and Analytics

Command Execution and Output