Walkthrough

A detailed step-by-step cookbook of the core features provided by R2R.

This guide shows how to use R2R to:

  1. Ingest files into R2R
  2. Search over ingested files
  3. Use your data as input to RAG (Retrieval-Augmented Generation)
  4. Extract entities and relationships from your data to create a graph
  5. Perform basic user authentication
  6. Observe and analyze an R2R deployment

Introduction

R2R is an engine for building user-facing Retrieval-Augmented Generation (RAG) applications. At its core, R2R provides this service through an architecture of providers, services, and an integrated RESTful API. This cookbook provides a detailed walkthrough of how to interact with R2R. Refer here for a deeper dive on the R2R system architecture.

Hello R2R

R2R gives developers configurable vector search and RAG right out of the box. The minimal example below shows the end-to-end flow through the Python client used throughout the docs:

core/examples/hello_r2r.py

from r2r import R2RClient

client = R2RClient()  # optional, pass in "http://localhost:7272" or "https://api.cloud.sciphi.ai"

# Create a small sample file to ingest
with open("test.txt", "w") as file:
    file.write("John is a person that works at Google.")

client.documents.create(file_path="test.txt")

# Call RAG directly
rag_response = client.retrieval.rag(
    query="Who is John?",
    rag_generation_config={"model": "openai/gpt-4o-mini", "temperature": 0.0},
)
results = rag_response.results

print(f"Search Results:\n{results.search_results}")
# AggregateSearchResult(chunk_search_results=[ChunkSearchResult(score=0.685, text=John is a person that works at Google.)], graph_search_results=[], web_search_results=[], context_document_results=[])

print(f"Completion:\n{results.completion}")
# John is a person that works at Google [1].

Document Ingestion and Management

R2R efficiently handles diverse document types using Postgres with pgvector, combining relational data management with vector search capabilities. This approach enables seamless ingestion, storage, and retrieval of multimodal data, while supporting flexible document management and user permissions.

Key features include:

  • A unique Document, with a corresponding id, is created for each ingested file or context; it contains the downstream Chunks and Entities & Relationships.
  • User and Collection objects for comprehensive document permissions.
  • Graph construction and maintenance.
  • Flexible document deletion and update mechanisms at both the document and chunk levels.

Note: all document-related commands are gated to documents the user has uploaded or can access through shared collections, with the exception of superusers.

R2R offers a powerful data ingestion process that handles various file types including html, pdf, png, mp3, and txt.

The ingestion process parses, chunks, embeds, and stores documents efficiently. A durable orchestration workflow coordinates the entire process.

# export R2R_API_KEY=...
from r2r import R2RClient

client = R2RClient()  # or set base_url=...
# when using auth, log in first with client.users.login(...)

client.documents.create_sample(hi_res=True)
# to ingest your own document: client.documents.create(file_path="/path/to/file")

This command initiates the ingestion process, producing output similar to:

IngestionResponse(message='Document created and ingested successfully.', task_id=None, document_id=UUID('e43864f5-a36f-548e-aacd-6f8d48b30c7f'))

Key features of the ingestion process:

  1. Unique document_id generation for each file
  2. Metadata association, including user_id and collection_ids for document management
  3. Efficient parsing, chunking, and embedding of diverse file types
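
Custom metadata can be attached at ingestion time. A minimal sketch (the metadata keys shown here are illustrative):

# attach illustrative metadata to a document at ingestion time
client.documents.create(
    file_path="DeepSeek_R1.pdf",
    metadata={"title": "DeepSeek_R1.pdf", "version": "v0"},
)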

R2R allows retrieval of high-level document information stored in a relational table within the Postgres database. To fetch this information:

result = client.documents.list(
    limit=10,
    offset=0,
)

This command returns document metadata, including:

results=[
  DocumentResponse(
    id=UUID('e43864f5-a36f-548e-aacd-6f8d48b30c7f'),
    collection_ids=[UUID('122fdf6a-e116-546b-a8f6-e4cb2e2c0a09')],
    owner_id=UUID('2acb499e-8428-543b-bd85-0d9098718220'),
    document_type=<DocumentType.PDF: 'pdf'>,
    metadata={'title': 'DeepSeek_R1.pdf', 'version': 'v0'},
    version='v0',
    size_in_bytes=1768572,
    ingestion_status=<IngestionStatus.SUCCESS: 'success'>,
    extraction_status=<GraphExtractionStatus.PENDING: 'pending'>,
    created_at=datetime.datetime(2025, 2, 8, 3, 31, 39, 126759, tzinfo=TzInfo(UTC)),
    updated_at=datetime.datetime(2025, 2, 8, 3, 31, 39, 160114, tzinfo=TzInfo(UTC)),
    ingestion_attempt_number=None,
    summary="The document contains a comprehensive overview of DeepSeek-R1, a series of reasoning models developed by DeepSeek-AI, which includes DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero utilizes large-scale reinforcement learning (RL) without supervised fine-tuning, showcasing impressive reasoning capabilities but facing challenges like readability and language mixing. To enhance performance, DeepSeek-R1 incorporates multi-stage training and cold-start data, achieving results comparable to OpenAI's models on various reasoning tasks. The document details the models' training processes, evaluation results across multiple benchmarks, and the introduction of distilled models that maintain reasoning capabilities while being smaller and more efficient. It also discusses the limitations of current models, such as language mixing and sensitivity to prompts, and outlines future research directions to improve general capabilities and efficiency in software engineering tasks. The findings emphasize the potential of RL in developing reasoning abilities in large language models and the effectiveness of distillation techniques for smaller models.",
    summary_embedding=None,
    total_tokens=29673
  ), ...
] total_entries=1

This overview provides quick access to document versions, sizes, and associated metadata, facilitating efficient document management.
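
The limit and offset parameters make it straightforward to page through a larger corpus. A minimal sketch, assuming the paginated response exposes results and total_entries as shown above:

# page through all documents ten at a time
offset, page_size = 0, 10
while True:
    page = client.documents.list(limit=page_size, offset=offset)
    for doc in page.results:
        print(doc.id, doc.ingestion_status)
    if offset + page_size >= page.total_entries:
        break
    offset += page_size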

R2R enables retrieval of specific document chunks and associated metadata. To fetch chunks for a particular document by id:

client.documents.list_chunks(id="9fbe403b-c11c-5aae-8ade-ef22980c3ad1")

This command returns detailed chunk information:

results=[
  ChunkResponse(
    id=UUID('27a2e605-2916-59fe-a4da-b19853713298'),
    document_id=UUID('30f950f0-c692-57c5-b6ec-ff78ccf5ccdc'),
    owner_id=UUID('2acb499e-8428-543b-bd85-0d9098718220'),
    collection_ids=[UUID('122fdf6a-e116-546b-a8f6-e4cb2e2c0a09')],
    text='John is a person that works at Google.',
    metadata={'version': 'v0', 'chunk_order': 0, 'document_type': 'txt'},
    vector=None
  )
] total_entries=1

These features allow for granular access to document content.

R2R supports flexible document deletion through a method that can run arbitrary deletion filters. To delete a document by its ID:

client.documents.delete(id="9fbe403b-c11c-5aae-8ade-ef22980c3ad1")

This command produces output similar to:

GenericBooleanResponse(success=True)

Key features of the deletion process:

  1. Deletion by document ID
  2. Cascading deletion of associated chunks and metadata
  3. Deletion by filter, e.g. by text match or user id match, via the documents/by-filter endpoint (see the sketch below)

This flexible deletion mechanism ensures precise control over document management within the R2R system.
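
A hedged sketch of filter-based deletion: the method name delete_by_filter is assumed here as the SDK wrapper for the documents/by-filter endpoint, and the filter syntax follows the $-operator style used in search filters:

# hypothetical SDK wrapper for the documents/by-filter endpoint;
# verify the exact method name against your SDK version
client.documents.delete_by_filter(
    filters={"title": {"$eq": "DeepSeek_R1.pdf"}}
)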

For more advanced document management techniques and user authentication details, refer to the user documentation.

R2R offers powerful and highly configurable search capabilities, including vector search, hybrid search, and knowledge graph-enhanced search. These features allow for more accurate and contextually relevant information retrieval.

Vector search parameters inside of R2R can be fine-tuned at runtime for optimal results. Here’s how to perform a basic vector search:

client.retrieval.search(
    query="What is DeepSeek R1?",
)

AggregateSearchResult(
  chunk_search_results=[
    ChunkSearchResult(
      score=0.643,
      text="Document Title: DeepSeek_R1.pdf
        Text: could achieve an accuracy of over 70%.
        DeepSeek-R1 also delivers impressive results on IF-Eval, a benchmark designed to assess a
        models ability to follow format instructions. These improvements can be linked to the inclusion
        of instruction-following data during the final stages of supervised fine-tuning (SFT) and RL
        training. Furthermore, remarkable performance is observed on AlpacaEval2.0 and ArenaHard,
        indicating DeepSeek-R1s strengths in writing tasks and open-domain question answering. Its
        significant outperformance of DeepSeek-V3 underscores the generalization benefits of large-scale
        RL, which not only boosts reasoning capabilities but also improves performance across diverse
        domains. Moreover, the summary lengths generated by DeepSeek-R1 are concise, with an
        average of 689 tokens on ArenaHard and 2,218 characters on AlpacaEval 2.0. This indicates that
        DeepSeek-R1 avoids introducing length bias during GPT-based evaluations, further solidifying
        its robustness across multiple tasks."
    ), ...
  ],
  graph_search_results=[],
  web_search_results=[],
  context_document_results=[]
)

Key configurable parameters for vector search can be inferred from the retrieval API reference.
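
For example, the number of results and metadata filters can be set per request. A minimal sketch; see the API reference for the full list of supported settings:

client.retrieval.search(
    query="What is DeepSeek R1?",
    search_settings={
        "limit": 25,  # number of chunks to return
        "filters": {"title": {"$in": ["DeepSeek_R1.pdf"]}},
    },
)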

R2R supports hybrid search, which combines traditional keyword-based search with vector search for improved results. Here’s how to perform a hybrid search:

client.retrieval.search(
    "What was Uber's profit in 2020?",
    search_settings={
        "index_measure": "l2_distance",
        "use_hybrid_search": True,
        "hybrid_settings": {
            "full_text_weight": 1.0,
            "semantic_weight": 5.0,
            "full_text_limit": 200,
            "rrf_k": 50,
        },
        "filters": {"title": {"$in": ["DeepSeek_R1.pdf"]}},
    },
)
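
The hybrid settings above suggest a weighted reciprocal rank fusion (RRF) of the full-text and semantic rankings. A minimal sketch of that scoring, assuming the common weighted-RRF formulation implied by full_text_weight, semantic_weight, and rrf_k:

def weighted_rrf_score(
    semantic_rank: int,
    full_text_rank: int,
    semantic_weight: float = 5.0,
    full_text_weight: float = 1.0,
    rrf_k: int = 50,
) -> float:
    """Each ranking contributes weight / (rrf_k + rank), so documents that
    rank highly in either list rise to the top of the fused results."""
    return (semantic_weight / (rrf_k + semantic_rank)
            + full_text_weight / (rrf_k + full_text_rank))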

AI Retrieval (RAG)

R2R is built around a comprehensive Retrieval-Augmented Generation (RAG) engine, allowing you to generate contextually relevant responses based on your ingested documents. The RAG process combines all the search functionality shown above with Large Language Models to produce more accurate and informative answers.

To generate a response using RAG, use the following command:

client.retrieval.rag(query="What is DeepSeek R1?")

Example Output:

RAGResponse(
  generated_answer='DeepSeek-R1 is a model that demonstrates impressive performance across various tasks, leveraging reinforcement learning (RL) and supervised fine-tuning (SFT) to enhance its capabilities. It excels in writing tasks, open-domain question answering, and benchmarks like IF-Eval, AlpacaEval2.0, and ArenaHard [1], [2]. DeepSeek-R1 outperforms its predecessor, DeepSeek-V3, in several areas, showcasing its strengths in reasoning and generalization across diverse domains [1]. It also achieves competitive results on factual benchmarks like SimpleQA, although it performs worse on the Chinese SimpleQA benchmark due to safety RL constraints [2]. Additionally, DeepSeek-R1 is involved in distillation processes to transfer its reasoning capabilities to smaller models, which perform exceptionally well on benchmarks [4], [6]. The model is optimized for English and Chinese, with plans to address language mixing issues in future updates [8].',
  search_results=AggregateSearchResult(
    chunk_search_results=[ChunkSearchResult(score=0.643, text=Document Title: DeepSeek_R1.pdf ...)]
  ),
  citations=[Citation(index=1, rawIndex=1, startIndex=305, endIndex=308, snippetStartIndex=288, snippetEndIndex=315, sourceType='chunk', id='e760bb76-1c6e-52eb-910d-0ce5b567011b', document_id='e43864f5-a36f-548e-aacd-6f8d48b30c7f', owner_id='2acb499e-8428-543b-bd85-0d9098718220', collection_ids=['122fdf6a-e116-546b-a8f6-e4cb2e2c0a09'], score=0.6433466439465674, text='Document Title: DeepSeek_R1.pdf\n\nText: could achieve an accuracy of over 70%.\nDeepSeek-R1 also delivers impressive results on IF-Eval, a benchmark designed to assess a\nmodels ability to follow format instructions. These improvements can be linked to the inclusion\nof instruction-following...],
  metadata={'id': 'chatcmpl-B0BaZ0vwIa58deI0k8NIuH6pBhngw', 'choices': [{'finish_reason': 'stop', 'index': 0, 'logprobs': None, 'message': {'refusal': None, 'role': 'assistant', 'audio': None, 'function_call': None, 'tool_calls': None}}], 'created': 1739384247, 'model': 'gpt-4o-2024-08-06', 'object': 'chat.completion', 'service_tier': 'default', 'system_fingerprint': 'fp_4691090a87', ...}
)

This command performs a search on the ingested documents and uses the retrieved information to generate a response.
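
Because the response is structured, citations can be mapped back to their source chunks programmatically. A sketch against the fields shown in the output above:

response = client.retrieval.rag(query="What is DeepSeek R1?")
results = response.results
print(results.generated_answer)
for citation in results.citations:
    # each citation carries the chunk id and its parent document id
    print(citation.index, citation.id, citation.document_id)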

R2R also supports streaming RAG responses, which can be useful for real-time applications. To use streaming RAG:

response = client.retrieval.rag(
    "who was aristotle",
    rag_generation_config={"stream": True},
    search_settings={"use_hybrid_search": True},
)
for chunk in response:
    print(chunk, end='', flush=True)

Example Output:

<search>["{\"id\":\"808c47c5-ebef-504a-a230-aa9ddcfbd87 .... </search>
<completion>Aristotle was an Ancient Greek philosopher and polymath born in 384 BC in Stagira, Chalcidice [1], [4]. He was a student of Plato and later became the tutor of Alexander the Great [2]. Aristotle founded the Peripatetic school of philosophy in the Lyceum in Athens and made significant contributions across a broad range of subjects, including natural sciences, philosophy, linguistics, economics, politics, psychology, and the arts [4]. His work laid the groundwork for the development of modern science [4]. Aristotle's influence extended well beyond his time, impacting medieval Islamic and Christian scholars, and his contributions to logic, ethics, and biology were particularly notable [8], [9], [10].</completion>

Streaming allows the response to be generated and sent in real-time, chunk by chunk.

R2R offers extensive customization options for its Retrieval-Augmented Generation (RAG) functionality:

  1. Search Settings: Customize vector and knowledge graph search parameters using VectorSearchSettings and KGSearchSettings.

  2. Generation Config: Fine-tune the language model’s behavior with GenerationConfig, including:

    • Temperature, top_p, top_k for controlling randomness
    • Max tokens, model selection, and streaming options
    • Advanced settings like beam search and sampling strategies
  3. Multiple LLM Support: Easily switch between different language models and providers:

    • OpenAI models (default)
    • Anthropic’s Claude models
    • Local models via Ollama
    • Any provider supported by LiteLLM

Example of customizing the model:

# requires ANTHROPIC_API_KEY to be set
response = client.retrieval.rag(
    "Who was Aristotle?",
    rag_generation_config={"model": "anthropic/claude-3-haiku-20240307", "stream": True},
)
for chunk in response:
    print(chunk, end='', flush=True)

This flexibility allows you to optimize RAG performance for your specific use case and leverage the strengths of various LLM providers.
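
For example, several GenerationConfig options can be combined in a single request. A minimal sketch (exact field names are listed in the GenerationConfig reference):

response = client.retrieval.rag(
    "Who was Aristotle?",
    rag_generation_config={
        "model": "openai/gpt-4o-mini",
        "temperature": 0.2,  # lower temperature for more deterministic answers
        "top_p": 0.95,
    },
)
print(response.results.generated_answer)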

The R2R Reasoning Agent combines retrieval-augmented generation with step-by-step reasoning to produce higher quality responses from your documents.

client.retrieval.rag(
    query="What does deepseek r1 imply?",
    rag_generation_config={
        "stream": True,
    },
)

Example output:

<Thought>Calling function: local_search, with payload {"query":"DeepSeek R1"}</Thought>
<Thought>The search results provide a comprehensive overview of DeepSeek-R1, highlighting its capabilities and performance across various benchmarks and tasks. DeepSeek-R1 is a reasoning model developed by DeepSeek-AI, which leverages reinforcement learning (RL) and instruction-following data to enhance its performance. It excels in tasks such as writing, open-domain question answering, and handling fact-based queries. The model outperforms its predecessor, DeepSeek-V3, in several areas, although it falls short in some complex tasks like function calling and multi-turn interactions. DeepSeek-R1 also demonstrates strong performance in educational tasks and creative writing, showcasing its versatility and robustness.

Key points about DeepSeek-R1 include:
- It achieves impressive results on benchmarks like IF-Eval, AlpacaEval2.0, and ArenaHard, indicating strengths in writing and question answering [Source 1].
- The model is used as a teacher to distill reasoning capabilities into smaller models, which also perform well on benchmarks [Source 2].
- It outperforms DeepSeek-V3 on factual benchmarks like SimpleQA but has limitations in language mixing and certain complex tasks [Sources 3, 5].
- DeepSeek-R1 demonstrates expert-level performance in coding tasks and strong results in educational benchmarks like MMLU and GPQA Diamond [Sources 6, 9].

Overall, DeepSeek-R1 is a powerful model with a focus on reasoning and instruction-following, achieving competitive performance across a wide range of tasks.</Thought>
<Response>DeepSeek-R1 is a reasoning model developed by DeepSeek-AI, known for its strong performance in writing tasks, open-domain question answering, and handling fact-based queries. It leverages reinforcement learning and instruction-following data to enhance its capabilities. The model outperforms its predecessor, DeepSeek-V3, in several areas and is used to distill reasoning capabilities into smaller models. Despite its strengths, it has limitations in complex tasks like function calling and language mixing. Overall, DeepSeek-R1 is a versatile and robust model with competitive performance across various benchmarks.</Response>

Behind the scenes, R2R’s RetrievalService handles RAG requests, combining the power of vector search, optional knowledge graph integration, and language model generation.

Graphs in R2R

R2R implements a Git-like model for knowledge graphs, where each collection has a corresponding graph that can diverge and be independently managed. This approach allows for flexible knowledge management while maintaining data consistency.

Graph-Collection Relationship

  • Each collection has an associated graph that acts similar to a Git branch
  • Graphs can diverge from their underlying collections through independent updates
  • The pull operation syncs the graph with its collection, similar to a Git pull
  • This model enables experimental graph modifications without affecting the base collection

Knowledge Graph Workflow

Extract entities and relationships from the previously ingested document:

# document_id from the earlier ingestion step
client.documents.extract(document_id)

This step processes the document to identify entities and their relationships.
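
Extraction runs asynchronously. One way to check progress is to re-fetch the document and inspect its extraction_status, which moves from 'pending' to 'success' as seen in the DocumentResponse output earlier (a sketch, assuming a documents.retrieve method mirroring the REST document-retrieval endpoint):

# re-fetch the document and check extraction progress
doc = client.documents.retrieve(id=document_id)
print(doc.results.extraction_status)  # e.g. 'pending' -> 'success'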

Sync the graph with the collection and view extracted knowledge:

collection_id = "122fdf6a-e116-546b-a8f6-e4cb2e2c0a09"  # default collection_id for admin

# Sync graph with collection
pull_response = client.graphs.pull(collection_id)

# View extracted knowledge
entities = client.graphs.list_entities(collection_id)
relationships = client.graphs.list_relationships(collection_id)
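
The returned lists can be iterated like any other paginated response. A sketch; the attribute names follow the Entity and Relationship models and may vary by version:

# print extracted entities and relationships
for entity in entities.results:
    print(entity.name, entity.category)
for rel in relationships.results:
    print(rel.subject, rel.predicate, rel.object)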

Build and list graph communities:

# Build communities
build_response = client.graphs.build(collection_id, settings={})

# List communities
communities = client.graphs.list_communities(collection_id)

[
  Community(
    name='Large Language Models and AGI Community',
    summary='The Large Language Models and AGI Community focuses on the development and implications of advanced AI technologies, particularly in the pursuit of Artificial General Intelligence.',
    level=None,
    findings=[
      'Large Language Models (LLMs) are rapidly evolving towards capabilities akin to Artificial General Intelligence (AGI) [Data: Descriptions (1579a46f-be12-4e60-a96b-e5b5afe026d9)].',
      'The primary aim of LLMs is to achieve functionalities that closely resemble AGI [Data: Relationships (22bb116d-ab0b-4390-a68f-6ef1a1c99999)].',
      'AGI systems are designed to outperform humans in most economically valuable tasks, indicating their potential impact on various industries [Data: Descriptions (80a34efa-d569-488f-91fd-db08fd93667b)].',
      'The development of LLMs is a critical step towards realizing the goals of AGI, highlighting the interconnectedness of these technologies [Data: Relationships (22bb116d-ab0b-4390-a68f-6ef1a1c99999)].',
      'Research in LLMs is essential for understanding the ethical implications of AGI deployment in society [Data: Descriptions (1579a46f-be12-4e60-a96b-e5b5afe026d9)].'
    ],
    id=UUID('62fd3478-f303-47ba-941a-fcf41576615d'),
    community_id=None,
    collection_id=UUID('122fdf6a-e116-546b-a8f6-e4cb2e2c0a09'),
    rating=9.0,
    rating_explanation='This community has a significant impact on the future of AI, as it drives research towards achieving AGI capabilities.',
    ...
  ), ...
]

Reset the graph to a clean state:

client.graphs.reset(collection_id)

Best Practices

  1. Graph Synchronization

    • Always pull before attempting to list or work with entities
    • Keep track of which documents have been added to the graph
  2. Community Management

    • Build communities after significant changes to the graph
    • Use community information to enhance search results
  3. Version Control

    • Treat graphs like Git branches - experiment freely
    • Use reset to start fresh if needed
    • Maintain documentation of graph modifications

This Git-like model provides a flexible framework for knowledge management while maintaining data consistency and enabling experimental modifications.

User Management

R2R provides robust user auth and management capabilities. This section briefly covers user authentication features and how they relate to document management.

To register a new user:

from r2r import R2RClient

client = R2RClient()
client.users.create("[email protected]", "password123")

Example output:

User(
  id=UUID('fcbcbc64-f85c-5025-877c-37f4c7a12d6e'),
  email='[email protected]',
  is_active=True,
  is_superuser=False,
  created_at=datetime.datetime(2025, 2, 8, 5, 8, 17, 376293, tzinfo=TzInfo(UTC)),
  updated_at=datetime.datetime(2025, 2, 8, 5, 8, 17, 376293, tzinfo=TzInfo(UTC)),
  is_verified=False,
  collection_ids=[UUID('d3ef9c77-cb13-59a9-be70-0db46de619db')],
  graph_ids=[],
  document_ids=[],
  limits_overrides={},
  metadata={},
  verification_code_expiry=None,
  name=None,
  bio=None,
  profile_picture=None,
  total_size_in_bytes=None,
  num_files=None,
  account_type='password',
  hashed_password='JDJiJDEyJDE4UFdOTWZTSHNxdzRRMDdKZXU2Nk9qMFNNbXFxVFZldmpHaGhjdTcwdk5hNDZubEMxblVD',
  google_id=None,
  github_id=None
)

After registration, users need to verify their email:

client.users.verify_email("123456")  # Verification code sent to email

To log in and obtain access tokens:

client.users.login("[email protected]", "password123")

LoginResponse(access_token=Token(token='eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiJ0ZXN0QGV4YW1wbGUuY29tIiwidG9rZW5fdHlwZSI6ImFjY2VzcyIsImV4cCI6MTc0MjU5MTQ0Ni43MTY2MzcsImlhdCI6MTczODk5MTQ0Ni43MTY3MDUsIm5iZiI6MTczODk5MTQ0Ni43MTY3MDUsImp0aSI6IkhkWWVfeWxOSm9Yc2tvaU5ZVkdoNHc9PSIsIm5vbmNlIjoiMkhOOUs3bU40QVNfVnkzOTdXR2Vpdz09In0.gG_9oa-7_ZHqfHHo-bE1ooynCm7YCQFCYbJoiEgGmTg', token_type='access'), refresh_token=Token(token='eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiJ0ZXN0QGV4YW1wbGUuY29tIiwidG9rZW5fdHlwZSI6InJlZnJlc2giLCJleHAiOjE3Mzk1OTYyNDYuNzE3MzQxLCJpYXQiOjE3Mzg5OTE0NDYuNzE3MzQ5LCJuYmYiOjE3Mzg5OTE0NDYuNzE3MzQ5LCJqdGkiOiJybXltZTk5bGNtZklOWDZLQWNaTmpBPT0iLCJub25jZSI6InExRGdqZm96YkpjYXpDbzdTcE5XcWc9PSJ9.Zn-2pncsEdvyuig36N4APO_U9AWDQcJi6E5EjglN16U', token_type='refresh'))

To refresh an expired access token:

# requires client.users.login(...)
client.users.refresh_access_token()["results"]

To log out and invalidate the current access token:

# requires client.users.login(...)
client.users.logout()

These authentication features ensure that users can only access and manage their own documents. When performing operations like search, RAG, or document management, the results are automatically filtered based on the authenticated user’s permissions.
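
In practice this means the same call returns different results for different users. A minimal sketch:

# after logging in, listings are scoped to this user's documents and
# shared collections
client.users.login("[email protected]", "password123")
docs = client.documents.list()
print(docs.total_entries)  # counts only documents visible to this user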



Next Steps

Now that you have a basic understanding of R2R’s core features, you can explore more advanced topics in the documentation.
