Deploying R2R on Amazon Web Services (AWS)

Amazon Web Services (AWS) provides a scalable platform for deploying R2R. This guide walks you through setting up R2R on an Amazon EC2 instance and making it accessible both locally (through SSH port forwarding) and, optionally, publicly.

Overview

Deploying R2R on AWS involves the following main steps:

Creating an Amazon EC2 instance
Installing necessary dependencies
Setting up R2R
Configuring port forwarding for local access
Exposing ports for public access (optional)

This guide assumes you have an AWS account and the necessary permissions to create and manage EC2 instances.

Creating an Amazon EC2 Instance

Log in to the AWS Management Console.
Navigate to EC2 under “Compute” services.
Click “Launch Instance”.
Choose an Amazon Machine Image (AMI):
- Select “Ubuntu Server 22.04 LTS (HVM), SSD Volume Type”
Choose an Instance Type:
- For a small to mid-sized organization (fewer than 5,000 users), select t3.xlarge (4 vCPU, 16 GiB memory) or larger
Configure Instance Details:
- Leave default settings or adjust as needed
Add Storage:
- Set the root volume to at least 500 GiB
Add Tags (optional):
- Add any tags for easier resource management
Configure Security Group:
- Create a new security group
- Add rules to allow inbound traffic on ports 22 (SSH) and 7272 (R2R API)
Review and Launch:
- Review your settings and click “Launch”
- Choose or create a key pair for SSH access
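
If you prefer the command line, the console steps above can be approximated with the AWS CLI. This is a minimal sketch rather than part of the official procedure: it assumes the AWS CLI is installed and configured, the function name launch_r2r_instance is hypothetical, and the AMI ID, key pair name, and security group ID are values you must supply. The root device name /dev/sda1 is an assumption that holds for typical Ubuntu AMIs; verify it for the image you choose.

```shell
# Sketch: launch an instance matching the console settings above.
# All three arguments are placeholders you must supply.
launch_r2r_instance() {
  ami_id="$1"      # an Ubuntu Server 22.04 LTS AMI ID for your region
  key_name="$2"    # name of an existing EC2 key pair
  sg_id="$3"       # security group allowing inbound 22 and 7272

  aws ec2 run-instances \
    --image-id "$ami_id" \
    --instance-type t3.xlarge \
    --key-name "$key_name" \
    --security-group-ids "$sg_id" \
    --block-device-mappings 'DeviceName=/dev/sda1,Ebs={VolumeSize=500,VolumeType=gp3}' \
    --count 1
}
```

Call it as launch_r2r_instance followed by your AMI ID, key pair name, and security group ID. The volume type gp3 is also an assumption; the guide itself only specifies the 500 GiB size.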

Installing Dependencies

SSH into your newly created EC2 instance:

ssh -i /path/to/your-key.pem ubuntu@your-instance-public-dns

Now, run the following commands to install the necessary R2R dependencies:

# Update package list
sudo apt update

# Install Git
sudo apt install git -y

# Install Docker
sudo apt-get update
sudo apt-get install ca-certificates curl gnupg -y
sudo install -m 0755 -d /etc/apt/keyrings
curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo gpg --dearmor -o /etc/apt/keyrings/docker.gpg
sudo chmod a+r /etc/apt/keyrings/docker.gpg

echo \
  "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.gpg] https://download.docker.com/linux/ubuntu \
  $(. /etc/os-release && echo "$VERSION_CODENAME") stable" | \
  sudo tee /etc/apt/sources.list.d/docker.list > /dev/null

sudo apt-get update
sudo apt-get install docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin -y

# Add your user to the Docker group
sudo usermod -aG docker "$USER"
# Start a shell with the new group applied (or log out and back in)
newgrp docker

# Verify Docker installation
docker run hello-world
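
Before moving on, it can help to confirm that everything installed above is actually available in your shell. The following is a small sketch; the function names require_cmd and check_r2r_prereqs are hypothetical helpers, not part of R2R or Docker.

```shell
# Sketch: confirm the tools installed above are on PATH before continuing.
require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "ok: $1"
  else
    echo "missing: $1" >&2
    return 1
  fi
}

check_r2r_prereqs() {
  require_cmd git || return 1
  require_cmd docker || return 1
  # The Compose plugin is a docker subcommand rather than a separate binary.
  docker compose version >/dev/null 2>&1 || {
    echo "missing: docker compose plugin" >&2
    return 1
  }
  echo "all prerequisites found"
}
```

Run check_r2r_prereqs after pasting the functions into your session. A non-zero exit status means one of the installation steps needs to be repeated.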

Setting up R2R

Clone the R2R repository:

git clone https://github.com/SciPhi-AI/R2R.git
cd R2R/docker

Set up environment variables:

cd env
# Edit r2r-full.env with your preferred text editor
nano r2r-full.env

Add the necessary environment variables:

# Choose configuration
R2R_CONFIG_NAME=full
# Add your API key(s)
OPENAI_API_KEY=sk-...
# Optional - Add agent tool API keys if needed
# SERPER_API_KEY=your_serper_api_key_here
# FIRECRAWL_API_KEY=your_firecrawl_api_key_here
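
A missing or commented-out key is an easy mistake to make when editing the file by hand. As a rough sketch, the hypothetical helper below (check_env_file is not an R2R command) reports any key that has no uncommented, non-empty assignment. It cannot tell whether a value is valid, so a placeholder left in place would still pass.

```shell
# Sketch: report which required keys are missing or empty in an env file.
# Usage: check_env_file FILE KEY [KEY...]
check_env_file() {
  env_file="$1"
  shift
  missing=0
  for key in "$@"; do
    # Match an uncommented KEY=value line with a non-empty value.
    if ! grep -Eq "^${key}=.+" "$env_file"; then
      echo "not set: $key" >&2
      missing=1
    fi
  done
  return "$missing"
}
```

From the env directory, you might run check_env_file r2r-full.env R2R_CONFIG_NAME OPENAI_API_KEY before starting the services.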

Start the R2R services:

cd ..
docker compose -f compose.full.yaml --profile postgres up -d

Verify the health of the system:

# Wait for services to start
sleep 30

# Check health
curl http://localhost:7272/v3/health
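
A fixed 30-second wait may be too short on a cold start or needlessly long on a warm one. One alternative is to poll the health endpoint until it responds. This is a sketch, assuming only that the endpoint returns a successful HTTP status once R2R is ready; wait_for_health is a hypothetical helper and the default timeout and interval are arbitrary.

```shell
# Sketch: poll a URL until it responds successfully or a timeout elapses.
# Usage: wait_for_health URL [TIMEOUT_SECONDS] [INTERVAL_SECONDS]
wait_for_health() {
  url="$1"
  timeout="${2:-120}"
  interval="${3:-5}"
  elapsed=0
  while [ "$elapsed" -lt "$timeout" ]; do
    # -f makes curl return non-zero on HTTP errors (4xx/5xx).
    if curl -fs -o /dev/null "$url"; then
      echo "healthy after ${elapsed}s"
      return 0
    fi
    sleep "$interval"
    elapsed=$((elapsed + interval))
  done
  echo "not healthy after ${timeout}s" >&2
  return 1
}
```

For example, wait_for_health http://localhost:7272/v3/health 180 waits up to three minutes and returns a non-zero status if the service never comes up, which makes it usable in provisioning scripts.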

Test ingesting and searching a sample document from a remote environment:

# From your local machine
curl -X POST "http://YOUR_INSTANCE_IP:7272/v3/documents/create-sample"
sleep 10
curl -X POST "http://YOUR_INSTANCE_IP:7272/v3/search" \
  -H "Content-Type: application/json" \
  -d '{"query": "Who was Aristotle?"}'

Replace YOUR_INSTANCE_IP with your EC2 instance’s public IP address.

Configuring Port Forwarding for Local Access

To access R2R from your local machine, use SSH port forwarding:

ssh -i /path/to/your-key.pem -L 7272:localhost:7272 -L 7273:localhost:7273 ubuntu@your-instance-public-dns

This will allow you to access:

The API at http://localhost:7272
The dashboard at http://localhost:7273
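
If you only need the forwarded ports and not an interactive shell, the same tunnel can run in the background. The sketch below wraps the command in a hypothetical helper, open_r2r_tunnel; it uses the standard OpenSSH options -f (go to background), -N (no remote command), and ExitOnForwardFailure (fail if a local port is already taken).

```shell
# Sketch: open the two forwards in the background without a remote shell.
open_r2r_tunnel() {
  key_path="$1"   # path to your .pem key
  host="$2"       # your instance's public DNS name or IP

  ssh -f -N \
    -o ExitOnForwardFailure=yes \
    -i "$key_path" \
    -L 7272:localhost:7272 \
    -L 7273:localhost:7273 \
    "ubuntu@$host"
}
```

Call it as open_r2r_tunnel /path/to/your-key.pem your-instance-public-dns. Because the process runs in the background, you need to stop it yourself when finished, for example by finding it with ps and killing it.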

Exposing Ports for Public Access (Optional)

To make R2R publicly accessible:

In the AWS Management Console, go to EC2 > Security Groups.
Select the security group associated with your EC2 instance.
Click “Edit inbound rules”.
Add new rules:
- Type: Custom TCP, Port range: 7272, Source: Anywhere (0.0.0.0/0), Description: R2R API
- Type: Custom TCP, Port range: 7273, Source: Anywhere (0.0.0.0/0), Description: R2R Dashboard
Click “Save rules”.
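
The same rules can be added, and later removed, from the AWS CLI. This is a sketch under the assumption that the AWS CLI is configured for your account; open_port and close_port are hypothetical wrapper names, and the security group ID and CIDR are values you supply.

```shell
# Sketch: open or close an inbound TCP port on a security group.
# Usage: open_port SG_ID PORT CIDR
#        close_port SG_ID PORT CIDR
open_port() {
  aws ec2 authorize-security-group-ingress \
    --group-id "$1" --protocol tcp --port "$2" --cidr "$3"
}

close_port() {
  aws ec2 revoke-security-group-ingress \
    --group-id "$1" --protocol tcp --port "$2" --cidr "$3"
}
```

Passing a narrow CIDR such as 203.0.113.10/32 (an example address only) instead of 0.0.0.0/0 limits access to a single client, and calling close_port with the same arguments removes the rule once testing is done.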

After starting your R2R application, users can access:

The API at http://YOUR_INSTANCE_IP:7272
The dashboard at http://YOUR_INSTANCE_IP:7273

Security Considerations

Use HTTPS (port 443) with a valid SSL certificate for production.
Restrict source IP addresses in the security group rules where possible, rather than allowing 0.0.0.0/0.
Regularly update and patch your system and applications.
Use AWS VPC for network isolation.
Enable and configure AWS CloudTrail for auditing.
Use AWS IAM roles for secure access management.
Consider using AWS Certificate Manager for SSL/TLS certificates.
Monitor incoming traffic using Amazon CloudWatch.
Remove the public inbound security group rules when they are no longer needed for testing.

Troubleshooting

If you encounter issues:

Check Docker container status:

docker ps
docker logs <container_id>

Verify environment variables are correctly set in r2r-full.env.
Ensure ports are correctly exposed in your security group.
Check disk space and system resources:

df -h
free -m
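
Reading df output by eye is easy to get wrong on a busy instance. As a sketch, the hypothetical helper below (check_disk_usage, with an arbitrary default limit of 90%) turns the same check into a pass or fail result. It assumes the filesystem name contains no spaces, which holds for standard EC2 volumes.

```shell
# Sketch: warn when the filesystem holding a path is above a usage threshold.
# Usage: check_disk_usage [PATH] [MAX_PERCENT]
check_disk_usage() {
  path="${1:-/}"
  max_pct="${2:-90}"
  # df -P gives stable POSIX columns; field 5 of the second line is the use percentage.
  used_pct="$(df -P "$path" | awk 'NR==2 { sub(/%/, "", $5); print $5 }')"
  if [ "$used_pct" -ge "$max_pct" ]; then
    echo "disk usage for $path is ${used_pct}% (limit ${max_pct}%)" >&2
    return 1
  fi
  echo "disk usage for $path is ${used_pct}%"
}
```

Running check_disk_usage / 80 returns a non-zero status once the root volume passes 80% full, which is a reasonable first thing to rule out when containers fail to start or ingestion stalls.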

Conclusion

You have now successfully deployed R2R on Amazon Web Services. The application should be accessible locally through SSH tunneling and optionally publicly through direct access to the EC2 instance.

For more information on configuring and using R2R, refer to the configuration documentation or join our Discord community for assistance.