Documents
A Document
in R2R represents a piece of content that has been ingested into the system, resulting in downstream Chunks
, Entities
, and more. Documents are the fundamental unit of content management and can be:
- Text files, PDFs, images, audio files, and other supported formats
- Broken down into chunks for efficient retrieval
- Processed to extract entities and relationships for knowledge graph creation
- Associated with metadata and collections
- Tracked for ingestion and knowledge graph extraction status