ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs.

17 articles 73.6k View on GitHub ↗
Debugging RAGFlow Applications: 8 Essential Techniques for Micro-Service Troubleshooting

Master RAGFlow application debugging with 8 essential techniques. Learn to use centralized logging, control verbosity, and attach remote debuggers for efficient micro-service troubleshooting.

best-practices
Feb 23, 2026
How RAGFlow Supports Querying Heterogeneous Data Sources: Architecture and Implementation

Discover how RAGFlow unifies heterogeneous data sources with its connector abstraction and unified doc-store for seamless cross-source querying without complex logic. Explore architecture and implementation.

architecture
Feb 23, 2026
How RAGFlow Implements Python and JavaScript Code Execution for AI Agents

Discover how RAGFlow securely executes Python and JavaScript code for AI agents using a novel three-layer sandbox architecture. Learn about its unified CodeExec component.

internals
Feb 23, 2026
How RAGFlow Manages Document Metadata for Efficient Retrieval and Indexing

Discover how RAGFlow manages document metadata for efficient retrieval and indexing. Leverage per-tenant search indices for millisecond-level filtering alongside vector search.

deep-dive
Feb 23, 2026
Scaling RAGFlow for Large Volumes of Documents and Queries: Architecture and Best Practices

Discover how to scale RAGFlow for massive document volumes and high query loads. Explore architecture, best practices, and key considerations for enterprise-grade performance.

architecture
Feb 23, 2026
How RAGFlow Handles Authentication and Authorization for API Endpoints

Learn how RAGFlow secures its API endpoints with a two-layer authentication and authorization mechanism involving UUID tokens and login-required decorators.

how-to-guide
Feb 23, 2026
How to Use the RAGFlow Python SDK to Build Custom RAG Applications

Build custom RAG applications with the RAGFlow Python SDK. Programmatically manage datasets, parse documents, and create chat sessions without HTTP boilerplate.

how-to-guide
Feb 23, 2026
How RAGFlow Parses Multi-Modal Information from PDFs and DOCX Files: A Deep Dive into the DeepDoc Parser

Discover how RAGFlow's DeepDoc parser extracts text, tables, and figures from PDFs and DOCX files for advanced RAG pipelines. Learn about OCR, layout, and table detection for unified data representation.

deep-dive
Feb 23, 2026
How Cross-Language Query Support Works in RAGFlow: A Technical Deep Dive

Discover how RAGFlow implements cross-language query support. Learn how LLM translation and prompt templates enable seamless retrieval across multilingual documents. Read the technical deep dive.

deep-dive
Feb 23, 2026
How RAGFlow Manages State and Memory for Its AI Agents: A Deep Dive into the Document-Store Architecture

Explore RAGFlow's document-level memory architecture for AI agents. Discover how per-user indices and hybrid retrieval manage conversational state across sessions efficiently.

deep-dive
Feb 23, 2026
How RAGFlow Handles Data Synchronization from Confluence, S3, Notion, Discord, and Google Drive

Discover how RAGFlow synchronizes data from Confluence, S3, Notion, Discord, and Google Drive using its unified SyncBase abstraction and configurable batch processing for seamless updates.

how-to-guide
Feb 23, 2026
RAGFlow Agent Tools: Complete Catalog and Extension Guide

Explore RAGFlow agent tools: discover built-in tools such as Wikipedia and Google Search, and learn how to extend RAGFlow by adding new tools to its agent tools directory.

api-reference
Feb 23, 2026
