# ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation
- 📜 Introduction
- ⚙️ Environment setup
- 🔨 Data construction
- 🏋🏻‍♂️ ReaRAG Training
- 🤖️ Inference
- 📝 Citation
## ⚙️ Environment setup

```bash
conda create --name rearag python=3.10 -y && conda activate rearag
pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu124
pip install vllm==0.6.5
pip install datasets flask langid uvicorn termcolor jieba fuzzywuzzy rouge
conda install -c pytorch -c nvidia faiss-gpu=1.8.0
```
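Optionally, you can verify the installation with a quick sanity check. This is a minimal sketch: it only confirms that `torch` sees your GPU and that the `faiss-gpu` package imports correctly.

```python
# check_env.py -- quick sanity check for the setup above
import torch
import faiss  # provided by the faiss-gpu conda package

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("faiss:", faiss.__version__, "| GPUs visible to faiss:", faiss.get_num_gpus())
```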
### RAG Engine Deployment

Please note that the RAG engine provided here differs from the implementation described in the original paper. To facilitate public usage, we offer a practical and simplified version in this repository.

To set up the RAG engine, first download the following:
- E5-base-v2 Retriever. 🤗 Link
- 2018 Wikipedia Corpus. 🤗 Link
- Indexed 2018 Wikipedia Corpus: `FlashRAG_Dataset/retrieval_corpus/wiki18_100w_e5_index.zip`
- Modify the config in `ReaRAG/deploy/deploy_config.sh` and `ReaRAG/deploy/retriever_config.yaml`.
- Run the deployment script and make sure you see the phrase `'xx running on http://{host}:{port}'` to confirm deployment:
```bash
# From within ReaRAG/deploy/
bash deploy_rag_engine.sh
```
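Once the engine is up, you can probe it with a small request. The snippet below is a minimal sketch: the `/search` route and the `{"query", "top_k"}` payload are placeholders, not the repo's documented API, so check the Flask app under `ReaRAG/deploy/` for the actual route and schema, and substitute the host/port you configured.

```python
# query_rag_engine.py -- smoke test for the deployed RAG engine
# NOTE: the endpoint path and payload below are assumptions; adjust them
# to match the routes defined in the Flask app under ReaRAG/deploy/.
import requests  # pip install requests

HOST, PORT = "localhost", 5000  # use the values from deploy_config.sh
payload = {"query": "What is the capital of France?", "top_k": 5}  # hypothetical schema

resp = requests.post(f"http://{HOST}:{PORT}/search", json=payload, timeout=30)
resp.raise_for_status()
print(resp.json())  # expected: a list of retrieved passages
```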
## 🔨 Data construction

Before starting, make sure you have deployed the `rag_engine` (see RAG Engine Deployment). Then, follow the steps below to deploy an LLM/LRM (e.g., `QwQ-32b-preview`) for data construction:
- Modify the environment variables in `ReaRAG/deploy/deploy_async.sh`.
- Run the deployment script and make sure you see the phrase `'xx running on http://{host}:{port}'` to confirm deployment:

```bash
# From within ReaRAG/deploy/
bash deploy_async.sh
```
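To confirm the model server answers requests, a probe along the following lines can help. This assumes `deploy_async.sh` exposes vLLM's OpenAI-compatible API (an assumption on our part; if the script wraps the model in a custom route instead, adapt the URL and payload accordingly).

```python
# query_llm.py -- smoke test for the deployed LLM/LRM
# ASSUMPTION: deploy_async.sh serves vLLM's OpenAI-compatible API; if it
# uses a custom Flask route instead, adapt the URL and payload.
import requests  # pip install requests

HOST, PORT = "localhost", 8000  # use the values from deploy_async.sh
payload = {
    "model": "QwQ-32b-preview",  # must match the served model name
    "messages": [{"role": "user", "content": "Say hello."}],
    "max_tokens": 64,
}

resp = requests.post(f"http://{HOST}:{PORT}/v1/chat/completions", json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```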
We construct data from HotpotQA, MuSiQue, and NQ. Therefore, make sure you have downloaded them and processed them into the following structure:

```json
{
  "question": "What is the capital of ...",
  "answer": "The capital of xxx is ..."
}
```
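For reference, here is a minimal sketch of how raw HotpotQA records could be flattened into that structure. The file names are illustrative, not part of the repo; MuSiQue and NQ need analogous field mappings.

```python
# to_qa_format.py -- flatten HotpotQA records into {"question", "answer"} pairs
# The input/output file names below are illustrative.
import json

with open("hotpot_dev_distractor_v1.json") as f:  # raw HotpotQA release file
    raw = json.load(f)

# HotpotQA records carry "question" and "answer" fields directly.
records = [{"question": ex["question"], "answer": ex["answer"]} for ex in raw]

with open("hotpotqa_processed.json", "w") as f:
    json.dump(records, f, ensure_ascii=False, indent=2)
```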
Next, modify the config in `ReaRAG/src_data/data_config.yaml`. Then, execute the script below:

```bash
# From within ReaRAG/
bash data_construct.sh
```
The constructed data will be saved under `ReaRAG/src_data/data`, with a name such as `conv_qwq.json`, where each entry contains a list of conversation messages, structured as below:

```json
{
  "messages": [
    {"role": "user", "content": "..."},
    {"role": "assistant", "reasoning": "..."},
    {"role": "observation", "content": "..."},
    ...
  ]
}
```

During SFT, the loss is computed only on messages that contain the `reasoning` key, rather than the `content` key.
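As an illustration, a tokenizer-agnostic sketch of that masking rule might look like this. The `loss_weight` helper is hypothetical; the actual masking lives in the training framework.

```python
# loss_mask_sketch.py -- sketch of the SFT masking rule described above:
# only assistant turns carrying a "reasoning" key contribute to the loss.
def loss_weight(message: dict) -> float:
    """Return 1.0 for messages that should be trained on, else 0.0."""
    return 1.0 if "reasoning" in message else 0.0

conversation = [
    {"role": "user", "content": "..."},         # masked (weight 0.0)
    {"role": "assistant", "reasoning": "..."},  # trained on (weight 1.0)
    {"role": "observation", "content": "..."},  # masked (weight 0.0)
]
print([loss_weight(m) for m in conversation])   # -> [0.0, 1.0, 0.0]
```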
## 🏋🏻‍♂️ ReaRAG Training

Training data can be found on 🤗 Hugging Face. You can mix it with general SFT data such as ShareGPT. We adopt Megatron-LM for model training. For a more lightweight implementation, you may adopt the code and environment from LongAlign.
## 🤖️ Inference

Before starting, make sure you have deployed the `rag_engine` (see RAG Engine Deployment). Then, follow the steps below to deploy ReaRAG:
- Modify the config in `ReaRAG/deploy/deploy_config.sh`.
- Run the deployment script and make sure you see the phrase `'xx running on http://{host}:{port}'` to confirm deployment:

```bash
# From within ReaRAG/deploy/
bash deploy.sh
```
Next, modify the config in `ReaRAG/infer.sh`. Then, execute the script below:

```bash
# From within ReaRAG/
bash infer.sh
```
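If `infer.sh` fails, it can help to send a single question to the deployed ReaRAG server by hand. The sketch below is assumption-heavy: the `/reason` route and payload shape are placeholders we made up for illustration, so consult the Flask app under `ReaRAG/deploy/` for the real interface.

```python
# probe_rearag.py -- manually send one question to the deployed ReaRAG server
# NOTE: the route ("/reason") and payload schema below are assumptions;
# check the Flask app in ReaRAG/deploy/ for the actual API.
import requests  # pip install requests

HOST, PORT = "localhost", 8001  # use the values from deploy_config.sh
payload = {"question": "Who wrote the novel adapted into the film Blade Runner?"}

resp = requests.post(f"http://{HOST}:{PORT}/reason", json=payload, timeout=120)
resp.raise_for_status()
print(resp.json())  # expected: the reasoning chain plus a final answer
```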
## 📝 Citation

If you find our work useful, please consider citing ReaRAG:
```bibtex
@article{lee2025rearag,
  title={ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation},
  author={Lee, Zhicheng and Cao, Shulin and Liu, Jinxin and Zhang, Jiajie and Liu, Weichuan and Che, Xiaoyin and Hou, Lei and Li, Juanzi},
  journal={arXiv preprint arXiv:2503.21729},
  year={2025}
}
```