
HydraRAG: Structured Cross-Source Enhanced LLM Reasoning (EMNLP2025, Main Conference)


News!

Our paper has been accepted for publication at EMNLP 2025 Main Conference!

How to cite

If you find this work interesting or it inspires your research, please cite us:

@inproceedings{tan2025hydrarag,
  title={HydraRAG: Structured Cross-Source Enhanced Large Language Model Reasoning},
  author={Tan, Xingyu and Wang, Xiaoyang and Liu, Qing and Xu, Xiwei and Yuan, Xin and Zhu, Liming and Zhang, Wenjie},
  booktitle={Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing},
  pages={14442--14470},
  year={2025}
}

Code Documentation

Hydra unifies structured knowledge graphs, Wikipedia documents, and live web search so that large language models can reason over verified multi‑source evidence.

Hydra baseline


Directory layout

Hydra/
├── answer/                 evaluation helpers and scoring scripts
├── data/                   benchmark datasets (CWQ, AdvHotpotQA, QALD10-en, SimpleQA, WebQSP, Webquestions, Zeroshot RE)
├── Freebase/               Freebase environment setup – see Freebase/README.md
├── freebase_subgraph/      Freebase subgraph KG
├── Hydra_run/              main source folder – run code from here
│   ├── hydra_main.py       entry point
│   ├── cot_prompt_list.py  chain‑of‑thought prompts
│   ├── freebase_func.py    Freebase SPARQL helpers
│   ├── wiki_client.py      WikiKG client helpers
│   ├── subgraph_helper.py  extra graph helpers
│   ├── subgraph_utilts.py  subgraph construction utilities
│   ├── detected_kgsub.py   KG maximum subgraph detection
│   ├── resp_process.py     response post‑processing
│   ├── utilts.py           shared utilities
│   └── utilts2.py          extra utilities
├── online_search/          live web search and document caching
├── wiki_subgraph/          Wiki subgraph KG
├── Wikidata/               Wikidata environment setup – see Wikidata/README.md
├── requirements.txt        Python dependencies
└── README.md               this file

Get started

Before running Hydra, please ensure you have successfully installed Freebase and Wikidata on your local machine. Comprehensive installation instructions and the necessary configuration details can be found in /Freebase/README.md and /Wikidata/README.md.

Once Wikidata is set up, copy the server_urls.txt files from the Wikidata directory into the Hydra_run folder.

You must supply your own LLM API key in the run_LLM function of utilts.py, and your own SerpAPI key in utilts2.py for online search.
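One way to wire in the keys without hard-coding them in the source is to read them from the environment. The sketch below is illustrative only: the environment-variable names and the helper function are assumptions, not Hydra's actual configuration, though the model flags mirror the --model option documented below.

```python
import os

# Illustrative sketch: variable and function names here are hypothetical,
# not Hydra's actual configuration in utilts.py / utilts2.py.
OPENAI_API_KEY = os.environ.get("OPENAI_API_KEY", "")
SERPAPI_KEY = os.environ.get("SERPAPI_KEY", "")

def backend_model_name(model_flag: str) -> str:
    """Map a --model flag to its underlying model (per the options table)."""
    backends = {
        "gpt3": "gpt-3.5-turbo",
        "gpt4": "gpt-4-turbo",
        "llama": "llama-3.1-8b",
        "deepseek": "deepseek-v3",
        "llama70b": "llama-3.1-70b",
    }
    if model_flag not in backends:
        raise ValueError(f"unknown model flag: {model_flag}")
    return backends[model_flag]
```

Keeping credentials in the environment also avoids committing them by accident.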

To set up the environment, install the required dependencies using:

cd Hydra
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Command‑line interface

hydra_main.py exposes the options defined in build_parser.

cd Hydra/Hydra_run

# positional argument: dataset name prefix
# --depth : maximum hop depth
# --allr  : use Hydra rather than Hydra-E
# --model : LLM backend
python hydra_main.py webqsp --depth 3 --allr --model llama70b

Positional argument

| Argument | Meaning |
| --- | --- |
| `file_name` | Dataset prefix. Supported values include `CWQ`, `hotpot` (AdvHotpotQA), `qald`, `simpleqa`, `webqsp`, `webquestions`, `zeroshotre`. |

Main options

| Flag | Description | Default |
| --- | --- | --- |
| `--allsource` | Enable all sources in every expansion step | off |
| `--allr` | Explore all relations (Hydra) rather than a single relation (Hydra-E) | off |
| `--incomplete` | Work with an incomplete KG split | off |
| `--ratio {100,80,50,30}` | When `--incomplete` is on, sample the KG at the given percentage | 100 |
| `--no-summary` | Disable summary generation | on |
| `--no-freebase` | Disable the Freebase KG | on |
| `--no-wikikg` | Disable WikiKG | on |
| `--no-web` | Disable web search | on |
| `--no-wikidocu` | Disable retrieval from Wikipedia documents | on |
| `--model {gpt3,gpt4,llama,deepseek,llama70b}` | LLM backend: `gpt3` = GPT-3.5-Turbo, `gpt4` = GPT-4-Turbo, `llama` = Llama-3.1-8B, `deepseek` = DeepSeek-v3, `llama70b` = Llama-3.1-70B | `llama70b` |
| `--depth {1,2,3,4}` | Maximum search depth | 3 |

Modules remain active unless explicitly disabled with a --no‑* flag.
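As a rough sketch, the parser implied by the table above could be reconstructed as follows. This is a hypothetical reconstruction: the real build_parser in hydra_main.py may differ in destination names and help text.

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    """Hypothetical reconstruction of Hydra's CLI from the options table."""
    p = argparse.ArgumentParser(description="HydraRAG runner (sketch)")
    p.add_argument("file_name",
                   help="dataset prefix, e.g. CWQ, hotpot, qald, simpleqa, "
                        "webqsp, webquestions, zeroshotre")
    p.add_argument("--allsource", action="store_true",
                   help="enable all sources in every expansion step")
    p.add_argument("--allr", action="store_true",
                   help="all relations (Hydra) rather than one (Hydra-E)")
    p.add_argument("--incomplete", action="store_true",
                   help="work with an incomplete KG split")
    p.add_argument("--ratio", type=int, choices=[100, 80, 50, 30], default=100,
                   help="KG sampling percentage when --incomplete is on")
    # Modules stay on unless explicitly disabled with a --no-* flag.
    p.add_argument("--no-summary", dest="summary", action="store_false")
    p.add_argument("--no-freebase", dest="freebase", action="store_false")
    p.add_argument("--no-wikikg", dest="wikikg", action="store_false")
    p.add_argument("--no-web", dest="web", action="store_false")
    p.add_argument("--no-wikidocu", dest="wikidocu", action="store_false")
    p.add_argument("--model", default="llama70b",
                   choices=["gpt3", "gpt4", "llama", "deepseek", "llama70b"])
    p.add_argument("--depth", type=int, choices=[1, 2, 3, 4], default=3)
    return p
```

For instance, `build_parser().parse_args(["webqsp", "--no-web", "--ratio", "50"])` yields a namespace with `web=False`, `ratio=50`, and the defaults `model="llama70b"`, `depth=3`.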


Examples

Run Hydra with all sources and a four‑hop search on CWQ:

python hydra_main.py CWQ --allsource --depth 4 --model gpt3

Run an ablation without web evidence and using an incomplete KG sampled at 50 percent:

python hydra_main.py webqsp \
  --no-web \
  --incomplete --ratio 50 \
  --depth 3

Outputs are stored under /Hydra/answer/ and include logs, intermediate paths, and final predictions.

Subgraph Loading:

As part of database preparation, Hydra first loads the subgraph at maximum depth from the KG. The loading time depends on the environment setup and the memory allocated to the Freebase and Wikidata servers. You can also run subgraph loading on its own:

python detected_kgsub.py webqsp # positional: dataset name prefix
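Conceptually, depth-bounded subgraph collection is a breadth-first expansion from the question's seed entities. The following is a simplified in-memory sketch, not Hydra's actual implementation in detected_kgsub.py, which works against the Freebase and Wikidata endpoints:

```python
from collections import deque

def k_hop_subgraph(adjacency, seeds, max_depth):
    """Collect every entity within max_depth hops of the seed entities,
    as a simplified stand-in for depth-bounded KG subgraph detection."""
    seen = set(seeds)
    frontier = deque((s, 0) for s in seeds)
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_depth:
            continue  # do not expand past the hop limit
        for neighbor in adjacency.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return seen
```

With a chain graph a→b→c→d and seed {a}, a depth of 2 collects {a, b, c}; this is why higher --depth values cost more loading time and memory.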

KG usage:

Hydra utilizes the Freebase and Wikidata KGs. For installation details, please see the Freebase and Wikidata folders.


Evaluation

Accuracy is computed from the answer files in Hydra/answer/. The arguments are the same as for hydra_main.py:

# positional argument: dataset name prefix
# --depth : maximum hop depth
# --allr  : use Hydra rather than Hydra-E
# --model : LLM backend
python check_answer.py webqsp --depth 3 --allr --model llama70b
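For intuition, a minimal exact-match accuracy over a list of predictions could look like the sketch below. This is illustrative only; check_answer.py's actual matching rules (e.g. alias handling or answer normalization) may differ.

```python
def exact_match_accuracy(predictions, gold_answers):
    """Fraction of questions whose predicted answer matches any gold answer,
    case-insensitively. Illustrative only, not Hydra's actual metric."""
    if not predictions:
        return 0.0
    hits = 0
    for pred, golds in zip(predictions, gold_answers):
        normalized_golds = {g.strip().lower() for g in golds}
        if pred.strip().lower() in normalized_golds:
            hits += 1
    return hits / len(predictions)
```

For example, `exact_match_accuracy(["Paris", "Berlin"], [["paris"], ["Munich"]])` returns 0.5.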

Notes

  • Ensure that the dataset files and model configurations are correctly set up before running the scripts.
  • Use appropriate depth values based on the complexity of the dataset and required accuracy.

Claims

This project is released under the Apache 2.0 license. The authors assume no legal responsibility for the model's output and will not be held liable for any damages resulting from use of these resources and outputs.
