Skip to content

antgroup/ANT-Fin-RAG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification

This repository primarily consists of two parts: our RLFKV code implementation and the Financial Data Fidelity Evaluation Dataset FDD-ANT, which includes various types of data such as stocks, funds, and macroeconomics.

RLFKV Implementation Details

Our RLFKV addresses hallucination mitigation in financial text generation through fine-grained knowledge verification. The implementation is built on the ms-swift framework, with the core contribution being the RLFKV function.

This method decomposes long-form responses into fine-grained atomic knowledge units and verifies the factual accuracy of each unit, providing granular signals to guide the model during reinforcement fine-tuning. The prompt template is available at figs/prompt_template.jpg.

Once our paper is accepted, we will upload the code.

Installation

cd ms-swift-rlfkv
pip install -e .

training

bash scripts/train.sh

evaluating

bash scripts/evaluate.sh

FDD-ANT Details

The comparative analysis with BizFinBench's FDD reveals the following advantages of FDD-ANT: Superior coverage (3 financial data types versus FDD's single type) and Enhanced complexity (13× longer average context length).

Dimension FDD FDD-ANT
Data Types Stocks only Stocks, Funds, Macro
Data Volume 1,461 samples 2,000 samples
Avg. Length 506 tokens 6,763 tokens

Contact

For questions or feedback, please reach out to our team:

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •