Skip to content

A real-time financial assistant that combines market data, news, and SEC filings using Retrieval-Augmented Generation (RAG). Built with Python, KDB.AI Cloud for vector search, and HuggingFace embeddings. Ideal for research, trading insights, or fintech prototyping.

License

Notifications You must be signed in to change notification settings

chankeypathak/rag-stock-assistant

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RAG-Based Stock Assistant

This project is a Retrieval-Augmented Generation (RAG) stock assistant, built using:

  • Python CLI interface
  • KDB.AI Cloud as the vector store
  • HuggingFace embeddings
  • Real-time stock data sources: yFinance, RSS, and SEC EDGAR filings

This project implements a Retrieval-Augmented Generation (RAG) pipeline tailored for financial use cases. It ingests real-time stock prices (via yFinance), market-moving news (via RSS), and company disclosures (via SEC EDGAR).

The documents are embedded using a HuggingFace transformer and stored in KDB.AI Cloud, a high-performance vector database. Queries from users are embedded, top-k relevant documents are retrieved, and responses are generated via an LLM.

Designed for easy replication with Docker, this assistant can support decision-making for investors, researchers, or algorithmic trading environments.


Use case

Category Example Queries
Market Summaries "Summarize today’s financial news."
"What happened in the stock market today?"
Stock-specific News "Any news about Reliance today?"
"What's new with TCS stock?"
Sentiment or Trend Inference "Is the market sentiment bullish today?"
"What's the tone of today's financial headlines?"
Keyword Searches "Find documents mentioning interest rates."
"Which news mentions inflation?"
Sector-based Questions "What's going on in the IT sector?"
"Any updates in the banking industry?"

Process

  1. Ingest stock prices, news, and SEC filings.
  2. Embed documents using a SentenceTransformer model.
  3. Store embeddings in KDB.AI Cloud.
  4. Accept a user query via CLI.
  5. Embed the query and retrieve top-k similar documents.
  6. Generate a natural-language answer using an LLM.

Components Overview

Component Description
ingest.py Fetch & preprocess news, stock data, filings
embed.py Generate embeddings using HuggingFace
query.py Query KDB.AI and trigger LLM-based response
docker-compose.yml Run CLI + processing modules in containers
KDB.AI Vector DB (SaaS) used for storing and querying embeddings
CLI Interface User interacts with RAG system via terminal

TODO

  • Add Streamlit UI
  • Add alerts for stock anomalies
  • Key management in Docker

About

A real-time financial assistant that combines market data, news, and SEC filings using Retrieval-Augmented Generation (RAG). Built with Python, KDB.AI Cloud for vector search, and HuggingFace embeddings. Ideal for research, trading insights, or fintech prototyping.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages