Skip to content

RAG reference implementation: Here’s how to structure a RAG service with evals, guardrails, observability, and governance — so teams don’t reinvent the wheel

License

Notifications You must be signed in to change notification settings

Synapse-Flux-Lab/rag-loom

Repository files navigation

📚 RAG Loom - RAG Service (Reference Implementation)

🔹 Overview

This project is an Reference implementation of a Retrieval-Augmented Generation (RAG) service. Explore the hosted documentation at RAG Loom Docs.

Unlike most RAG demos, this repo shows how to integrate document ingestion, vector search, LLM orchestration, evaluation, observability, and guardrails into a cohesive, deployable microservice.

👉 It’s a starter kit for AI platform teams: opinionated, modular, and focused on enterprise readiness.

( This repository is the community edition of RAG Loom. Looking for the closed-source pro version? or want to become our design partners shaping RAG Loom’s next features: Join the waiting list via our contact form. )


🔹 Why This Matters

For professionals: Most RAG examples stop at “fetch docs and query an LLM.” This project goes further — adding eval pipelines, observability, and safety mechanisms so engineers and product teams can see what production looks like.


🔹 Features

  • FastAPI microservice for clean API endpoints
  • Vector database integration (pgvector / Milvus / Weaviate)
  • RAG orchestration (chunking, embedding, retrieval, answer generation)
  • Evaluation harness (Ragas / Evals) for quality scoring
  • Tracing & Observability (Langfuse, structured logging, metrics)
  • Enterprise Guardrails (PII redaction, profanity filter)
  • Prompt & dataset versioning (Weights & Biases)
  • Deployment ready (Dockerfile, Kubernetes manifests)

🔹 Architecture

Pipeline flow: PDFs → Chunking → Embeddings → Vector DB → Retrieval → LLM → Eval/Guardrails → API Response

Supporting layers:

  • Observability: logs, traces, metrics
  • Governance: SLOs, versioning, risk register

🔹 Roadmap

  • Core infra (FastAPI, vector DB, RAG pipeline, basic observability)
  • Add evals (Ragas/Evals), tracing (Langfuse), Docker/K8s deployment
  • Add guardrails (PII filter, profanity check), prompt/dataset versioning

License

This project is licensed under the PolyForm Noncommercial License 1.0.0. Commercial licensing and support are provided by Synapse Flux Lab.

  • ✅ Free for noncommercial use: research, personal projects, internal testing, and prototyping
  • 💼 Commercial/production use requires a paid license from Synapse Flux Lab

Third-party libraries remain under their original licenses (see NOTICE).

For commercial licensing and support, contact us.

About

RAG reference implementation: Here’s how to structure a RAG service with evals, guardrails, observability, and governance — so teams don’t reinvent the wheel

Resources

License

Contributing

Stars

Watchers

Forks

Packages

No packages published