Hitesh Sahu
Hitesh SahuHitesh Sahu
  1. Home
  2. ›
  3. work
  4. ›
  5. …

  6. ›
  7. 3 rag factory

Loading ⏳
Fetching content, this won’t take long…


💡 Did you know?

🐙 Octopuses have three hearts and blue blood.

🍪 This website uses cookies

No personal data is stored on our servers however third party tools Google Analytics cookies to measure traffic and improve your website experience. Learn more

AI-Machine-Learning

    AI & Machine Learning
    • AI Infrastructure & LLM Platform

    • Model Gym

    • RAG Factory

    • NVIDIA Super POD

    • GPU Fabric Bench

    • Prompt Bridge


    Cloud & DevOps

    Full-Stack Applications

    Mobile Development

Cover Image for RAG Factory
AI & Machine Learning

RAG Factory

Personal / Open Source

Ongoing

Creator / Maintainer

AI Infrastructure & LLM

Tech Stack
RAG
pgvector
ChromaDB
Mistral
Ollama
OpenAI API
React
FastAPI

Summary

Universal RAG engine converting any input source into a queryable vector knowledge base across multiple LLM providers.


What I Built

Project Overview

RAG Factory is an open-source AI platform that transforms virtually any data source into a searchable knowledge base for Large Language Models. The project provides a unified ingestion, indexing, retrieval, and generation pipeline capable of processing documents, websites, repositories, cloud storage, and structured data sources.

Unlike traditional RAG implementations that focus on a single document format or vector database, RAG Factory was designed as a universal retrieval platform with pluggable components for ingestion, embeddings, vector stores, and LLM providers.

The goal is to simplify enterprise knowledge management by providing a consistent framework for converting fragmented information into AI-accessible context while remaining independent of any specific model vendor or storage technology.


Key Features

Universal Data Ingestion

Supports ingestion from multiple data sources including:

  • Text documents
  • PDF and Office documents
  • OpenDocument formats (ODF)
  • Websites and URLs
  • Git repositories
  • AWS S3 buckets
  • Markdown documentation
  • Structured datasets

Pluggable Vector Storage

Supports multiple vector database backends through a common abstraction layer:

  • PostgreSQL + pgvector
  • ChromaDB
  • Future extensibility for additional vector stores

Multi-Provider LLM Support

Provides a unified interface for interacting with multiple model providers:

  • OpenAI
  • Mistral AI
  • Ollama
  • Self-hosted models

Retrieval-Augmented Generation

Implements semantic search, retrieval, context assembly, and response generation workflows for knowledge-intensive applications.

End-to-End User Experience

Includes a React-based frontend and FastAPI backend for ingestion, indexing, administration, and querying.


My Contributions

  • Designed the overall architecture and plugin-based framework.
  • Built the document ingestion pipeline supporting heterogeneous data sources.
  • Implemented chunking, embedding generation, and indexing workflows.
  • Developed abstraction layers for vector databases and LLM providers.
  • Integrated pgvector and ChromaDB storage backends.
  • Implemented retrieval pipelines and semantic search capabilities.
  • Developed FastAPI backend services for ingestion and querying.
  • Built React-based user interfaces for knowledge management and search.
  • Created deployment workflows and open-source project infrastructure.
  • Authored documentation, examples, and developer onboarding guides.

Technical Highlights

Universal Knowledge Ingestion

Designed a modular ingestion framework capable of normalizing diverse data sources into a consistent retrieval pipeline.

Vendor-Neutral Architecture

Decoupled retrieval, storage, embedding, and generation layers, enabling users to switch providers without application-level changes.

Extensible Plugin Framework

Implemented provider abstractions that allow new vector stores, embedding models, and LLMs to be integrated with minimal effort.

Production-Oriented Design

Focused on scalability, maintainability, and deployment flexibility rather than building a single-purpose chatbot.

Enterprise Knowledge Enablement

Created a foundation for internal copilots, document assistants, knowledge search platforms, and AI-powered workflows.


Challenges & Solutions

Challenge

Most RAG frameworks are tightly coupled to specific document formats, vector databases, or model providers, making migrations and enterprise adoption difficult.

Solution

Built a modular architecture that separates ingestion, embedding generation, retrieval, storage, and generation into independent components connected through well-defined interfaces.

Outcome

RAG Factory enables organizations to build retrieval-powered AI applications while remaining flexible in their choice of data sources, vector databases, and language models.


Technology Stack

Backend FastAPI, Python

Frontend React, TypeScript

Vector Databases pgvector, ChromaDB

LLMs OpenAI, Mistral, Ollama

Embeddings Sentence Transformers, OpenAI Embeddings

Storage PostgreSQL, AWS S3

AI Retrieval-Augmented Generation (RAG), Semantic Search

Architecture Plugin Framework, Provider Abstractions, Multi-Tenant Knowledge Systems

Domain Enterprise Search, Knowledge Management, AI Infrastructure, Agentic Systems

← Previous

Model Gym

Next →

NVIDIA Super POD

Let's work together
+49 176-2019-2523
hiteshkrsahu@gmail.com
WhatsApp
Skype
Munich 🥨, Germany 🇩🇪, EU
Playstore
Hitesh Sahu's apps on Google Play Store
Need Help?
Let's Connect
Navigation
  Home/About
  Skills
  Work/Projects
  Lab/Experiments
  Contribution
  Awards
  Art/Sketches
  Thoughts
  Contact
Links
  Sitemap
  Legal Notice
  Privacy Policy

Made with

NextJS logo

NextJS by

hitesh Sahu

| © 2026 All rights reserved.