AI & Machine Learning

RAG Factory

Personal / Open Source

Ongoing

Creator / Maintainer

AI Infrastructure & LLM

Tech Stack

RAG

pgvector

ChromaDB

Mistral

Ollama

OpenAI API

React

FastAPI

Summary

Universal RAG engine converting any input source into a queryable vector knowledge base across multiple LLM providers.

What I Built

Project Overview

RAG Factory is an open-source AI platform that transforms virtually any data source into a searchable knowledge base for Large Language Models. The project provides a unified ingestion, indexing, retrieval, and generation pipeline capable of processing documents, websites, repositories, cloud storage, and structured data sources.

Unlike traditional RAG implementations that focus on a single document format or vector database, RAG Factory was designed as a universal retrieval platform with pluggable components for ingestion, embeddings, vector stores, and LLM providers.

The goal is to simplify enterprise knowledge management by providing a consistent framework for converting fragmented information into AI-accessible context while remaining independent of any specific model vendor or storage technology.

Key Features

Universal Data Ingestion

Supports ingestion from multiple data sources including:

Text documents
PDF and Office documents
OpenDocument formats (ODF)
Websites and URLs
Git repositories
AWS S3 buckets
Markdown documentation
Structured datasets

Pluggable Vector Storage

Supports multiple vector database backends through a common abstraction layer:

PostgreSQL + pgvector
ChromaDB
Future extensibility for additional vector stores

Multi-Provider LLM Support

Provides a unified interface for interacting with multiple model providers:

OpenAI
Mistral AI
Ollama
Self-hosted models

Retrieval-Augmented Generation

Implements semantic search, retrieval, context assembly, and response generation workflows for knowledge-intensive applications.

End-to-End User Experience

Includes a React-based frontend and FastAPI backend for ingestion, indexing, administration, and querying.

My Contributions

Designed the overall architecture and plugin-based framework.
Built the document ingestion pipeline supporting heterogeneous data sources.
Implemented chunking, embedding generation, and indexing workflows.
Developed abstraction layers for vector databases and LLM providers.
Integrated pgvector and ChromaDB storage backends.
Implemented retrieval pipelines and semantic search capabilities.
Developed FastAPI backend services for ingestion and querying.
Built React-based user interfaces for knowledge management and search.
Created deployment workflows and open-source project infrastructure.
Authored documentation, examples, and developer onboarding guides.

Technical Highlights

Universal Knowledge Ingestion

Designed a modular ingestion framework capable of normalizing diverse data sources into a consistent retrieval pipeline.

Vendor-Neutral Architecture

Decoupled retrieval, storage, embedding, and generation layers, enabling users to switch providers without application-level changes.

Extensible Plugin Framework

Implemented provider abstractions that allow new vector stores, embedding models, and LLMs to be integrated with minimal effort.

Production-Oriented Design

Focused on scalability, maintainability, and deployment flexibility rather than building a single-purpose chatbot.

Enterprise Knowledge Enablement

Created a foundation for internal copilots, document assistants, knowledge search platforms, and AI-powered workflows.

Challenges & Solutions

Challenge

Most RAG frameworks are tightly coupled to specific document formats, vector databases, or model providers, making migrations and enterprise adoption difficult.

Solution

Built a modular architecture that separates ingestion, embedding generation, retrieval, storage, and generation into independent components connected through well-defined interfaces.

Outcome

RAG Factory enables organizations to build retrieval-powered AI applications while remaining flexible in their choice of data sources, vector databases, and language models.

Technology Stack

Backend FastAPI, Python

Frontend React, TypeScript

Vector Databases pgvector, ChromaDB

LLMs OpenAI, Mistral, Ollama

Embeddings Sentence Transformers, OpenAI Embeddings

Storage PostgreSQL, AWS S3

AI Retrieval-Augmented Generation (RAG), Semantic Search

Architecture Plugin Framework, Provider Abstractions, Multi-Tenant Knowledge Systems

Domain Enterprise Search, Knowledge Management, AI Infrastructure, Agentic Systems

← Previous

Model Gym

NVIDIA Super POD

AI & Machine Learning

RAG Factory

Personal / Open Source

Ongoing

Creator / Maintainer

AI Infrastructure & LLM

Tech Stack

RAG

pgvector

ChromaDB

Mistral

Ollama

OpenAI API

React

FastAPI

Summary

Universal RAG engine converting any input source into a queryable vector knowledge base across multiple LLM providers.

What I Built

Project Overview

Key Features

Universal Data Ingestion

Supports ingestion from multiple data sources including:

Text documents
PDF and Office documents
OpenDocument formats (ODF)
Websites and URLs
Git repositories
AWS S3 buckets
Markdown documentation
Structured datasets

Pluggable Vector Storage

Supports multiple vector database backends through a common abstraction layer:

PostgreSQL + pgvector
ChromaDB
Future extensibility for additional vector stores

Multi-Provider LLM Support

Provides a unified interface for interacting with multiple model providers:

OpenAI
Mistral AI
Ollama
Self-hosted models

Retrieval-Augmented Generation

Implements semantic search, retrieval, context assembly, and response generation workflows for knowledge-intensive applications.

End-to-End User Experience

Includes a React-based frontend and FastAPI backend for ingestion, indexing, administration, and querying.

My Contributions

Designed the overall architecture and plugin-based framework.
Built the document ingestion pipeline supporting heterogeneous data sources.
Implemented chunking, embedding generation, and indexing workflows.
Developed abstraction layers for vector databases and LLM providers.
Integrated pgvector and ChromaDB storage backends.
Implemented retrieval pipelines and semantic search capabilities.
Developed FastAPI backend services for ingestion and querying.
Built React-based user interfaces for knowledge management and search.
Created deployment workflows and open-source project infrastructure.
Authored documentation, examples, and developer onboarding guides.

Technical Highlights

Universal Knowledge Ingestion

Designed a modular ingestion framework capable of normalizing diverse data sources into a consistent retrieval pipeline.

Vendor-Neutral Architecture

Decoupled retrieval, storage, embedding, and generation layers, enabling users to switch providers without application-level changes.

Extensible Plugin Framework

Implemented provider abstractions that allow new vector stores, embedding models, and LLMs to be integrated with minimal effort.

Production-Oriented Design

Focused on scalability, maintainability, and deployment flexibility rather than building a single-purpose chatbot.

Enterprise Knowledge Enablement

Created a foundation for internal copilots, document assistants, knowledge search platforms, and AI-powered workflows.

Challenges & Solutions

Challenge

Most RAG frameworks are tightly coupled to specific document formats, vector databases, or model providers, making migrations and enterprise adoption difficult.

Solution

Built a modular architecture that separates ingestion, embedding generation, retrieval, storage, and generation into independent components connected through well-defined interfaces.

Outcome

RAG Factory enables organizations to build retrieval-powered AI applications while remaining flexible in their choice of data sources, vector databases, and language models.

Technology Stack

Backend FastAPI, Python

Frontend React, TypeScript

Vector Databases pgvector, ChromaDB

LLMs OpenAI, Mistral, Ollama

Embeddings Sentence Transformers, OpenAI Embeddings

Storage PostgreSQL, AWS S3

AI Retrieval-Augmented Generation (RAG), Semantic Search

Architecture Plugin Framework, Provider Abstractions, Multi-Tenant Knowledge Systems

Domain Enterprise Search, Knowledge Management, AI Infrastructure, Agentic Systems

← Previous

Model Gym

NVIDIA Super POD

AI-Machine-Learning

AI & Machine Learning

Cloud & DevOps

Full-Stack Applications

Mobile Development

RAG Factory

Personal / Open Source

Tech Stack

Summary

What I Built

Project Overview

Key Features

Universal Data Ingestion

Pluggable Vector Storage

Multi-Provider LLM Support

Retrieval-Augmented Generation

End-to-End User Experience

My Contributions

Technical Highlights

Universal Knowledge Ingestion

Vendor-Neutral Architecture

Extensible Plugin Framework

Production-Oriented Design

Enterprise Knowledge Enablement

Challenges & Solutions

Challenge

Solution

Outcome

Technology Stack

Fetching content, this won’t take long…

🐙 Octopuses have three hearts and blue blood.

AI-Machine-Learning

AI & Machine Learning

Cloud & DevOps

Full-Stack Applications

Mobile Development

RAG Factory

Personal / Open Source

Tech Stack

Summary

What I Built

Project Overview

Key Features

Universal Data Ingestion

Pluggable Vector Storage

Multi-Provider LLM Support

Retrieval-Augmented Generation

End-to-End User Experience

My Contributions

Technical Highlights

Universal Knowledge Ingestion

Vendor-Neutral Architecture

Extensible Plugin Framework

Production-Oriented Design

Enterprise Knowledge Enablement

Challenges & Solutions

Challenge

Solution

Outcome

Technology Stack