RAGv2: Next-Gen Retrieval System

🌟 Welcome to RAGv2!

If you’ve ever used ChatGPT and wished it knew about your private documents or today’s news without making things up, you’re looking for RAG.

🧐 What is RAG? (The “Open-Book Exam” Analogy)

Imagine a brilliant student taking an exam.

  • Standard AI: The student answers from memory. If they forgot a detail, they might guess (this is called a “hallucination”).
  • RAG System: The student is allowed to take the exam with their textbooks open.
    1. When asked a question, they first search the textbook for the right page.
    2. They read the information on that page.
    3. they write an answer based only on what they just read.

RAGv2 is that student, and your PDFs/Text files are the textbooks.


🏗️ Core Architecture

RAGv2 uses a unique Parent-Child Chunking strategy. Instead of feeding the AI random snippets of text, we find the specific needle in the haystack (the Child) but give the AI the whole context (the Parent) to ensure it understands the “big picture.”

📚 Quick Start Navigation


🛠️ The Beginner’s Tech Stack

We use specific tools to make this run on your local computer (no expensive cloud required!):

ToolPurposeAnalogy
llama_cppThe BrainThe actual AI model that speaks and thinks.
faissThe MemoryA specialized index that finds the right page in your “textbook” instantly.
pypdfThe EyeReads your PDF files and turns them into text.
GGUFThe FormatA special way of shrinking models (Quantization) so they fit on standard GPUs.

Last Updated: 2026-05-01

7 items under this folder.