Innopolis University DevOps Playground

RAG

Retrieval Augmented Generation - generation based on the context from documents.

RAG includes several steps:

Splitting documents
Embedding splits into the vector database
Retrieving most similar to the user's query chunks from the vector db
Providing these chunks (+ the user's query) to the LLM as a context

We need to implement the following:

Function, to split and embed .md documents
Function, to split and embed .pdf documents
Function (or class) to have a single instance of vector db per folder
Retriever (+reranker - advanced RAG with better outcome) function
LLM call function

Moreover, user wants to have a reference to the document, from which LLM got information

Edited Jun 15, 2024 by Aleksandr Malyy