RAG
Retrieval Augmented Generation - generation based on the context from documents.
RAG includes several steps:
- Splitting documents
- Embedding splits into the vector database
- Retrieving most similar to the user's query chunks from the vector db
- Providing these chunks (+ the user's query) to the LLM as a context
We need to implement the following:
- Function, to split and embed .md documents
- Function, to split and embed .pdf documents
- Function (or class) to have a single instance of vector db per folder
- Retriever (+reranker - advanced RAG with better outcome) function
- LLM call function
Moreover, user wants to have a reference to the document, from which LLM got information
Edited by Aleksandr Malyy