(WIP) A RAG with LLM that generates boring text. Implemented everything from scratch.
Ref:
- Stanford CS25: V3 I Retrieval Augmented Language Models - YouTube
- https://github.com/mrdbourke/simple-local-rag/tree/main
-
Preprocessing + PDF Reader
-
Chunking, SentenceSplit
-
Embedding chunks
-
Save the embedding
-
Similarity Search
-
(TBD) Embedding pooling
-
BaseIndex Draft
-
Ingestion Pipeline: Pack into splitter and embedding into IngestionPipeline
-
Storage Context
-
BaseEmbedding.similarity vs SimpleVectorStore.query vs RetrieverQueryEngine._query
-
Retrieval
-
Generation
-
Re-org demo_rag.py
- PDF doc needs post processing
- Chunking's metadata is not saved
- Pydantic requires 1.10.14