An API clone of LlamaIndex core functionality with less code.

TimS-ml/Boring-RAG

😑😑😑 Boring RAG 😑😑😑

(WIP) A RAG pipeline whose LLM generates boring text. Everything is implemented from scratch.

Ref:

TODO

  • Preprocessing + PDF Reader

  • Chunking, SentenceSplit

  • Embedding chunks

  • Save the embedding

  • Similarity Search

  • (TBD) Embedding pooling

  • BaseIndex Draft

  • Ingestion Pipeline: pack the splitter and embedding steps into IngestionPipeline

  • Storage Context

  • BaseEmbedding.similarity vs SimpleVectorStore.query vs RetrieverQueryEngine._query

  • Retrieval

  • Generation

  • Re-org demo_rag.py
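The chunking, embedding, and similarity-search items above can be sketched end to end as follows. This is only an illustration and not the code in this repo: the function names (`split_sentences`, `chunk_sentences`, `embed`, `cosine_similarity`, `top_k`) are hypothetical, and the bag-of-words `embed` is a toy stand-in for a real embedding model.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def chunk_sentences(sentences, max_chars=80):
    """Greedily pack whole sentences into chunks of roughly max_chars characters.

    A single sentence longer than max_chars becomes its own (oversized) chunk.
    """
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def embed(text):
    """Toy bag-of-words 'embedding' (token -> count); an assumption, not a real model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query, chunks, k=2):
    """Return the k (score, chunk) pairs most similar to the query."""
    query_vec = embed(query)
    scored = [(cosine_similarity(query_vec, embed(c)), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


document = (
    "Retrieval finds relevant chunks. "
    "Generation writes an answer from those chunks. "
    "Boring text is still text."
)
chunks = chunk_sentences(split_sentences(document), max_chars=50)
results = top_k("which step finds relevant chunks", chunks, k=1)
print(results[0][1])
```

Swapping `embed` for a call to an actual embedding model, and caching its output, would cover the "Save the embedding" step; the retrieval logic in `top_k` would stay the same.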

BUGs

  • PDF documents need post-processing
  • Chunk metadata is not saved
  • Requires Pydantic 1.10.14

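For the chunk metadata bug, one possible direction is to copy the parent document's metadata onto every chunk at split time. The sketch below assumes hypothetical `Document` and `Chunk` dataclasses and a hypothetical `chunk_document` helper with fixed-size character windows; none of these names come from this repo or from LlamaIndex.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    # Hypothetical container: raw text plus arbitrary metadata.
    text: str
    metadata: dict = field(default_factory=dict)


@dataclass
class Chunk:
    # Hypothetical container: a slice of the document with its own metadata.
    text: str
    metadata: dict = field(default_factory=dict)


def chunk_document(doc, chunk_size=40):
    """Split doc.text into fixed-size character windows, copying metadata to each chunk."""
    chunks = []
    for index, start in enumerate(range(0, len(doc.text), chunk_size)):
        metadata = dict(doc.metadata)  # copy so chunks do not share one dict
        metadata.update({"chunk_index": index, "start_char": start})
        chunks.append(Chunk(text=doc.text[start:start + chunk_size], metadata=metadata))
    return chunks


doc = Document(text="x" * 100, metadata={"source": "example.pdf", "page": 1})
chunks = chunk_document(doc, chunk_size=40)
for chunk in chunks:
    print(len(chunk.text), chunk.metadata)
```

Copying the dict per chunk keeps later edits to one chunk's metadata from leaking into the others or back into the source document.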