Hi guys. I’m new to building RAG pipelines. Currently I’m trying to build a RAG that interfaces with my schools cs department info so I can ask it basic questions. I have a couple questions.
- I used fire crawl to crawl the website and deeper links, so I have a huge json file. I took that JSON and cleaned that up and made it a txt.
- Used hugging face embeddings but from here I’m stuck on what indexing and storing actually means and how I should do it effectively. Right now the Llm given context hallucinates A LOT!