I need help with RAG. I've provided a markdown table with six columns, with the latest two containing money values. The model includes 'llama2 7b llm' and 'HuggingFaceEmbeddings(model_name="sentence-transformers/all-mpnet-base-v2")', with a service context chunk size of 1024. Unfortunately, I didn't obtain any correct information. The model mixes up columns, unable to sum according to the ID column, and it doesn't recognize the latest row. Any advice would be greatly appreciated.
@ravitheja I already got almost the same advice on this server to use text2sql or markdown if tables are smaller. So I use: MarkdownReader = download_loader("MarkdownReader") but I it is not the problem to use pandas query engine if it works with llm models other then openai. But I don't think that it is a problem in a loader. Do you think that the model and embedding, chunk size as well could return to me satisfied results? I mean on a dataset I described?
hmm, I'm feeling that there is no lots of information about structured numeric data although some examples shows that llm read financial data from pdf. I'm little confused with that fact. Thank you @ravitheja on sharing your thoughts.