Find answers from the community

Updated last year

RAG with tables.

I need help with RAG. I've provided a markdown table with six columns, with the latest two containing money values. The model includes 'llama2 7b llm' and 'HuggingFaceEmbeddings(model_name="sentence-transformers/all-mpnet-base-v2")', with a service context chunk size of 1024. Unfortunately, I didn't obtain any correct information. The model mixes up columns, unable to sum according to the ID column, and it doesn't recognize the latest row. Any advice would be greatly appreciated.
r
M
7 comments
@MarioZ for tables, it would be good to use text2sql rather than chunking process.
@ravitheja I already got almost the same advice on this server to use text2sql or markdown if tables are smaller. So I use:
MarkdownReader = download_loader("MarkdownReader")
but I it is not the problem to use pandas query engine if it works with llm models other then openai. But I don't think that it is a problem in a loader. Do you think that the model and embedding, chunk size as well could return to me satisfied results? I mean on a dataset I described?
Not sure. Tables directly given to LLM did not always give correct answer based on my experiments.
hmm, I'm feeling that there is no lots of information about structured numeric data although some examples shows that llm read financial data from pdf. I'm little confused with that fact. Thank you @ravitheja on sharing your thoughts.
Add a reply
Sign up and join the conversation on Discord