Csv

At a glance

The community member is using pagedcsvreader and chromadb to query CSV data, but is finding that the top 10 most similar documents are not relevant to the query, even when specifying important keywords. The comments suggest that using top_k similarity may not be the best approach for most CSV data, and that text2sql or a similar tool may be more appropriate, especially if the CSV contains textual data like QA pairs. The community members also discuss challenges with setting up a chatbot using CSV or SQL data, including issues with handling history and memory, and mention a tutorial on using a QueryPipelineAgent. However, there is no explicitly marked answer in the comments.

Useful resources

TTech explorer

Hi all , iam used pagedcsvreader along with chromadb and when iam querying with similarity_top_k as 10 but none of the documents are relevant to query. Although iam directly specifying important keywords in query. What can I do to improve rag with CSV data

10 comments

LLogan M

You probably don't want to be doing top k with most csv data

LLogan M

Probably better to be doing text2sql or something like that

LLogan M

Unless your csv is something textual, like QA pairs

TTech explorer

Tried doing text2sql by loading data into sqlite, but it's not able to answer based on previous question/ previous output . Also even for a hello it's trying to execute SQL query . Is there any tutorial to create chatbot with CSV , SQL data where history is also taken care or we have memory buffer

LLogan M

probably you want an agent with text2sql as a tool

TTech explorer

Tried to build agent as per documentation many times it keeps executing never ending loop without response.

LLogan M

what llm are you using? what kind of agent?

LLogan M

how did you setup the tool?

TTech explorer

https://docs.llamaindex.ai/en/stable/examples/agent/agent_runner/query_pipeline_agent/

TTech explorer

I followed this article and sometimes agent is running in loop

Add a reply

Find answers from the community

Csv