Which type of index are you using? mode="embedding"
is only needed for list and tree indexes.
You can try increasing the top k as well: response = index.query(..., similarity_top_k=3)
Also, you can check the response object to see which nodes were used to create the answer: response.source_nodes
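To make the idea concrete, here's a tiny pure-Python sketch of what similarity_top_k does conceptually (this is not LlamaIndex's actual implementation, just an illustration: score each stored node's embedding against the query embedding and keep the k best; the node texts and 2-d embeddings are made up):

```python
import math

def cosine_similarity(a, b):
    # Standard cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_k_nodes(query_embedding, nodes, k=3):
    """nodes is a list of (text, embedding) pairs; return the k most similar."""
    scored = [(cosine_similarity(query_embedding, emb), text)
              for text, emb in nodes]
    scored.sort(reverse=True)  # highest similarity first
    return scored[:k]

# Toy "index" of three nodes with fake 2-d embeddings.
nodes = [
    ("installation guide", [0.9, 0.1]),
    ("api reference",      [0.2, 0.8]),
    ("changelog",          [0.5, 0.5]),
]
print(top_k_nodes([1.0, 0.0], nodes, k=2))
```

Raising k just widens that cut, so more candidate nodes reach the LLM, which is why it can rescue answers the top-1 match misses.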
At the end of the day though, it's up to the LLM to figure out the final answer. Even if the correct answer is in the source nodes, they aren't always the smartest lol. But usually it should be working
this is really helpful, thanks @Logan M . Currently I'm using a simple vector index, I simply don't know what's the best index for my case. I want to index around 15 markdown documents and use the chat to provide answers about these documents. It seems a list index is more "precise" for documentation, but queries are more expensive since it needs to iterate the whole list for every query, right?
Exactly! A vector index should work fine though for that case as well, you might just need a higher top_k. You can also set response_mode="compact"
to speed up response times in the query call. (It will stuff as much as it can into each LLM call, rather than one call per node)
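A rough sketch of the difference between the default one-call-per-node refine and compact mode: compact greedily packs node texts into as few prompts as fit under a context budget. The packing logic and the 1000-character budget here are illustrative only, not LlamaIndex's internals:

```python
def pack_compact(node_texts, budget=1000):
    """Greedily stuff node texts into prompts of at most `budget` characters."""
    prompts, current = [], ""
    for text in node_texts:
        # Start a new prompt when the next node would overflow the budget.
        if current and len(current) + len(text) > budget:
            prompts.append(current)
            current = ""
        current += text
    if current:
        prompts.append(current)
    return prompts

# Five retrieved nodes of ~400 characters each.
nodes = ["a" * 400, "b" * 400, "c" * 400, "d" * 400, "e" * 400]
# Default refine mode would make one LLM call per node: 5 calls.
# Compact packs two 400-char nodes per 1000-char prompt: 3 calls.
print(len(pack_compact(nodes)))
```

Fewer LLM calls is where the speedup comes from, especially as similarity_top_k grows.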
A tree index might also be interesting to try, but it's a little expensive to build as it uses the LLM during index construction (the build cost would be similar to one list index query). But the queries will be more efficient than a list index
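A back-of-envelope sketch of that query-cost difference, measured in LLM calls, under the simplifying assumption that a list query touches every node while a tree query descends one level at a time (the branching factor of 10 is an assumption for illustration):

```python
import math

def list_query_calls(num_nodes):
    # A list index refines the answer across every node: one call per node.
    return num_nodes

def tree_query_calls(num_nodes, branching=10):
    # A tree query descends level by level, narrowing by `branching` each
    # time, so the call count grows roughly logarithmically in num_nodes.
    calls = 0
    while num_nodes > 1:
        num_nodes = math.ceil(num_nodes / branching)
        calls += 1
    return max(1, calls)

print(list_query_calls(100))  # 100 calls for a 100-node list index
print(tree_query_calls(100))  # 2 calls for the same data in a tree
```

That's the trade-off in a nutshell: pay the LLM cost once at build time, then every query is cheap.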
I'll try those adjustments! thanks a lot!
similarity_top_k=3 did the trick! now answers are sooo good! thanks a lot @Logan M
@Logan M I noticed that answers are really good locally but not on prod. Do you think this is related to CPU power? The very same question, super precise and simple, gets answered locally but not on the server. Although in both cases it's now using the documents, that's a huge progress
ok, in prod it improved by increasing the similarity_top_k to 5 and also I changed the response mode to compact. But I can see the right answer in node 4 (out of 5) with lower similarity than 1, 2 and 3, even though the exact words of my question are in number 4.
All this happens only on the server, locally the similarity scores make sense
Huh, that's super strange
Your index has the exact same data on the server and locally? Created with the same settings?
created with the exact same script, I've checked the beginning of the index and it looks the same. To be sure I'll use the local index on the server and see. Besides that, only hardware and OS are different
I will also try gpt3.5-turbo model and see how it goes
Yea, if the problem continues, maybe gather up a solid example and post an issue on the repo. To me, it should be working if everything is the exact same