I'm having an issue with LlamaIndex response time in context chat mode. I'm using a DatabaseReader to read a row from a SQL SELECT, and that row contains a very long, poorly formatted text. Honestly, it should have been a regular PDF/text document. Does that make performance a lot worse? It seems like it does. Sometimes I just wait indefinitely for a response. Is there a way to set a time limit on retrieval so the chat gives me some answer anyway?
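For concreteness, something like this is what I'm hoping for. A minimal sketch, assuming the chat engine exposes an async `achat()` (the `chat_with_timeout` wrapper and the 30 s value are mine, not from the library):

```python
import asyncio


async def chat_with_timeout(chat_engine, message: str, timeout_s: float = 30.0) -> str:
    """Return the engine's answer, or a fallback string if it takes too long."""
    try:
        # achat() is the async counterpart of chat() on LlamaIndex chat engines
        response = await asyncio.wait_for(chat_engine.achat(message), timeout=timeout_s)
        return str(response)
    except asyncio.TimeoutError:
        # give up on retrieval and return *some* answer instead of hanging forever
        return "Sorry, that took too long -- try rephrasing the question."
```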
I don't know if the system prompt or the chat memory is somehow causing the bug. I've manually set a chat memory buffer with a 3000-token limit. I've also noticed that retrieval works better without a system prompt (or with a different one). Here's roughly my setup, see the sketch below.
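A sketch of what I'm doing; the connection URI, query, and prompt text are placeholders, not my real values:

```python
from llama_index.core import VectorStoreIndex
from llama_index.core.memory import ChatMemoryBuffer
from llama_index.readers.database import DatabaseReader

# placeholder connection string and query -- the real row holds one very long text
reader = DatabaseReader(uri="sqlite:///example.db")
documents = reader.load_data(query="SELECT long_text_column FROM my_table")
index = VectorStoreIndex.from_documents(documents)

# cap the chat history at 3000 tokens
memory = ChatMemoryBuffer.from_defaults(token_limit=3000)

chat_engine = index.as_chat_engine(
    chat_mode="context",
    memory=memory,
    system_prompt="You are a helpful assistant.",  # retrieval seems better without this
)
```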
@Logan M The biggest issue is the insane waiting time. I don't know if it's doing something or just stuck, since I haven't been able to wait until the end. I tried updating to the latest stable version, but it still happens from time to time.