ChatMemory Buffer

Hello, I tried using the LlamaIndex OpenAI chat engine, but I am encountering one problem. If I start chatting in a way that the responses become long, it seems like the context string that gets passed to the LLM becomes too long and I hit a token limit error.
Plain Text
openai.error.InvalidRequestError: This model's maximum context length is 4097 tokens. However, your messages resulted in 4121 tokens (4070 in the messages, 51 in the functions). Please reduce the length of the messages or functions.

Has this happened to anyone else and what could I do to fix it?
5 comments
Which ChatEngine are you using?
You need to set a token limit for the chat memory.

https://docs.llamaindex.ai/en/stable/examples/chat_engine/chat_engine_context.html

Plain Text
from llama_index.memory import ChatMemoryBuffer

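# Cap the chat history at 3000 tokens so the full prompt stays under the model's 4097-token limit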
memory = ChatMemoryBuffer.from_defaults(token_limit=3000)

chat_engine = index.as_chat_engine(
    chat_mode="context",
    memory=memory,
    system_prompt="You are a chatbot, able to have normal interactions, as well as talk about an essay discussing Paul Graham's life.",
)

This should solve your case!
That is model dependent.
gpt-3.5 only has a 4097 token maximum.
gpt-4, however, has twice as much.
You can either change the model (gpt-3.5-turbo-16k could be a good option), change the memory buffer, or make your prompts smaller.
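For example, here is a rough sketch of the larger-model route, assuming the legacy llama_index ServiceContext API (exact import paths depend on your llama_index version, and the "./data" folder is just a placeholder):
Plain Text
from llama_index import ServiceContext, SimpleDirectoryReader, VectorStoreIndex
from llama_index.llms import OpenAI
from llama_index.memory import ChatMemoryBuffer

# Swap in a model with a 16k context window instead of the 4k gpt-3.5-turbo
service_context = ServiceContext.from_defaults(
    llm=OpenAI(model="gpt-3.5-turbo-16k", temperature=0)
)

# "./data" is a placeholder for wherever your documents live
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents, service_context=service_context)

# With the bigger context window, the memory buffer can be allowed to grow too
memory = ChatMemoryBuffer.from_defaults(token_limit=12000)

chat_engine = index.as_chat_engine(
    chat_mode="context",
    memory=memory,
)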
Yep, that's why they can set the token limit to match their choice of model.
Thanks, I'll try it. I actually updated to the latest version and it seems like I'm not hitting the error now, but I'll keep it in mind if it comes up again.
Yeah, the new version has a 1500 token limit set by default, I guess.