How would I use a simple LangChain LLM with a simple chat engine, and handle memory and token limits, in LlamaIndex? All I'd like to do is build a chat interface using LlamaIndex and a custom LangChain LLM. The docs seem to suggest ChatMemoryBuffer, but I can't really understand how it works: does it summarize the history when the token limit is reached, or does it just drop the least recent messages?
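
For context, here's roughly what I'm trying. This is just a sketch: I'm assuming the llama-index-llms-langchain integration package and recent import paths (older versions import these from `llama_index.llms` / `llama_index.memory` instead), and `ChatOpenAI` is only a stand-in for my custom LangChain LLM.

```python
from langchain_openai import ChatOpenAI  # stand-in for my custom LangChain LLM

from llama_index.core.chat_engine import SimpleChatEngine
from llama_index.core.memory import ChatMemoryBuffer
from llama_index.llms.langchain import LangChainLLM

# Wrap the LangChain LLM so LlamaIndex can use it
llm = LangChainLLM(llm=ChatOpenAI(model="gpt-3.5-turbo"))

# Memory buffer with a token limit -- this is the part I don't understand:
# what happens to the chat history once token_limit is exceeded?
memory = ChatMemoryBuffer.from_defaults(token_limit=3000)

chat_engine = SimpleChatEngine.from_defaults(llm=llm, memory=memory)

print(chat_engine.chat("Hello!"))
print(chat_engine.chat("What did I just say?"))  # should be answered from memory
```

Is this the right way to wire the pieces together, or is there a better pattern for handling memory and token limits here?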