Troubleshooting interactive conversation issues with a new embedding model

At a glance

The community member is using the context mode chat engine and a new embedding model (OpenAI's text-embedding-3-large), but they are no longer able to have an interactive conversation. They share their code settings and how they create the chat history, but they are not facing any errors. The issue seems to be that the chat engine is not responding to follow-up questions anymore, even though the prompt has not changed.

The community members discuss potential solutions, such as checking if the follow-up questions are bringing the correct nodes, and whether they need to implement ChatMemory into their code to get the chat functionality to work with the newer llama-index versions. They also mention that they are manually managing the chat history by taking the 3 most recent messages, but this approach is no longer working when they switch to the new embedding model.

The community members suggest creating a Colab notebook to reproduce the issue, and one member shares a link to their code files, but there is no explicitly marked answer in the comments.

Hello, I'm using the context mode chat engine and am testing out a new embedding model (OpenAI's text-embedding-3-large). However, when I run my chat engine with it, I am no longer able to have an interactive conversation. Can someone help me out?
hey, would you mind sharing the code ?
storage_context = StorageContext.from_defaults(persist_dir=f"{product_code}_llama")
index = load_index_from_storage(storage_context)
engine = index.as_chat_engine(
    chat_mode="context",
    verbose=True,
    system_prompt=prompt,
)
And these are my settings:
Settings.llm = OpenAI(model=GPT_MODEL, temperature=0.0, max_tokens=3000)
Settings.embed_model = OpenAIEmbedding(model="text-embedding-3-large")
Settings.chunk_size = 256
Settings.chunk_overlap = 64
How I create the chat_history:
prior_conv.append(
    ChatMessage(
        role=MessageRole.USER, content=START_CONTENT + question + END_CONTENT
    )
)
prior_conv.append(
    ChatMessage(
        role=MessageRole.ASSISTANT, content=START_CONTENT + answer + END_CONTENT
    )
)
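The append pattern above, combined with the "keep the 3 most recent messages" trimming described later in the thread, can be sketched as one small helper. This is a standalone illustration, not the poster's actual code: a stand-in dataclass replaces llama_index's `ChatMessage` so the sketch runs without the library, and `START_CONTENT`/`END_CONTENT` are placeholders for the delimiters used in the original snippet.

```python
from dataclasses import dataclass


@dataclass
class ChatMessage:
    """Stand-in for llama_index.core.llms.ChatMessage; in the real app,
    import ChatMessage and MessageRole from llama_index.core.llms."""
    role: str
    content: str


# Placeholders for the delimiter constants in the original code.
START_CONTENT = ""
END_CONTENT = ""


def record_turn(history, question, answer, max_messages=3):
    """Append one user/assistant exchange, then keep only the most
    recent max_messages entries (mirroring the thread's 3-message window)."""
    history.append(
        ChatMessage(role="user", content=START_CONTENT + question + END_CONTENT)
    )
    history.append(
        ChatMessage(role="assistant", content=START_CONTENT + answer + END_CONTENT)
    )
    return history[-max_messages:]
```

Note that with a hard 3-message window, the oldest user message is dropped first, so the history passed to the engine can start mid-exchange.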
What error are you facing? Can you share that as well?
No error; it just won't answer follow-up questions anymore, even though my prompt didn't change
You can check whether the follow-up questions are retrieving the correct nodes or not
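One way to check which nodes a follow-up question retrieves is to inspect `response.source_nodes` on the chat response. A minimal sketch, assuming the usual llama-index response shape (each source node has a `.score` and a `.node` exposing `get_content()`):

```python
def summarize_sources(response, max_chars=80):
    """Return one line per retrieved node: its similarity score and a
    short preview of its content, for eyeballing retrieval quality."""
    lines = []
    for src in response.source_nodes:
        preview = src.node.get_content()[:max_chars].replace("\n", " ")
        lines.append(f"score={src.score:.3f}  {preview}")
    return lines
```

Calling this after a follow-up question and comparing the previews against what the old embedding model retrieved should show whether the new model is pulling irrelevant nodes.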
I'm not sure what you mean by "it won't follow up"?
How are you calling the engine? Are you passing in that chat history list?
Ok, let's say I ask the chatbot how to upload data. Then it gives me a 5-step plan like "step 1: click here, step 2: do this", et cetera. Now let's say step 3 is a little unclear to me; then as a follow-up question I might ask: "Can you explain step 3 a little more clearly?". And this breaks with the new embedding.
This is how I call the chat engine: response = engine.chat(question, chat_history=prior_conv)
I'll try that, thanks!
Should the final nodes used for the answer include the provided chat_history?
Anything obvious I'm missing or do you think the problem is more nuanced?
@WhiteFang_Jr @Logan M Could it be that I have to implement ChatMemory into my code to get the chat functionality to work with the newer llama-index versions? I currently manage that manually
Seems like however you manage it manually might be buggy? Both approaches should work fine
The way I manage it manually is by just taking the 3 most recent messages. Anything beyond that gets cut off. That currently works, but when switching to the new embedding model and using Settings, it all of a sudden doesn't work anymore, and it's really frustrating.
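One possible reason a bare follow-up fails in context mode, worth checking alongside the node inspection suggested above: retrieval is driven by the latest user message, and "Can you explain step 3?" shares almost nothing with the indexed text. A toy term-overlap score illustrates the direction of the effect (real embedding similarity behaves more subtly, and this is an illustration, not a diagnosis of this specific setup):

```python
def term_overlap(query, doc):
    """Toy retrieval score: fraction of query terms that appear in the document."""
    query_terms = set(query.lower().split())
    doc_terms = set(doc.lower().split())
    return len(query_terms & doc_terms) / len(query_terms)


doc = "to upload data click the upload button then select a file"
bare_followup = "can you explain step 3 a little more clearly"
condensed = "how do I upload data what happens after I click the upload button"

# The bare follow-up barely matches the document, while a standalone
# rewrite of the question (what a condense-style chat mode produces
# before retrieval) matches much better.
```

This is why condense-style chat modes rewrite the follow-up into a standalone question before retrieving, rather than embedding the follow-up verbatim.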
@Torsten possible to make a colab notebook to reproduce? I'm sure it's an easy fix
Hi @Logan M, sorry for the late response, but if you could still help me that would be very much appreciated :). This is the link to the code files: https://drive.google.com/drive/folders/1G6FRsClhqwGYZDJPww2ouAouMXJr7Uwt?usp=drive_link I can't share everything, unfortunately, but these files capture how I implemented the chat_history. If it's unclear or you need more information, just let me know.