Optimizing Chat Engine Response Times

At a glance

The post asks about the arguments the chat method of a chat engine takes and how to make the response faster. The comments suggest that the main requirement for querying is the query itself, and optionally the chat history. Community members also recommend using streaming to get the response faster, without waiting for the entire response to generate. They provide example code for streaming the response. Additionally, community members mention that the response time depends on the hardware if using an open-source language model. They also suggest referring to the documentation for more information on streaming support and accessing custom prompts.

Useful resources

PPragyan Mohapatra

What are the arguments chat method of chat engine takes and how can I make it faster to give me response
@WhiteFang_Jr @Logan M

10 comments

WWhiteFang_Jr

for querying, the prime requirement is the query along with this you can also pass the chat history if you want

WWhiteFang_Jr

If you are using open source llm then the repsonse time totally depends on your hardware

WWhiteFang_Jr

You can also try streaming the response that way you dont have to wait for the entire response to generate

PPragyan Mohapatra

How to stream the response from the chat engine

LLogan M

Plain Text

resp = chat_engine.stream_chat(...)
for r in resp.response_gen:
  print(r, end="", flush=True)

Or async

Plain Text

resp = await chat_engine.astream_chat(...)
async for r in resp.async_response_gen():
  print(r, end="", flush=True)

WWhiteFang_Jr

For more you can refer to the docs here as well: https://docs.llamaindex.ai/en/stable/examples/chat_engine/chat_engine_condense_plus_context/#streaming-support

PPragyan Mohapatra

How do I display this in chat bot , print statements cannot be displayed, right?
@WhiteFang_Jr

WWhiteFang_Jr

Here is an example for FastAPI backend: https://discord.com/channels/1059199217496772688/1217020898180075621/1217520651691360277
You can refer to this one

PPragyan Mohapatra

Thanks @WhiteFang_Jr But this doesn’t seem to have custom prompt to be fed. Can we give our custom prompt to this?

WWhiteFang_Jr

Yes, just need to change the default prompt: https://docs.llamaindex.ai/en/stable/module_guides/models/prompts/usage_pattern/#accessing-prompts

Add a reply

Find answers from the community

Optimizing Chat Engine Response Times