For my chat engine I used this code:

vector_query_engine = vector_index.as_chat_engine(text_qa_template=text_qa_template, refine_template=refine_template, response_mode="condense_plus_context")

The responses from this engine are not condensed; their length is different every time. How can I standardize the response size?
Won't this part fix it?

chat_text_qa_msgs = [
    ChatMessage(
        role=MessageRole.SYSTEM,
        content=(
            "Always answer the question, even if the context isn't helpful. \n"
            "Max number of answer tokens in 256 with 30 tokens exceeding limit if you really cannot write in any other way \n"
            "To the best of your ability and the context you try to provide helpful information about anything regarding WH2C project.\n"
            "You politely refuse to answer and questions that might differ from the context of your {context_str}. \n"