Hi everyone, I'd appreciate any help with this.

query_engine = index.as_query_engine(
    similarity_top_k=1,
    retriever_mode='embedding',
    response_mode='compact',
    text_qa_template=QA_PROMPT,
    service_context=service_context,
    verbose=True
)

Here I am setting the response mode to compact, but the query engine still uses the create-and-refine method. Can anyone help, please?

PS: The context is less than 200 tokens, so the context window is not fully used. (I mention this because I read in the documentation that if the chunk can't fit in the context window, the create-and-refine prompt method is used, but that is not the case here.)
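
To make the rule from the documentation concrete, here is a minimal sketch of how I understand compact mode: the retrieved text is packed into as few pieces as fit the available space, the first piece is answered with the QA template, and every extra piece triggers a refine step. This is not LlamaIndex's actual code. The function names, the word-based token count, and the example numbers are all assumptions made up for illustration.

```python
# Illustrative sketch only, NOT LlamaIndex's implementation.
# count_tokens, compact_chunks and plan_llm_calls are hypothetical names.

def count_tokens(text: str) -> int:
    """Crude token estimate: one token per whitespace-separated word."""
    return len(text.split())


def compact_chunks(chunks, context_window, prompt_tokens, num_output):
    """Pack retrieved chunks into as few prompt-sized pieces as possible."""
    # Space left for context after the prompt template and reserved output.
    available = context_window - prompt_tokens - num_output
    if available <= 0:
        raise ValueError("prompt and reserved output leave no room for context")
    words = " ".join(chunks).split()
    return [
        " ".join(words[i:i + available])
        for i in range(0, len(words), available)
    ]


def plan_llm_calls(chunks, context_window, prompt_tokens, num_output):
    """Return which template each LLM call would use, in order."""
    packed = compact_chunks(chunks, context_window, prompt_tokens, num_output)
    # The first piece uses the QA template; each further piece is a refine step.
    return ["text_qa_template"] + ["refine_template"] * (len(packed) - 1)


# A context of 200 "tokens", as in the question above.
context = [" ".join(["tok"] * 200)]

# Large window: everything fits in one piece, so no refine step.
roomy = plan_llm_calls(context, context_window=4096, prompt_tokens=100, num_output=256)

# Small effective window: the same 200 tokens are split, so refine is used.
tight = plan_llm_calls(context, context_window=512, prompt_tokens=100, num_output=256)
```

In this sketch a 200-token context still ends up in the refine path when the window left after the prompt template and the reserved output is smaller than the context. So it might be worth checking the context window and output size configured in service_context, and the length of QA_PROMPT, though I can't say for sure that this is the cause here.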