@afewell My apologies. I should have clarified that I am not seeing the previous responses. I believe the LLM is being queried multiple times to refine the response, so what I wrote above is all I'm getting from an initial question. I'm not using any refine templates, so I didn't think this would be the case. For additional context, this is my query engine syntax:
query_engine = index.as_query_engine(similarity_top_k=3, text_qa_template=QA_PROMPT)
response = query_engine.query(query_str)
and the QA prompt is taken straight from the docs. I appreciate the assist!