index = GPTSimpleVectorIndex(..., chunk_size_limit=256)
and setting similarity_top_k to a value greater than 1 at query time: index.query(..., similarity_top_k=5, response_mode="compact")
max_tokens
in the custom LLMs section here: https://gpt-index.readthedocs.io/en/latest/how_to/custom_llms.html
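
To see how these three settings trade off against each other, here is a rough back-of-the-envelope sketch. It is not part of the gpt_index API: the helper names, the 4096-token context window, and the 200-token allowance for the prompt template and question are all assumptions made for illustration. It only estimates whether the retrieved chunks plus the reply budget would fit in a single LLM call.

```python
def fits_context(chunk_size_limit, similarity_top_k, max_tokens,
                 context_window=4096, prompt_overhead=200):
    """Return True if the top-k chunks plus the reply budget fit in one call.

    context_window and prompt_overhead are assumed values, not library defaults.
    """
    # Worst case: every retrieved chunk is as large as the chunk size limit.
    retrieved_tokens = chunk_size_limit * similarity_top_k
    return retrieved_tokens + prompt_overhead + max_tokens <= context_window


def max_top_k(chunk_size_limit, max_tokens,
              context_window=4096, prompt_overhead=200):
    """Largest similarity_top_k whose chunks still fit alongside max_tokens."""
    available = context_window - prompt_overhead - max_tokens
    return max(available // chunk_size_limit, 0)


if __name__ == "__main__":
    # The settings from the message above, with a hypothetical max_tokens of 512.
    print(fits_context(chunk_size_limit=256, similarity_top_k=5, max_tokens=512))
    print(max_top_k(chunk_size_limit=256, max_tokens=512))
```

Smaller chunks leave room for a higher similarity_top_k, and a larger max_tokens takes space away from the retrieved context, so the three values are best tuned together.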