Two questions. First, with Qdrant as the vector store, does creating the index also pass the extra_info from the documents into Qdrant as the payload? Second, for PromptHelper, what configuration should I use if I want the output to use as many of the tokens not taken up by the prompt as possible?
Hmm, so for PromptHelper I see it calculates the token space as context_length - num_output - num_input. What is this used for? Is it just to decide whether to refine the given text? And what happens if I set num_output to something like 0 because I want it to use as much of the available space as possible? xD
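To make the arithmetic in the question concrete, here is a minimal sketch of the formula as I read it. It is not PromptHelper's actual code: `available_token_space` is a hypothetical helper, and the numbers (a 4096-token context, a 500-token prompt) are made up for illustration.

```python
def available_token_space(context_length: int, num_output: int, num_input: int) -> int:
    """Hypothetical helper: tokens left for text after reserving output and prompt.

    Mirrors the formula quoted above (context_length - num_output - num_input);
    it is an illustration, not the library's implementation.
    """
    remaining = context_length - num_output - num_input
    if remaining < 0:
        # The prompt plus the reserved output already exceed the context window.
        raise ValueError("prompt and reserved output do not fit in the context")
    return remaining


# Assumed example values: 4096-token context, 500-token prompt.
with_reserve = available_token_space(4096, num_output=256, num_input=500)
no_reserve = available_token_space(4096, num_output=0, num_input=500)
print(with_reserve, no_reserve)
```

In this sketch, setting num_output to 0 just means nothing is held back for the answer, so the space for text grows by exactly the amount that was reserved before. Whether the model then still has room to generate a reply is a separate question that this arithmetic alone does not answer.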