Hello guys~ I need some help,, LlamaIndex keeps repeating the same responses or generating odd responses after completing an answer. I think it might be trying to fill up the maximum token length. What should I do?
Hello guys~ I need a help,, My Llama index take about 60sec to search, 130sec to generate. And, in the profiling result, that torch~~~ take a long time. How can I solve this?