Updated 5 months ago

Hello guys, I need some help

My LlamaIndex setup takes about 60 seconds to search and 130 seconds to generate.
In the profiling result, the torch~~~ calls take a long time.
How can I solve this?
Attachment
2024-07-31_5.46.32.png
6 comments
Hi, we need more details.
How big is the index? Which model are you using? Which retriever? Which Query Engine?
Are you running your LLM on a CPU-based machine?
Thank you.
The model is meta-llama/Meta-Llama-3-8B-Instruct.
The retriever is based on ChromaVectorStore.
The query engine is part of the LlamaIndex framework: it is created from a VectorStoreIndex and includes a response synthesizer configured with ResponseMode.TREE_SUMMARIZE.
The index is not too big: about 17 PDFs and 10 TXT files.
Oh, I thought I was using my GPU,
but I had lost my permission to access the GPU.
I've solved it now. Thank you guys for helping me.
you helped yourself 🙂
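For anyone who hits the same slowdown: a quick way to check whether PyTorch can actually see a GPU is something like the sketch below. It only assumes that PyTorch is the backend running the model. The helper name `describe_torch_device` is made up for this example and is not part of LlamaIndex or PyTorch.

```python
import importlib.util


def describe_torch_device() -> str:
    """Return a short description of the device PyTorch would use.

    Hypothetical helper for illustration; not a LlamaIndex or PyTorch API.
    """
    # Avoid an ImportError if PyTorch is not installed in this environment.
    if importlib.util.find_spec("torch") is None:
        return "torch not installed"

    import torch

    # torch.cuda.is_available() is False when no usable GPU is visible,
    # for example when the process has lost access to the device.
    if torch.cuda.is_available():
        return f"cuda ({torch.cuda.get_device_name(0)})"
    return "cpu"


print(describe_torch_device())
```

If this prints `cpu` on a machine that is supposed to have a GPU, an 8B model will most likely be running on the CPU, which would be consistent with the long generation times described above.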