
Updated 6 months ago

Hello guys, I need some help

My LlamaIndex setup takes about 60 sec to search and 130 sec to generate.
In the profiling result, torch~~~ takes a long time.
How can I solve this?


6 comments

Leonardo Oliva

Hi, we need more details.
How big is the index? Which model are you using? Which retriever? Which Query Engine?
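One way to gather those details is to time each stage separately, so it is clear whether retrieval or generation dominates. The snippet below is only a rough sketch using the standard library: `timed` is a hypothetical helper, and the `time.sleep` calls are placeholders for your own retrieval and generation calls.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(label, results):
    """Record wall-clock seconds for the enclosed block under `label`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        results[label] = time.perf_counter() - start


timings = {}

with timed("retrieve", timings):
    time.sleep(0.01)  # placeholder: replace with your retrieval call

with timed("generate", timings):
    time.sleep(0.01)  # placeholder: replace with your generation call

for label, seconds in timings.items():
    print(f"{label}: {seconds:.2f}s")
```

Posting the per-stage numbers together with the model, retriever, and query engine usually makes the bottleneck much easier to spot.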

WhiteFang_Jr

Are you running your LLM on a CPU-based machine?

hyunjung0906

Thank you.
The model is meta-llama/Meta-Llama-3-8B-Instruct.
The retriever is based on ChromaVectorStore.
The query engine comes from LlamaIndex: it is created from a VectorStoreIndex and uses a response synthesizer configured with ResponseMode.TREE_SUMMARIZE.
The index is not very big: about 17 PDF and 10 TXT files.

hyunjung0906

Oh, I thought I was using my GPU,
but I had lost my permission to access the GPU.
I have solved it now. Thank you guys for helping me.
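For anyone hitting the same symptom, a quick check along these lines might confirm whether PyTorch can actually see the GPU. This is a minimal sketch, not the exact fix used here: `detect_device` is a hypothetical helper name, and it falls back to "cpu" when PyTorch is not installed at all.

```python
import importlib.util


def detect_device():
    """Return "cuda" if PyTorch is installed and can see a GPU, else "cpu"."""
    if importlib.util.find_spec("torch") is None:
        return "cpu"  # PyTorch is not installed in this environment
    import torch

    # is_available() is False when the driver, permissions, or hardware
    # prevent this process from using a CUDA device.
    return "cuda" if torch.cuda.is_available() else "cpu"


device = detect_device()
print(f"Inference device: {device}")
if device == "cpu":
    print("No usable GPU found; an 8B model will likely run very slowly.")
```

If this prints "cpu" on a machine that has a GPU, the cause is probably access rights, drivers, or the installed PyTorch build rather than LlamaIndex itself.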

hyunjung0906

Have a good day

Leonardo Oliva

You helped yourself 🙂
