Find answers from the community

Updated 8 months ago

llama index is spamming too much of

TTree of Life

memory issue

19 comments

WWhiteFang_Jr

For LLM/embed/index ?
You can use them from remote if they are causing you memory issue.

Or is it something else?

TTree of Life

it cause memory load

TTree of Life

after awhile it doesnt clear

TTree of Life

these node take alot of memory in gpu and overexceed from 20gb from 8gb

LLogan M

sounds like you are using a local model? They allocate up to a point. I've never had memory issues, but you need to configure some settings properly (batch size, llm context window, using an actual vector store if you have a lot of data, etc.)

TTree of Life

sorry back

TTree of Life

the problem with this after 8gb it exceed 20 then it fallback to cpu

TTree of Life

1 single call expand almost 2-4gb vram from llama index

TTree of Life

while ollama pure call not expand that much also reset back to 8gb

TTree of Life

llama index also make memory clean but i dont know it very slow like take almost 2-3min if ideal state

TTree of Life

@Logan M

TTree of Life

1 person cause this much rise of vram how can it survive

TTree of Life

also notice it wont clean the memory anymore

Attachment

TTree of Life

it stuck at this point unless wait for more then 1-2min i am not sure about time but it do clean after bit idle state if keep spam it fallback to cpu and hence it go very slow

TTree of Life

@Logan M i found it solve by the new ollama3 experimental update

TTree of Life

https://github.com/ollama/ollama/releases/tag/v0.1.33

TTree of Life

it seems like the ollama itself has problem in earlier version

TTree of Life

now it fixed i am happy now thnx alot bro >.</ for coming

LLogan M

great!

Add a reply