losing my hair over this, I've installed a fresh ubuntu 20.04, installed python3.11, setup a venv within my flask app running off gunicorn within the app to run python3.11 .. I have llama index working but whenever i try upload a new file type i get a modulenotfound error, first one was pypdf, so i installed pypdf, then doc2text giving error when trying to upload docx file.. isn't there some way to automate all these dependencies? Appreciate any help, thanks.
Llama-Index does not auto install all the file required library on its own. Otherwise there would be lots of dependency while installing llama-index itslef.
You'll have to download each of these requirements on your own. lol π
so if i want to support 20 different file types, i need to dig through the code and figure out the dependencies? that doesn't sound right, or is that just the way it is? on my dev server i could see it installing live the dependencies it needed whenever i first uploaded a file of a type.. so damn weird..
on my dev server i never had to install any extra dependencies and every file type worked.. freaking bizarre... it was a fresh server setup too... so pypdf and doc2text were definitely not already installed..
because that's what was happening on my dev server but i think I've setup the prod server a bit different and it's just throwing exceptions every time a new file type is being uploaded
I blew away the venv, created a fresh with just llama-index and as soon as i uploaded a .docx it automatically installed docx2txt... wasted a day on this :/
appreciate the tip man, you sent me in the right direction, much love