Hi guys, my understanding was that all data and docs were kept private and not sent to openai, however when I enable logging I can see the entire document being sent for embeddings and then excerpts of it being sent for querying.. is this normal?
losing my hair over this, I've installed a fresh ubuntu 20.04, installed python3.11, setup a venv within my flask app running off gunicorn within the app to run python3.11 .. I have llama index working but whenever i try upload a new file type i get a modulenotfound error, first one was pypdf, so i installed pypdf, then doc2text giving error when trying to upload docx file.. isn't there some way to automate all these dependencies? Appreciate any help, thanks.