Find answers from the community

Updated 4 months ago

Hi everyone! I have a general question

At a glance
Hi everyone! I have a general question about RAG and Data Privacy. I'm using llama-index to build a Q&A chatbot, which is fed by multiple data sources (Slack, Confluence, Jira, Google Docs).

Now, when a user talks to the bot, I want to fetch documents which this user is allowed to see. For example, if a user is allowed to see document X but not document Y, I want the semantic search to exclude document Y.

What's the best way of doing that? Are there any best practices around this issue? I couldn't find much information online, and specifically about llama-index + privacy.
T
d
2 comments
One approach could be using strict metadata filtering. Do you currently have the permissions defined somewhere regarding what each user can see?
For some data sources, yes. I can fetch users' permissions directly from the API. Other sources doesn't expose this kind of information.
Add a reply
Sign up and join the conversation on Discord