
Updated 2 months ago

Techniques for Unstructured PDF in RAG Chatbot

At a glance

The community member is looking for techniques to use for unstructured PDF data in their RAG chatbot. Another community member suggests trying LlamaParse, which is described as a state-of-the-art PDF parser. However, the original poster is concerned about data privacy, as the PDF data is sensitive. The community members discuss that LlamaParse only keeps the data for 48 hours in the cloud, and it is removed after that time period, addressing the data privacy concern.

What techniques can I use to handle unstructured PDFs in my RAG chatbot?
4 comments
You can try LlamaParse: https://docs.llamaindex.ai/en/stable/llama_cloud/llama_parse/
It's a state-of-the-art PDF parser.
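For anyone who wants to see what that looks like in practice, here is a minimal sketch, assuming the third-party `llama-parse` package is installed and an API key is available in a `LLAMA_CLOUD_API_KEY` environment variable. The helper name `parse_pdf_to_markdown` and the optional `parser` argument are made up for illustration; only `LlamaParse(...)` and `load_data(...)` come from the library. Note that `load_data` uploads the file to the LlamaParse cloud service, which matters if the PDF is sensitive.

```python
import os


def parse_pdf_to_markdown(pdf_path, parser=None):
    """Return the text of each document the parser produces for pdf_path.

    `parser` is a hypothetical hook: any object with a `load_data(path)`
    method returning items with a `.text` attribute. If it is omitted,
    a LlamaParse client is created from LLAMA_CLOUD_API_KEY.
    """
    if parser is None:
        # Imported lazily so a stand-in parser can be used
        # without the llama-parse package installed.
        from llama_parse import LlamaParse

        parser = LlamaParse(
            api_key=os.environ["LLAMA_CLOUD_API_KEY"],
            result_type="markdown",  # ask for Markdown rather than plain text
        )

    # With LlamaParse, this call sends the PDF to the cloud service.
    documents = parser.load_data(pdf_path)
    return [doc.text for doc in documents]
```

Calling `parse_pdf_to_markdown("report.pdf")` (the file name is a placeholder) should give a list of Markdown strings that can then be chunked and embedded for the RAG index. Passing your own `parser` object makes it easy to swap in a local parser later if the data cannot leave your machine.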
Oh, I know about this, but my PDF is a bit sensitive (data privacy). I know it comes with 1,000 free credits per day, so what's the catch? Do they use my data?
No. Data is kept in the cloud for only 48 hours, as a cache: if you call the API again, the result comes back quickly instead of taking as long as it did the first time.
After 48 hours, the data is removed from the cloud.
Thanks for that information.