Find answers from the community

Updated 2 months ago

Hi there, for my company project I try

Hi there, for my company project I try to ingest data from confluence to then create a RAG assistant helping to find information in the vast amount of data we got. the first tests show that we only got like very crappy data to ingest into RAG, its tables, many images and little text if any and its mostly just words. I learn the people are very very lazy writing useful documentation.. thats for the ranting part πŸ˜‰ Is there any strategy how to deal with such messy data? Is there maybe tutorials or tipps around? I would guess its not just my company having such lazy people writing only crap that only themself understand the day they write it
W
o
8 comments
Try using Llamaparse to extract info from the tables/images. That might make your docs more meaningful
OK will try, thank you
Llamaparse is cloud?
thats no option πŸ™‚
why open source projects start closing the doors?
LlamaIndex also provides a way to make it on-Prem, you can connect with the team here for the requirements: https://www.llamaindex.ai/contact
OK I did get in contact, thank you πŸ™‚
Add a reply
Sign up and join the conversation on Discord