Hi there, for a company project I'm trying to ingest data from Confluence and then build a RAG assistant that helps people find information in the vast amount of data we have. The first tests show that we only have very crappy data to ingest into RAG: it's tables, many images, and little text if any, and what text there is is mostly just single words. I've learned that people are very, very lazy about writing useful documentation... so much for the ranting part. Is there any strategy for dealing with such messy data? Are there maybe tutorials or tips around? I would guess it's not just my company that has lazy people writing only crap that only they themselves understand on the day they write it.