Find answers from the community

Updated 3 weeks ago

Parsing and Indexing Excel Files for a Vector Database

Does anyone have a good way of parsing and indexing excel files (including some very messy ones, not standardized). My idea was to transform each sheet to markdown and embed those into the vectordb. However, it doesnt seem like there is a good way to transform excels into markdown so I will have to take another approach
W
T
4 comments
Unfortunately, the data with is not allowed to leave the cloud platform we're working in for this project.

However, I've tried similar tools (azure document intelligence), and it did not work that well
If anyone is wondering, I solved this by transforming excel to CSV and then letting GPT4o convert the CSV to markdown
Add a reply
Sign up and join the conversation on Discord