Find answers from the community

Updated 6 months ago

is it recommended to pre process data

At a glance

The community member who posted the original question is wondering if it is recommended to preprocess data before using it with LlamaIndex. They mention that they have had much better results when parsing everything into a text file and passing it to LlamaIndex.

In the comments, another community member suggests that the preprocessing required depends on the format of the original data. They note that most parsers already have some form of preprocessing, such as chunking and annotating, defined. Another community member asks about the original format of the data, and the original poster responds that it was in PPTX and CSV formats, but some of it also included images, which might make preprocessing more difficult.

There is no explicitly marked answer in the comments.

is it recommended to pre process data before using it with llamaindex ? Seems I have much better results when parsing everything into a text file and passing it to llamaindex
s
T
A
3 comments
depends on what you are passing it to. Most parsers do have some form of preprocessing viz chunking, annotating and such already defined.
What was the original format of the data?
pptx and csv but some have some images so it might be difficult for the parsers to pre process it ?
Add a reply
Sign up and join the conversation on Discord