----------------------
If you're looking for a local PDF parsing library in Python, you might want to consider using PyMuPDF. It's a Python binding to the PDF processing library MuPDF. You can use it to read PDF, XPS, and E-book documents. Here's a simple example of how you can use it with LlamaIndex:
from pathlib import Path
from llama_index.readers.file import PyMuPDFReader
loader = PyMuPDFReader()
documents = loader.load(file_path="./data/your_file.pdf")
Please note that while PyMuPDF is a powerful library, it may not handle complex PDFs as effectively as LlamaParse. If you're dealing with complex PDFs and can't use LlamaParse, you might need to explore other options or build custom solutions.
You can install PyMuPDF using pip:
Please refer to the
LlamaIndex documentation () for more details.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------