The community member is using the LlamaParse Python library to process a 9-page PDF, but the code debugger is only returning 1 document instead of 9. Another community member suggests that 1 document object corresponds to 1 PDF file. The community member then mentions that they have to split the text by "\nβ-\n", but one of their PDF files with 209 pages is not being split correctly, resulting in 212 pages. The other community members are unsure about this issue.
@Logan M Okay, turned out I have to split the text by β\nβ-\nβ, but one of my pdf files is not being split correctly. It has 209 pages, but splitting by \nβ-\n gives me 212 pages. Is that a known issue?