hey 👋 - working on setting up a test for JSON mode in the new parser on a .pdf file based on the demo in
https://github.com/run-llama/llama_parse/blob/main/examples/demo_json.ipynb- Has anyone run into a content violation error that prevents the
get_text_nodes
function from populating the dictionary? I've tried both Anthro and OpenAI's multimodal models - with the same result (error message below). - I believe this error is on the Llama-parser side, not on the LLM - but clarification on that would be awesome.
Also, the pdf in testing doesn't have anything close to questionable images. It's a arxiv pdf.
BadRequestError: Error code: 400 - {'error': {'message': 'Your input image may contain content that is not allowed by our safety system.', 'type': 'invalid_request_error', 'param': None, 'code': 'content_policy_violation'}}
Interested in exploring a potential
exception
solution where any
content_violation_policy
is skipped.
Any thoughts, feedback, or ideas on this would be awesome, thanks!