I'm following
https://docs.llamaindex.ai/en/stable/use_cases/q_and_a/rag_cli.htmland get error:
llama_index/core/ingestion/pipeline.py", line 94, in get_transformation_hash
return sha256((nodes_str + transform_string).encode("utf-8")).hexdigest()
UnicodeEncodeError: 'utf-8' codec can't encode character '\ud835' in position 5602662: surrogates not allowed
maybe need to make the code more robust to this issue?
might happen for some pdfs