The community member is asking if it is a good approach to create an ingestion pipeline using documents from SimpleDirectoryReader and nodes from HTML files parsed with HTMLNodeParser. In the comments, another community member suggests that this approach works with a web reader, providing specific configuration details for the web reader, including driver arguments and URLs to be used.
Hi, do you think it is a good approach to create an ingestion pipeline with documents from SimpleDirectoryReader and nodes from HTML files parsed with HTMLNodeParser?
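For reference, here is a rough sketch of what I have in mind. It assumes the llama-index 0.10+ import paths (`llama_index.core.*`) and that `beautifulsoup4` is installed for `HTMLNodeParser`. The function name `build_html_nodes`, the default tag list, and the `./data` folder are placeholders of mine, not anything from the docs.

```python
from typing import List, Optional


def build_html_nodes(input_dir: str, tags: Optional[List[str]] = None):
    """Load HTML files from input_dir and split them into nodes by tag."""
    # Assumption: llama-index >= 0.10 import paths (llama_index.core.*).
    # Imported inside the function so nothing is loaded until it is called.
    from llama_index.core import SimpleDirectoryReader
    from llama_index.core.ingestion import IngestionPipeline
    from llama_index.core.node_parser import HTMLNodeParser

    # Restrict the reader to .html files. Assumption: with no dedicated
    # HTML reader configured, the raw markup is kept as the Document text,
    # which is what HTMLNodeParser expects to receive.
    documents = SimpleDirectoryReader(
        input_dir=input_dir, required_exts=[".html"]
    ).load_data()

    # Split the markup into nodes based on the selected tags
    # (the tag list here is a hypothetical default).
    parser = HTMLNodeParser(tags=tags or ["h1", "h2", "p", "li"])

    # The parser is the only transformation; an embedding model or other
    # steps could be appended to this list later.
    pipeline = IngestionPipeline(transformations=[parser])
    return pipeline.run(documents=documents)
```

Calling `nodes = build_html_nodes("./data")` should return the parsed nodes, which could then be passed to an index. I have not checked how this behaves with very large or malformed HTML files.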