Is there a way to assemble existing LlamaIndex components so that I can run PydanticProgramExtractor over only the first few nodes of every document (like TitleExtractor)? I'm trying to categorize the document based on data that's on the first page of every doc. What is the best approach?