PydanticOutputParser
with MultiModalLLMCompletionProgram
how can I describe to the LLM what goes in each field of the output_cls
? In some cases the fields are called something different in the image. For example, sometime a surgeon is referred to as "surgeon" and other times as a "provider" based on the organization.from llama_index.bridge.pydantic import Field, BaseModel class Test(BaseModel): """A test class.""" name: str = Field(description="The name of a person.")
PydanticOutputParser
. I tweaked the prompt to the following and it resolved the issue.Summarize what is in the image and return the answer in a valid flat JSON structure with single-level keys.