Updated 3 months ago

Hello! When using OpenAIMultiModal.complete, can we specify the output class? I pass it like this:
image_llm = OpenAIMultiModal(
    model='gpt-4o',
    output_cls=output_class,
    api_key=api_key,
    max_new_tokens=1000,
    temperature=0.0,
)
image_doc = load_image_urls([image_url])

response_vision = image_llm.complete(
    prompt=prompt,
    image_documents=image_doc,
)

It doesn't throw an exception, but the response doesn't contain an instance of the output class either. Thanks!
4 comments
Interesting... is it possible to implement the same example, but with OpenAIMultiModal?
Cool, I was able to do what I needed, with OpenAI and this approach. Many thanks!!
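For anyone landing here later: one generic workaround (not necessarily the approach referred to above) is to ask the model in the prompt to reply with JSON and then build the output class from the completion text yourself. Below is a minimal sketch of that parsing step using only the standard library. `ImageDescription` and `parse_output` are hypothetical names, the fields are made up for illustration, and it assumes the completion text is available as a plain string (for example via `response_vision.text`).

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass
class ImageDescription:
    """Hypothetical output class; the fields are illustrative only."""
    caption: str
    objects: List[str]


def parse_output(raw_text: str) -> ImageDescription:
    """Build an ImageDescription from a completion string containing JSON.

    Assumes the prompt asked the model to answer with a single JSON object
    that has "caption" and "objects" keys.
    """
    # Models often surround the JSON with extra prose, so take the span
    # from the first "{" to the last "}".
    start = raw_text.find("{")
    end = raw_text.rfind("}")
    if start == -1 or end < start:
        raise ValueError("no JSON object found in model output")

    data = json.loads(raw_text[start:end + 1])
    return ImageDescription(
        caption=str(data["caption"]),
        objects=[str(item) for item in data["objects"]],
    )
```

With this in place, something like `parse_output(response_vision.text)` would yield an instance of the output class, and raise an error when the model did not return usable JSON instead of failing silently.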