I'm using the multi_modal_pydantic notebook for Pydantic + GPT-4 Vision extraction and trying to adapt it to my use case.
Shouldn't the code below load multiple images, so that every image in restaurant_images can be processed in sequence?
from llama_index.multi_modal_llms import OpenAIMultiModal
from llama_index import SimpleDirectoryReader
# put your local directory here
image_documents = SimpleDirectoryReader("./restaurant_images").load_data()
openai_mm_llm = OpenAIMultiModal(
    model="gpt-4-vision-preview", api_key=OPENAI_API_TOKEN, max_new_tokens=1000
)
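For what it's worth, SimpleDirectoryReader.load_data() does return one document per file in the directory, so iterating over that list and invoking the model once per image should give the sequential behavior described above. Here is a minimal sketch of that per-image loop, with a hypothetical describe_image() standing in for the actual openai_mm_llm.complete(...) call (which requires a live API key), and hard-coded stand-in documents in place of the reader output:

```python
# Stand-in for the list returned by
# SimpleDirectoryReader("./restaurant_images").load_data()
image_documents = [
    {"image_path": "./restaurant_images/menu1.png"},
    {"image_path": "./restaurant_images/menu2.png"},
]

def describe_image(doc):
    # Hypothetical stand-in for the real call, roughly:
    #   openai_mm_llm.complete(prompt=..., image_documents=[doc])
    return f"description of {doc['image_path']}"

# Call the model once per loaded image, in sequence
results = [describe_image(doc) for doc in image_documents]
for r in results:
    print(r)
```

The key point is that the reader loads the whole directory up front; whether the images are then sent to the model one at a time or all in one call is decided by how you structure the loop, not by the reader.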