ChatMemoryBuffer with GPT-4o for multi-modal input. However, LlamaIndex does offer capabilities for building multi-modal applications that combine language and images. The guides in the LlamaIndex documentation cover multi-modal use cases in more detail.

For details on using ChatMemoryBuffer with GPT-4o, I would recommend referring to the official documentation or examples provided by the developers.