Probably someday. Our multimodal support in general is pretty lacking.
On the todo list, in this order:
a) Expand the ChatMessage class to handle multi-modal objects, not just text
b) Reduce code duplication in multi-modal LLMs
c) Make the multi-modal retrieval UX better
d) Obviously, improve the chat/query engine experience with multimodal
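
For (a), here is a rough sketch of the kind of shape that could take. This is purely illustrative and not the actual library API: `MultiModalChatMessage`, `TextBlock`, `ImageBlock`, and the `blocks` field are all hypothetical names made up for this example.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class TextBlock:
    """Hypothetical container for a plain-text piece of a message."""
    text: str


@dataclass
class ImageBlock:
    """Hypothetical container for an image, referenced by URL or path."""
    url: str


# A message is made of any mix of these blocks (assumed design).
ContentBlock = Union[TextBlock, ImageBlock]


@dataclass
class MultiModalChatMessage:
    """Hypothetical stand-in for an expanded ChatMessage."""
    role: str
    blocks: List[ContentBlock] = field(default_factory=list)

    @property
    def text(self) -> str:
        # Join only the text blocks so text-only callers keep working.
        return "\n".join(b.text for b in self.blocks if isinstance(b, TextBlock))


msg = MultiModalChatMessage(
    role="user",
    blocks=[
        TextBlock("What is in this image?"),
        ImageBlock("https://example.com/cat.png"),
    ],
)
print(msg.text)
```

The idea is that existing text-only code can keep reading `.text`, while multi-modal LLMs can walk `blocks` directly.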