Multimodal Integration with Cohere

Question

I saw you now have multi-modal integration with cohere. Is there a colipali implementation in llamaindex too?

Logan M · Answer

theres a PR just opened to add colpali as a reranker

Logan M · Answer

Adding it as an actual indexer is a lot harder due to handling multi-vector indexing, not there yet

Tiago Freitas · Answer

is multi-vector indexing planned? does it perform much better than just using the cohere multimodal embeddings?

Logan M · Answer

Some preliminary thoughts about how to do it, but nothing concrete.

Its extremely complex (and uses a lot more resources compared to dense embeddings)

Things that need refactoring to support it

the node class assumes a single dense vector
all our embedding model classes assume a single dense vector
all our vector stores assume a single dense vector for retrieval

These are not easy things to fix

RE performance, I can't really comment. It largely depends on what your data looks like. IMO cohere multimodal should be fine in most cases

Tiago Freitas · Answer

do you have benchmarks comparing cohere multimodal with full colipali with colqwen2 ?

Tiago Freitas · Answer

https://huggingface.co/spaces/vidore/vidore-leaderboardwould be good to know how llama-index with recommended settings and best embeddings compares

Tiago Freitas · Answer

could you consider running the vidore benchmark with your colipali reranker method?

Find answers from the community

Multimodal Integration with Cohere