Find answers from the community

Updated 11 hours ago

Cached Augmented Generation (CAG) with Gemini or other LLMs integrated with LlamaIndex

Hi everyone,
Is there any implementation of Cached Augmented Generation (CAG) with Gemini or other LLMs integrated with LlamaIndex?
L
2 comments
correct me if I'm wrong, but doesn't CAG require direct model access (i.e. with pytorch)? I don't think you can implement this over an API
Add a reply
Sign up and join the conversation on Discord