Is it possible to use the together.ai API endpoint with llama-index?
I think the together.ai interface is currently not available in llama-index.

https://docs.llamaindex.ai/en/stable/module_guides/models/llms.html#modules
You can still connect to Together.ai services using the custom LLM class.
Just set up the LLM and pass it to the service_context: https://docs.llamaindex.ai/en/stable/module_guides/models/llms/usage_custom.html#example-using-a-custom-llm-model-advanced
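A minimal sketch of that custom-LLM route (the class name, model string, and the stubbed-out API call are placeholders, following the CustomLLM pattern in the linked docs):

from typing import Any

from llama_index import ServiceContext
from llama_index.llms import (
    CustomLLM,
    CompletionResponse,
    CompletionResponseGen,
    LLMMetadata,
)
from llama_index.llms.base import llm_completion_callback


class TogetherLLM(CustomLLM):
    # Hypothetical wrapper that would forward prompts to the Together.ai API.
    context_window: int = 4096
    num_output: int = 256
    model_name: str = "mistralai/Mixtral-8x7B-Instruct-v0.1"  # placeholder

    @property
    def metadata(self) -> LLMMetadata:
        return LLMMetadata(
            context_window=self.context_window,
            num_output=self.num_output,
            model_name=self.model_name,
        )

    @llm_completion_callback()
    def complete(self, prompt: str, **kwargs: Any) -> CompletionResponse:
        # Call Together.ai's completion endpoint here; stubbed so the sketch stays self-contained.
        return CompletionResponse(text="<together.ai response goes here>")

    @llm_completion_callback()
    def stream_complete(self, prompt: str, **kwargs: Any) -> CompletionResponseGen:
        # Same stub, yielded as a single streamed chunk.
        yield CompletionResponse(text="<together.ai response goes here>")


service_context = ServiceContext.from_defaults(llm=TogetherLLM())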
There should be an integration for it soon in the next few days! πŸ‘
Nice!!! @Logan M πŸ§€πŸŸ working hard this holiday. We appreciate it

We'll be able to replace ollama run ... with together.ai.
Any other integrations we can expect besides Together?
Anyscale?
Perplexity?
DeepInfra?

It seems like Together has the best combo of price, developer experience, and speed.
We actually already have Anyscale and Perplexity.
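For reference, both can be imported from llama_index.llms and dropped into a service context (a small sketch; the model strings and keys below are placeholders):

from llama_index.llms import Anyscale, Perplexity

anyscale_llm = Anyscale(model="meta-llama/Llama-2-70b-chat-hf", api_key="...")
pplx_llm = Perplexity(model="mistral-7b-instruct", api_key="...")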
OctoAI seems to have good prices and $10 of free credits (https://octoai.cloud/). Is it in your plans to integrate with their API?
Is this merged?
It has not. Actually, I don't even see the PR anymore lol

But I need to do it anyways before Jan 3rd lol
@Logan M happy new year.

When using the query engine llama pack with Ollama, we just need to pull the model we want to use. We don't need the command ollama run mistral, correct?
Happy new year!

Yea I think that's correct? It should pull the model if you haven't used it already... I think. I'm not an ollama expert lol
Is it possible to use these instead of Ollama, using the Ollama Query Engine Pack? Is there any guide or documentation? I gave away my Mac, and my laptop is taking 30s on the dolphin-phi model...
It should be possible! I think mistral will just automatically download right?
No, I meant how do I use Perplexity with the query engine pack, so the documents are local but the LLM runs in the cloud on Perplexity...
Oh perplexity!
You have to set up the LLM and throw it into the service context

https://docs.llamaindex.ai/en/stable/examples/llm/perplexity.html#setup-llm
Probably will have to edit the pack code to do that
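A rough sketch of that step (the model string and key are placeholders, following the Perplexity example linked above):

from llama_index import ServiceContext
from llama_index.llms import Perplexity

llm = Perplexity(model="mistral-7b-instruct", api_key="pplx-...", temperature=0.5)
service_context = ServiceContext.from_defaults(llm=llm)
# Inside the pack, this service_context would replace the one built around the Ollama LLM.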
ollama_pack = OllamaQueryEnginePack(model="dolphin-phi", documents=documents)


What if we use a remote provider, aka model="perplexity-mistral-7b-instruct",
and also provide the API key above ...
Maybe I'm confused lol. Does Ollama talk to remote providers? I thought it was all local?
Yes, it's all local, and I was hoping to do the RAG example using remote providers instead...
so a local document folder for RAG and a remote provider for the LLM
The link I shared above shows how to setup the llm

I really think it's worth learning how to set up the LLM, embeddings, and service context yourself, and create the query engine πŸ™
We have extensive docs on how to do it πŸ‘
https://github.com/run-llama/llama-hub/tree/8fa06e01f1ae72722cd77372fc1cf23e4ef37e08/llama_hub/llama_packs/ollama_query_engine
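A minimal sketch of that local-documents-plus-remote-LLM setup (the data path, model string, and key are placeholders; this follows the legacy ServiceContext-style API used in the docs above):

from llama_index import ServiceContext, SimpleDirectoryReader, VectorStoreIndex
from llama_index.llms import Perplexity

# Remote LLM (Perplexity), local embeddings, local documents.
llm = Perplexity(model="mistral-7b-instruct", api_key="pplx-...", temperature=0.5)
service_context = ServiceContext.from_defaults(llm=llm, embed_model="local")

documents = SimpleDirectoryReader("./data").load_data()  # local document folder
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
query_engine = index.as_query_engine()

print(query_engine.query("What do my documents say about X?"))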

Ok, got it. I'll look into how this works and see if it's possible to do local docs with a remote provider.