How will I do that for Llama 2? Will I have to host my Llama 2 model somewhere first, or are there services out there that already host open-source models, so I can just use their API?
so instead of hitting OpenAI's API, for example, to query an LLM, you can use Replicate in a similar way to query an open-source LLM of your choosing (by providing the name of any model they support)
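As a rough sketch of what that looks like with Replicate's Python client (the model slug `meta/llama-2-7b-chat` is an assumption here; check Replicate's model catalog for the current name, and note you need a `REPLICATE_API_TOKEN` set in your environment):

```python
import os


def ask_llama(prompt: str) -> str:
    # Lazy import so the script still loads without the client installed:
    # pip install replicate
    import replicate

    # replicate.run() yields output chunks for streaming models;
    # join them into a single response string.
    chunks = replicate.run(
        "meta/llama-2-7b-chat",  # assumed model slug; verify on replicate.com
        input={"prompt": prompt},
    )
    return "".join(chunks)


if __name__ == "__main__":
    if os.environ.get("REPLICATE_API_TOKEN"):
        print(ask_llama("Why is the sky blue?"))
    else:
        print("Set REPLICATE_API_TOKEN to run this example.")
```

The shape mirrors the OpenAI client pattern: one call with a model identifier and an input payload, so swapping models is mostly a matter of changing the slug.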