Maybe I need to change the embedding model?

I am using Llama 2 in the code, so why do I still need an OpenAI API key?

My .env file has this structure:
Plain Text
OPENAI_API_KEY=<key>
REPLICATE_API_KEY=<key>
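For reference, a minimal sketch of how these keys can be loaded, assuming the .env file is read with python-dotenv (the variable names are illustrative):
Plain Text
import os

from dotenv import load_dotenv  # pip install python-dotenv

load_dotenv()  # reads .env from the current working directory
openai_key = os.environ["OPENAI_API_KEY"]
replicate_key = os.environ["REPLICATE_API_KEY"]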
I have set the embed_model="local" as well
You are creating the service context, but you are not passing it anywhere
I am passing it in the query engine
Plain Text
query_engine = self.index.as_query_engine(service_context=self.ctx, streaming=True)
That doesn't seem to work

Try this

Plain Text
self.index = VectorStoreIndex.from_documents(documents, service_context=self.ctx)
# or
self.index = load_index_from_storage(storage_context, service_context=self.ctx)
Should I remove it from the query engine?
yea remove it for now πŸ€”
Tried removing it
still the same issue
it still requires the OpenAI key
even with embed_model="local" ?
It worked now
I had to delete the storage
now it's working
@Logan M I have to delete the storage every time I want to make it work
otherwise it doesn't work and keeps asking for the OpenAI key
do you know why that is?
You added this line?
self.index = load_index_from_storage(storage_context, service_context=self.ctx) ?
adding it now
so I had to add the context in all 3 places to make it work
let me try again
@Logan M still the same issue
first run without storage works
but it doesn't work in the second run
Can you send your current code again?
In the loading path, self.ctx is None
Plain Text
def _initialize_index(self):
    LLAMA_13B_V2_CHAT = "a16z-infra/llama13b-v2-chat:df7690f1994d94e96ad9d568eac121aecf50684a0b0963b25a41cc40061269e5"
    llm = Replicate(
        model=LLAMA_13B_V2_CHAT,
        temperature=0.01,
        context_window=4096,
        completion_to_prompt=self.custom_completion_to_prompt,
        messages_to_prompt=messages_to_prompt,
    )
    # Build the service context before branching so the loading path gets it too
    self.ctx = ServiceContext.from_defaults(llm=llm, embed_model="local")
    if not os.path.exists(self.persist_dir):
        documents = SimpleDirectoryReader("data").load_data()
        self.index = VectorStoreIndex.from_documents(documents, service_context=self.ctx)
        self.index.storage_context.persist(persist_dir=self.persist_dir)
    else:
        storage_context = StorageContext.from_defaults(persist_dir=self.persist_dir)
        self.index = load_index_from_storage(storage_context, service_context=self.ctx)
that would work
initializes the ctx regardless of the if condition
ohhh no I am a dumbass
ty let me try
also @Logan M what does embed_model="local" mean?
conceptually, I mean
It's shorthand for embed_model=HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")
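So the service context above could equally be built with the explicit embedding class; a sketch, assuming this llama_index version exposes HuggingFaceEmbedding (the model downloads and runs locally, so no OpenAI key is needed for embeddings):
Plain Text
from llama_index.embeddings import HuggingFaceEmbedding

# Equivalent to embed_model="local": a small embedding model run locally
ctx = ServiceContext.from_defaults(
    llm=llm,  # the Replicate llm defined earlier
    embed_model=HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5"),
)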