Why would someone do this with index_struct?

Why would someone do this:

Plain Text
index_struct = self.index.index_struct
vector_store_index = VectorStoreIndex(index_struct=index_struct)

and set the index equal to this, rather than just using the index?
I have no idea. Sounds like a recipe for bugs if you ask me
Okay, that's what I thought. I wanted to double check, thank you!
Sorry, I was looking into it more. The reason I wanted to do it is that the index of a condense chat engine expects a VectorStoreIndex rather than a base index. However, when I try to form the index as above, the chat outputs very wrong answers.
Sorry, to be a little more clear: "index=self.index," gives me the type error "expects VectorStoreIndex". However, when I do what I did above, I don't think the chat engine is querying the right index.
Hmm, that error doesn't seem possible? The CondenseQuestionChatEngine can take any query engine

Note that the expected type is BaseQueryEngine
Attachment: image.png
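For reference, a minimal sketch of that pattern (the index variable here is illustrative, assuming an index was already built elsewhere in the code):

Plain Text
from llama_index.chat_engine import CondenseQuestionChatEngine

# Any BaseQueryEngine works here, e.g. one created from an existing index.
query_engine = index.as_query_engine()
chat_engine = CondenseQuestionChatEngine.from_defaults(query_engine=query_engine)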
Ah sorry, I didn't mean an error, I meant this:
Attachment: Screenshot_2023-08-11_at_5.13.19_PM.png
but thank you again!
What's the line above that? What class are you instantiating? 👀
Oh, sorry for the late response. It's a custom retriever class to retrieve embeddings rather than strings.
Ah I see.

That's just a pylint error, so it's not a real error. It's just because the custom retriever expects a vector index.

You can do an assert isinstance() check for the type, or just cast the type to fix the "error"
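A rough sketch of the isinstance route (assuming the index really is a vector index at runtime):

Plain Text
from llama_index import VectorStoreIndex

# Fails fast if the index isn't actually a VectorStoreIndex,
# and also narrows the type for the linter.
assert isinstance(self.index, VectorStoreIndex)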
Sorry, I'm a little confused. How would I cast the type to make it a VectorStoreIndex?
Also, is the new default LLM llama-cpp? I just tried running it and it's giving me very different output.
Well, it's a completely different LLM, so the output will be wildly different 😅 It's just something to quickly unblock people.

You could definitely try downloading and using a different llama-cpp model.
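If you go that route, here is a hedged sketch of pointing llama-index at a locally downloaded quantized model (the path and settings below are placeholders, not defaults):

Plain Text
from llama_index.llms import LlamaCPP

llm = LlamaCPP(
    model_path="/path/to/your-quantized-model.bin",  # placeholder: any local llama.cpp-compatible model
    temperature=0.1,
    context_window=3900,
)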
It's just a python typing thing

Plain Text
from typing import cast

from llama_index import VectorStoreIndex

# Purely a type-checker hint, no runtime effect; use it wherever the index is passed.
cast(VectorStoreIndex, self.index)
It's just cosmetic though
Hahah yeah, sorry, it was more unexpected because of all the logging. Thank you so much for your help! I'm going to try to load a model directly from Hugging Face and use the quantized version, so I may have some more questions in a little while.
Sorry, I am back and have a few questions: if I don't specify an OpenAI key but do specify the model, like "self.llm = llm if llm else load_llama_v2_8bit()", why is the llama-cpp logging still coming up?
Also, if I do specify the API key, it seems to load in the model and everything, but my account is still being charged.
Where/how are you specifying the model?
I'm creating a bot class, specifying the LLM inside the class like "llm if llm else load_llama_v2_8bit()", and then setting up the service context like:

Plain Text
service_context = ServiceContext.from_defaults(
    llm=self.llm,
    embed_model=self.embed_model
)
condense_engine = CondenseQuestionChatEngine.from_defaults(
    service_context=service_context,
Ooo the chat engine doesn't actually use the service context there lol
You can pass in the llm directly as a kwarg
Like this, sorry:

Plain Text
condense_engine = CondenseQuestionChatEngine.from_defaults(
    llm=self.llm,
    query_engine=RetrieverQueryEngine(
        retriever=FAISSRetriever(
            index=cast(VectorStoreIndex, self.index),
            embed_model=self.embed_model
        )
    )
)
Sorry, I'm also a little confused because llama cpp is still popping up, but I do not want to use it.
Sorry to spam, but I have also tried with a context chat engine:

Plain Text
def _get_context_engine(self) -> BaseChatEngine:
    """Initialize a llama-index context engine."""

    service_context = ServiceContext.from_defaults(
        llm=self.llm,
        embed_model=self.embed_model
    )

    context_engine = ContextChatEngine.from_defaults(
        service_context=service_context,
        retriever=FAISSRetriever(
            index=cast(VectorStoreIndex, self.index),
            embed_model=self.embed_model
        )
    )
    return context_engine

This one does not have the llama.cpp logging throughout the chat, but it does at the beginning.
Hmmm, the only thing I can think of is that llama cpp is being activated in the index?
Hard to say without seeing the whole flow, I guess.
But if it wasn't llama cpp, it would be key errors about OpenAI lol
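If that's what's happening, a hedged sketch of the fix (assuming the index is built somewhere in the same class; the names are only illustrative): pass the same service_context when the index is constructed, so llama-index doesn't fall back to its default LLM there.

Plain Text
# Illustrative only: build the index with the same service_context
# so the default (llama.cpp fallback) LLM never gets instantiated.
index = VectorStoreIndex.from_documents(documents, service_context=service_context)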
Oh lol, would it be possible to not have an API key and also not have the llama cpp log or the OpenAI key errors?
Or specify an API key but use another LLM, so that OpenAI would not be used?
Also, is that the right way to specify an LLM in the condense chat engine?
If it's okay with you, I can also PM the whole flow.
Maybe PM the whole flow, just to double check lol, shouldn't be too hard