Ah ok… I think OpenAI embeddings are the ones with the best performance, no?
I think so! At least, they like to brag about them being the best/fastest/cheapest lol
Usually the flow in LlamaIndex is to generate embeddings with OpenAI, and then store them in a vector store like Pinecone or Qdrant
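For reference, a minimal sketch of that flow, written against the LlamaIndex and pinecone-client APIs from around that era (module paths have since moved in newer releases); the index name, environment, and data directory are placeholders:

```python
import pinecone
from llama_index import GPTVectorStoreIndex, SimpleDirectoryReader, StorageContext
from llama_index.vector_stores import PineconeVectorStore

# Older pinecone-client initialization; the environment value is a placeholder
pinecone.init(api_key="PINECONE_API_KEY", environment="us-east-1-aws")
pinecone_index = pinecone.Index("help-channel-demo")  # hypothetical index name

# Load documents; LlamaIndex embeds with OpenAI by default
# (text-embedding-ada-002, reading OPENAI_API_KEY from the environment)
documents = SimpleDirectoryReader("./data").load_data()

# Point the index at Pinecone so the embeddings are stored there
vector_store = PineconeVectorStore(pinecone_index=pinecone_index)
storage_context = StorageContext.from_defaults(vector_store=vector_store)
index = GPTVectorStoreIndex.from_documents(documents, storage_context=storage_context)

response = index.as_query_engine().query("What is this thread about?")
```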
Yep! I'm using Pinecone now and it's OK. I hoped it would be a bit faster.
Just a question: how would you weight the factors that matter for getting a good response? E.g., IMO:
  • 65% prompt engineering
  • 30% chunk size
  • 5% text splitter
Also, I suppose the choice of index could have a significant impact, though I've never tried anything but the simple vector index and the GPT Pinecone index
What do you think? @Logan M
I definitely agree with your percentages!

Prompt engineering is important for following instructions and minimizing hallucinations

Chunk size is important for how your embeddings work. But also, if you can pre-split the documents yourself as much as possible (i.e., into clear sections), this helps too
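As a sketch of the pre-splitting idea, one option is to cut on the document's own section markers and hand LlamaIndex one Document per section, so chunk boundaries follow the document's structure instead of arbitrary character offsets. The file name and the "## " delimiter are just assumptions for the example:

```python
from llama_index import Document, GPTVectorStoreIndex

# Hypothetical source file; "## " as a section delimiter is an example
# convention for markdown-style docs
raw_text = open("manual.md").read()

# One Document per clear section, instead of letting the default splitter
# cut mid-topic
sections = [s.strip() for s in raw_text.split("## ") if s.strip()]
documents = [Document(text=s) for s in sections]

index = GPTVectorStoreIndex.from_documents(documents)
```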
I'm actually working on a little demo using the help channel messages haha, using BERT to split the chat threads into topics and then building a composable index

I hope it works, we will see
This is cool! I've never tried BERT; where do you find it most beneficial? Ohhh, everything moves so fast and there's so much material 😂
Definitely lots to learn!

I'm using a Python package called BERTopic to cluster the Discord messages, and then creating an index per topic.
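For anyone curious, the clustering step might look roughly like this with BERTopic's standard fit_transform API. The sample messages are made up, and a real run needs a few hundred documents for the clustering to behave:

```python
from bertopic import BERTopic

# Hypothetical message dump; in practice this would be the scraped
# help-channel threads (BERTopic needs far more than three docs to cluster)
messages = [
    "How do I change the chunk size?",
    "My Pinecone upsert keeps timing out",
    "What embedding model does LlamaIndex use by default?",
]

topic_model = BERTopic(min_topic_size=10)
topics, probs = topic_model.fit_transform(messages)

# Bucket messages by assigned topic; BERTopic labels outliers as topic -1
buckets = {}
for msg, topic in zip(messages, topics):
    buckets.setdefault(topic, []).append(msg)
# ...then build one index per bucket
```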
I'm still not convinced by composed indices. Did you try them, and in your experience do they work better than a single vector index?
I mean, a single index with related materials (not one about medicine and the other about economics, to be clear 😅)
Yea if your data is focused on a single clear topic, a single index will work great 💪
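For context, a composed index in that era's LlamaIndex looked roughly like the sketch below: per-topic vector indices joined under a ComposableGraph, with per-index summaries used to route queries. All documents and summaries here are hypothetical, and the module layout has changed in later releases:

```python
from llama_index import (
    ComposableGraph, Document, GPTListIndex, GPTVectorStoreIndex,
)

# Hypothetical per-topic documents
docs_a = [Document(text="Thread about embeddings and chunk sizes...")]
docs_b = [Document(text="Thread about Pinecone setup and upserts...")]

index_a = GPTVectorStoreIndex.from_documents(docs_a)
index_b = GPTVectorStoreIndex.from_documents(docs_b)

# Compose the per-topic indices under a list index; the summaries steer
# which child index a query gets routed to
graph = ComposableGraph.from_indices(
    GPTListIndex,
    [index_a, index_b],
    index_summaries=[
        "Help threads about embeddings and chunking",
        "Help threads about Pinecone and vector stores",
    ],
)
response = graph.as_query_engine().query("How do I tune chunk size?")
```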