Find answers from the community

Updated 2 months ago

Err weird question hoping anyone else is

Err weird question hoping anyone else is/has seeing this: Has anyone else's retriever suddenly become very dumb over night? I'm using OpenAI's Davinci model for embedding and I swear yesterday it was doing great. Today, I query something I've queried before and it returns the worst nodes. I'm so confused.
L
P
14 comments
have not seen this πŸ‘€ Although I know davinci is being deprecated by openai -- could be something weird going on under the hood on their end
that would be weird. They just added Davinci as an embedding model though I thought...
I've been meaning to mess around with text-embedding-ada-002 anyway so I'll give that a shot.
ada-002 is about 500x cheaper it looks like lol
maybe a good call
yeah haha so much cheaper and apparently better but idk I tested it before and had mixed results.
I spent $440 on embeddings over the last week πŸ˜† hoping my company doesn't get upset about the big charge 😬
πŸ€‘ πŸ’Έ
Looks like that weird dumbing down of Davinci was a blessing in disguise! ada-2 is working great so far!
Ugh ok after a ton of testing Ada is only producing results with ~80% accuracy while Davinci had 92%. Today I tried Davinci again and sure enough it's back up... I wonder what they broke yesterday lol
any other embedding models/AI companies I should look at?
hmm. Cohere-Rerank is very popular (but it's a paid API for reranking, that we support)

I know JinaAI is about to launch their V2 embeddings (but they aren't out yet)

The rest is just stuff from this leaderboard -- https://huggingface.co/spaces/mteb/leaderboard -- I would maybe give bge-large-en-v1.5 a try again, but set your chunk size to 512
I set the chunk size to 512 and it didn't make any difference 😿
Hm, weird πŸ€·β€β™‚οΈ
Add a reply
Sign up and join the conversation on Discord