i just conducted test. luckily, i have still have the old version running on and old environment. the index.json is 120M. but every request only cost me 600 tokens for LLM. Under new model, with smaller index.json - 20M, it cost me about 4000 token for LLM, which is a huge difference need some help from technical team to investigate.