I also find this one useful:
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard

But tbh, most open-source models aren't great for complex stuff yet. For basic Q&A, most work ok-ish. But for complex features (router engine, sub-question, structured output), most open-source models won't work well.