Find answers from the community

Updated 2 months ago

anyone have any ideas on this?

anyone have any ideas on this?
L
s
5 comments
Not really anything built into the library.

These days though, DPO is much easier approach for fine-tuning compared to RLHF
Huggingface has quite a few utilities for this
got it. will look into DPO.
This might be a good article to start with πŸ™‚ https://huggingface.co/blog/dpo-trl
Add a reply
Sign up and join the conversation on Discord