What are the key features of the DPO project?
Question
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.
Answers ( 1 )
The DPO project includes the following key features:
- Support for original DPO, "conservative" DPO, and IPO.
- A two-stage training pipeline involving supervised fine-tuning (SFT) followed by preference learning.
- Multi-GPU support with BasicTrainer, FSDPTrainer, and TensorParallelTrainer.
- Accelerated training through mixed precision (bfloat16) and activation checkpointing.