What training techniques were used for Gemma 3?
Question
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.
Answers ( 1 )
The training process included:
1. **Pretraining**: Using 2-14 trillion tokens (scaling with model size)
2. **Post-training**:
- Knowledge distillation
- RLHF (Reinforcement Learning from Human Feedback)
- RLMF (Reinforcement Learning from Machine Feedback)
- RLEF (Reinforcement Learning from Execution Feedback)
3. **Vision encoding**: Frozen SigLIP-based encoder for multimodal models