What training techniques were used for Gemma 3?


Answers ( 1 )

    2025-04-01T15:25:30+00:00

    The training process included:
    1. **Pretraining**: 2 to 14 trillion tokens, scaling with model size (2T for the 1B model, 4T for 4B, 12T for 12B, 14T for 27B)
    2. **Post-training**:
       - Knowledge distillation
       - RLHF (Reinforcement Learning from Human Feedback)
       - RLMF (Reinforcement Learning from Machine Feedback)
       - RLEF (Reinforcement Learning from Execution Feedback)
    3. **Vision encoding**: A SigLIP-based vision encoder, kept frozen during training, in the multimodal models (4B, 12B, 27B; the 1B model is text-only)
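
To make the knowledge distillation item more concrete, here is a minimal sketch of a generic soft-target distillation loss for a single token. It is an illustration under assumptions, not the actual Gemma 3 recipe: the function names (`log_softmax`, `distillation_loss`), the `temperature` parameter, and the toy logits are all hypothetical.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_total = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_total for x in scaled]


def distillation_loss(student_logits, teacher_logits, temperature=1.0):
    """Cross-entropy of the student's distribution against the teacher's
    soft targets for one token (hypothetical helper, generic formulation)."""
    teacher_probs = [math.exp(lp) for lp in log_softmax(teacher_logits, temperature)]
    student_log_probs = log_softmax(student_logits, temperature)
    return -sum(t * s for t, s in zip(teacher_probs, student_log_probs))


# Toy logits over a 3-token vocabulary (made-up values for illustration).
teacher = [2.0, 1.0, 0.1]
close_student = [1.9, 1.1, 0.0]   # roughly agrees with the teacher
far_student = [0.1, 1.0, 2.0]     # ranks the tokens in the opposite order

loss_close = distillation_loss(close_student, teacher)
loss_far = distillation_loss(far_student, teacher)
print(loss_close < loss_far)
```

The loss is smallest when the student's distribution matches the teacher's, so minimizing it pulls the smaller model toward the larger model's predictions rather than toward one-hot labels.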
