Join us to learn how to:
- Explain the full LLM training stack and where RL fits
- Choose between PPO/DPO/GRPO/GSPO for real use cases
- Design reward signals (including verifiable rewards)
- Debug RL fine-tuning with practical diagnostics
ENROLLMENT RATES:
- Early bird rate: $1,500 USD (until February 20, 2026)
- General admission rate: $2,000 USD (starting on February 21)
- Student rate: $250 USD. You must use your university or educational institution email account when registering and use the discount code "LOVETOLEARN2026". You will receive a confirmation email if you're approved.