Subhaditya's KB

Home

❯

KB

❯

AI

❯

Machine Learning

❯

Models

❯

Learning Rate Scheduling

Learning Rate Scheduling

Oct 14, 20251 min read

  • temp
  • architecture

Learning Rate Scheduling

  • Learning Rate Decay tricks
  • Gradient Descent
  • Increasing the batch size, reduces noise in the architecture so a larger learning rate is okay
  • Linear Learning Rate Scaling
  • Learning Rate Warmup

Graph View

Backlinks

  • Chapter 6 - Fitting models
  • _Index_of_Models
  • __Index_of__Models
  • Large Batch Training

Created with Quartz v4.5.1 © 2025

  • GitHub
  • LinkedIn