Step Decay (Learning Rate)
The simplest learning rate schedule strategy: the LR is cut by a fixed factor at fixed intervals (every N epochs or steps).
Its abrupt, staircase-style drops made it the default schedule for years, though it has now largely been replaced by smoother schedules such as cosine annealing.
Explanation
A typical configuration multiplies the LR by 0.1 every 30 epochs, so lr_t = lr_0 * 0.1^floor(t / 30). It is simple to implement and easy to reason about, but the decay is less smooth than cosine annealing.
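A minimal sketch of that typical recipe, using PyTorch's StepLR as one common implementation (the tiny linear model and SGD settings are placeholders, not part of any published recipe):

```python
import torch

# Placeholder model and optimizer; any model/optimizer pair works the same way.
model = torch.nn.Linear(10, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

# Multiply the LR by gamma=0.1 every 30 epochs: 0.1 -> 0.01 -> 0.001.
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=30, gamma=0.1)

for epoch in range(90):
    # ... one epoch of training would run here ...
    optimizer.step()   # step the optimizer before the scheduler (PyTorch >= 1.1)
    scheduler.step()   # LR drops after epochs 30 and 60
```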
Practical Relevance
Step decay was the standard schedule in computer vision for years (the original ResNet recipe uses it). It has largely been superseded by cosine annealing and one-cycle, but it remains a simple, robust baseline.
Common Pitfalls
Abrupt LR drops can cause sudden shifts in training dynamics, sometimes visible as loss spikes. Both the drop timing and the decay factor must be tuned by hand (see the sketch below), and a poorly placed drop wastes training budget compared with smooth schedules.
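One way to keep that hand-tuning explicit is to list the drop epochs directly; a sketch using PyTorch's MultiStepLR (the milestone epochs are illustrative, not recommended values):

```python
import torch

# Single placeholder parameter standing in for a real model.
optimizer = torch.optim.SGD([torch.nn.Parameter(torch.zeros(1))], lr=0.1)

# Hand-picked drop points instead of a fixed interval; these milestones
# would need tuning per task.
scheduler = torch.optim.lr_scheduler.MultiStepLR(
    optimizer, milestones=[30, 60, 80], gamma=0.1
)
```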
Origin & History
Step decay was the standard schedule in ImageNet training recipes (AlexNet 2012, VGG 2014, ResNet 2015). Cosine annealing (SGDR, 2017) and the one-cycle policy (2018) showed consistently better results and replaced it as the default choice.
Comparisons & Differences
Step Decay (Learning Rate) vs. Cosine Annealing
Step decay follows a staircase with abrupt jumps; cosine annealing decays smoothly and continuously. The gentler transition usually yields better final accuracy.
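For contrast, a sketch of the smooth counterpart using PyTorch's CosineAnnealingLR (the T_max and eta_min values are illustrative):

```python
import torch

# Single placeholder parameter standing in for a real model.
optimizer = torch.optim.SGD([torch.nn.Parameter(torch.zeros(1))], lr=0.1)

# Smooth decay over 90 epochs following
# lr_t = eta_min + 0.5 * (lr_0 - eta_min) * (1 + cos(pi * t / T_max)).
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(
    optimizer, T_max=90, eta_min=1e-5
)
```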
Step Decay (Learning Rate) vs. Exponential Decay
Step decay lowers the LR discretely at fixed points; exponential decay multiplies it by a constant factor every step or epoch, giving a continuous curve. Exponential decay is smoother, but picking the right decay constant is harder.
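The same contrast in code, using PyTorch's ExponentialLR (gamma=0.9 is an illustrative value):

```python
import torch

# Single placeholder parameter standing in for a real model.
optimizer = torch.optim.SGD([torch.nn.Parameter(torch.zeros(1))], lr=0.1)

# Continuous decay: lr_t = lr_0 * gamma**epoch. With gamma=0.9 the LR
# roughly halves every 7 epochs; this single constant is what makes
# exponential decay harder to tune than explicit step milestones.
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.9)
```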