Leaky ReLU
A variant of ReLU that lets negative inputs pass scaled by a small factor (e.g., 0.01) instead of setting them to 0, which prevents the dead-neuron problem at minimal extra cost.
Explanation
Leaky ReLU: f(x) = x for x > 0, f(x) = αx for x ≤ 0 (typically α = 0.01). The small but nonzero gradient α for negative inputs ensures neurons can never fully "die" as they can with standard ReLU, whose gradient is exactly 0 there. Simple to implement, with minimal computational overhead.
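A minimal NumPy sketch of this definition (the function names are illustrative, not from a specific library):

```python
import numpy as np

def leaky_relu(x, alpha=0.01):
    # f(x) = x for x > 0, alpha * x for x <= 0
    return np.where(x > 0, x, alpha * x)

def leaky_relu_grad(x, alpha=0.01):
    # Derivative: 1 for x > 0, alpha for x <= 0 -- never exactly 0,
    # so the neuron always receives some gradient signal.
    return np.where(x > 0, 1.0, alpha)

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(leaky_relu(x))       # [-0.02  -0.005  0.     0.5    2.   ]
print(leaky_relu_grad(x))  # [0.01 0.01 0.01 1.   1.  ]
```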
Marketing Relevance
An important improvement over ReLU in GANs and very deep networks, where dead neurons are a common source of training problems.
Common Pitfalls
The leak factor α is an additional hyperparameter that must be chosen. Leaky ReLU is not consistently better than standard ReLU, and in Transformers GELU or SwiGLU is usually preferred.
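As an illustration, in PyTorch the leak factor is exposed as the negative_slope argument; the 0.2 value below is a common choice in DCGAN-style discriminators, not a universal default:

```python
import torch
import torch.nn as nn

# The leak factor is a hyperparameter; PyTorch calls it negative_slope.
act_default  = nn.LeakyReLU()                    # negative_slope=0.01 by default
act_stronger = nn.LeakyReLU(negative_slope=0.2)  # heavier leak, common in DCGAN discriminators

x = torch.tensor([-1.0, 0.5])
print(act_default(x))    # tensor([-0.0100, 0.5000])
print(act_stronger(x))   # tensor([-0.2000, 0.5000])
```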
Origin & History
Maas et al. (2013) introduced Leaky ReLU. It became especially popular in GANs (DCGAN, 2015), where dead neurons destabilize training. PReLU (He et al., 2015) made the leak factor learnable.
Comparisons & Differences
Leaky ReLU vs. ReLU
ReLU sets negative values to 0, so neurons can die (zero gradient for all negative inputs); Leaky ReLU multiplies them by a small α, so every neuron keeps a nonzero gradient and continues to learn.
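A small PyTorch sketch of this gradient difference (the input values are illustrative):

```python
import torch
import torch.nn.functional as F

x = torch.tensor([-2.0, 3.0], requires_grad=True)

# Standard ReLU: the negative input gets exactly zero gradient.
F.relu(x).sum().backward()
print(x.grad)   # tensor([0., 1.])

# Leaky ReLU: the negative input still receives a small, nonzero gradient.
x.grad = None
F.leaky_relu(x, negative_slope=0.01).sum().backward()
print(x.grad)   # tensor([0.0100, 1.0000])
```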
Leaky ReLU vs. PReLU
Leaky ReLU uses a fixed leak factor (e.g., 0.01); PReLU learns the factor per channel from the data.
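A brief PyTorch sketch of the difference (the tensor shapes are arbitrary):

```python
import torch
import torch.nn as nn

# PReLU (He et al., 2015): one learnable leak factor per channel.
prelu = nn.PReLU(num_parameters=3, init=0.25)   # 3 channels, alpha initialized to 0.25
x = torch.randn(2, 3, 8, 8)
out = prelu(x)                                  # alpha is updated by backprop like any other weight
print(prelu.weight.shape)                       # torch.Size([3])

# Leaky ReLU: the leak factor is a fixed hyperparameter and is never trained.
leaky = nn.LeakyReLU(negative_slope=0.01)
```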