The Learning Rate
- A high learning rate is useful when learning begins, as very little information is encoded in the Q-estimates
- As learning continues, more and more information is encoded in the estimates
- Consequently, it is worthwhile to start with a high learning rate that decreases with time
- Example
- α = 1 / (1 + (t/c))
-
t
is the current timestep
-
c
is a fixed constant
(next)