A short-term memory in reinforcement learning that marks recent moves so they can share credit when a delayed reward arrives.
Eligibility traces are like fading sticky notes on an AI's recent moves. When a late reward arrives, the freshest notes get the most credit. Older, more faded notes get only a little.
You see them in TD Learning and Actor-Critic. They let a late reward reach earlier moves in a single update instead of creeping back one step at a time.
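To make the fading concrete, here is a minimal sketch of how the notes might weaken over four moves. The function name decayed_traces and the settings gamma=0.9 and lam=0.8 are illustrative assumptions, not values from any particular system.

```python
def decayed_traces(num_steps, gamma=0.9, lam=0.8):
    """Return the trace left on each move after num_steps moves.

    gamma (discount) and lam (trace decay) are assumed example values.
    """
    traces = []
    for _ in range(num_steps):
        # Every older note fades by a factor of gamma * lam.
        traces = [e * gamma * lam for e in traces]
        # The move just made gets a fresh note of full strength.
        traces.append(1.0)
    return traces


traces = decayed_traces(4)
for step, trace in enumerate(traces):
    print(f"move {step}: trace = {trace:.3f}")
```

The last move keeps a trace of 1.0 and each earlier move holds a smaller one, so a reward arriving now would be shared out in that proportion.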
TD Learning
Eligibility traces let TD Learning give reward credit to recent steps.
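As a rough illustration, the sketch below runs one episode of tabular TD(λ) with accumulating traces on a made-up five-state chain where the only reward comes at the very end. The chain, the function name td_lambda_episode, and the settings alpha, gamma, and lam are all assumptions chosen for the example.

```python
def td_lambda_episode(num_states=5, alpha=0.5, gamma=1.0, lam=0.9):
    """Walk left to right along a chain once; reward 1 arrives only at the end.

    The chain and all parameter values are hypothetical example choices.
    """
    values = [0.0] * (num_states + 1)  # last entry is the terminal state, kept at 0
    traces = [0.0] * (num_states + 1)
    for state in range(num_states):
        next_state = state + 1
        reward = 1.0 if next_state == num_states else 0.0
        # TD error: how much better or worse this step was than predicted.
        delta = reward + gamma * values[next_state] - values[state]
        # Mark the state just visited (accumulating trace).
        traces[state] += 1.0
        for s in range(num_states):
            # Credit each state in proportion to its trace, then fade the trace.
            values[s] += alpha * delta * traces[s]
            traces[s] *= gamma * lam
    return values[:num_states]


without_traces = td_lambda_episode(lam=0.0)  # behaves like one-step TD
with_traces = td_lambda_episode(lam=0.9)
print("lam=0.0:", [round(v, 3) for v in without_traces])
print("lam=0.9:", [round(v, 3) for v in with_traces])
```

With lam=0.0 only the final state learns anything from the episode. With lam=0.9 every visited state moves, with the most recent ones moving most.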
RL
Eligibility traces help RL learn from late rewards.
Actor-Critic
Actor-Critic can use eligibility traces to update both its policy (the actor) and its value estimate (the critic).