Latest articles in policy gradients

Baseline for Policy Gradients that All Deep Learning Enthusists Must Know

Baseline for Policy Gradients that All Deep Learning Enthusists Must Know

Baseline is a function when added to an expectation, does not change the expected value but, it significantly affects the variance

Popular policy gradients

More articles in policy gradients