Intuition of PG

Comparison to maximum likelihood

Policy gradient:

Maximum likelihood:

Last updated