Actor-Critic in practice

Architecture design

Two-network design (separate networks for the actor and the critic)

+: simple & stable

-: no shared features between actor and critic

Shared-network design (one network with a policy head and a value head)

+: shared features

-: needs more time for hyper-parameter tuning
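The trade-off above is between separate actor and critic networks and a single network whose features are shared. A minimal sketch of both layouts follows, in plain Python with one hidden layer. The class names, layer sizes, and helper functions are assumptions made for illustration and do not come from the original notes.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for a dense layer with small random weights."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(layer, x, activation=None):
    """Apply a dense layer to the input vector x."""
    weights, biases = layer
    out = [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, biases)]
    return [activation(v) for v in out] if activation else out


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


class SeparateActorCritic:
    """Two-network design: actor and critic each own a hidden layer."""

    def __init__(self, obs_dim, hidden, n_actions, rng):
        self.actor_hidden = init_layer(obs_dim, hidden, rng)
        self.actor_out = init_layer(hidden, n_actions, rng)
        self.critic_hidden = init_layer(obs_dim, hidden, rng)
        self.critic_out = init_layer(hidden, 1, rng)

    def forward(self, obs):
        pi = softmax(dense(self.actor_out, dense(self.actor_hidden, obs, math.tanh)))
        value = dense(self.critic_out, dense(self.critic_hidden, obs, math.tanh))[0]
        return pi, value


class SharedActorCritic:
    """Shared design: one hidden layer feeds a policy head and a value head."""

    def __init__(self, obs_dim, hidden, n_actions, rng):
        self.trunk = init_layer(obs_dim, hidden, rng)
        self.policy_head = init_layer(hidden, n_actions, rng)
        self.value_head = init_layer(hidden, 1, rng)

    def forward(self, obs):
        features = dense(self.trunk, obs, math.tanh)  # computed once, used by both heads
        pi = softmax(dense(self.policy_head, features))
        value = dense(self.value_head, features)[0]
        return pi, value


def n_params(model):
    """Count the scalar parameters in every layer stored on the model."""
    return sum(sum(len(row) for row in w) + len(b) for w, b in vars(model).values())


rng = random.Random(0)
separate = SeparateActorCritic(obs_dim=4, hidden=8, n_actions=2, rng=rng)
shared = SharedActorCritic(obs_dim=4, hidden=8, n_actions=2, rng=rng)
print(n_params(separate), n_params(shared))
```

In the shared layout, the policy loss and the value loss would both send gradients into `trunk` during training. That coupling is one plausible reason the shared design needs more tuning, for example of the relative weight between the two losses.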

In practice

The online version tends to work best when each update uses a batch of transitions (e.g., collected by parallel workers) rather than a single sample.
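A batched online update can be sketched as follows: several workers each take one step, and the resulting transitions are averaged into a single actor-critic update. The toy chain environment, the tabular value function and policy, the learning rates, and all function names are assumptions chosen to keep the example self-contained. The workers are simulated sequentially here, not run in true parallel.

```python
import math
import random

GAMMA = 0.9
N_STATES, N_ACTIONS = 3, 2  # hypothetical 3-state chain with actions {stay, move right}


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_step(state, action):
    """Toy dynamics (assumed): action 1 moves right, reward 1 on reaching the last state."""
    next_state = min(state + action, N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def batch_update(V, logits, batch, lr_v=0.1, lr_pi=0.1):
    """One online actor-critic update averaged over a batch of transitions."""
    n = len(batch)
    v_grad = [0.0] * N_STATES
    pi_grad = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for s, a, r, s2, done in batch:
        target = r + (0.0 if done else GAMMA * V[s2])  # bootstrapped TD target
        advantage = target - V[s]
        v_grad[s] += advantage / n
        probs = softmax(logits[s])
        for k in range(N_ACTIONS):
            # d log pi(a|s) / d logit_k = 1[k == a] - pi(k|s)
            pi_grad[s][k] += advantage * ((1.0 if k == a else 0.0) - probs[k]) / n
    for s in range(N_STATES):
        V[s] += lr_v * v_grad[s]
        for k in range(N_ACTIONS):
            logits[s][k] += lr_pi * pi_grad[s][k]


def train(n_workers=8, n_steps=500, seed=0):
    rng = random.Random(seed)
    V = [0.0] * N_STATES
    logits = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    states = [0] * n_workers  # one current state per worker
    for _ in range(n_steps):
        batch = []
        for i, s in enumerate(states):
            a = rng.choices(range(N_ACTIONS), weights=softmax(logits[s]))[0]
            s2, r, done = toy_step(s, a)
            batch.append((s, a, r, s2, done))
            states[i] = 0 if done else s2  # reset a worker when its episode ends
        batch_update(V, logits, batch)  # one update per synchronized step
    return V, logits


V, logits = train()
print(softmax(logits[1]))
```

Averaging over the workers' transitions reduces the variance of each update compared with updating on a single transition, which is the motivation for the batched variant.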
