Actor-Critic in practice
Architecture design

+: simple & stable
- : no shared features between actor and critic

+: shared features
-: need more time for hyper-parameter tuning
In practice
Online version perhaps works best with a batch(e.g., parallel)

Last updated
Was this helpful?