Actor-Critic in practice
Architecture design
+: simple & stable
- : no shared features between actor and critic
+: shared features
-: need more time for hyper-parameter tuning
In practice
Online version perhaps works best with a batch(e.g., parallel)
Last updated