how does the policy gradient theorem reduce the variance?

http://adv-ml-2017.wikidot.com/forum/t-2315060/how-does-the-policy-gradient-theorem-reduce-the-variance
Ido Hadanny, Sat, 10 Jun 2017 08:37:39 +0000

I think I understand the derivation of the policy gradient theorem presented in the scribe rl_class1, section 2.3.
However, I do not understand why it reduces the variance. Is it because the new expression for the gradient has fewer terms?
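One way to probe this empirically is a small simulation. The sketch below is not taken from the scribe. It assumes the variance reduction in question is the usual one, where each log-probability gradient is weighted only by the rewards that come from that step onward ("reward-to-go") instead of by the whole episode return. The toy problem is hypothetical: a stateless policy with a single parameter `theta`, two actions, and a reward of 1 whenever action 1 is chosen. All function names are made up for illustration.

```python
import math
import random
import statistics


def sample_estimates(horizon, theta, rng):
    """Run one episode and return (full_return_estimate, reward_to_go_estimate)."""
    p = 1.0 / (1.0 + math.exp(-theta))  # probability of picking action 1
    actions = [1 if rng.random() < p else 0 for _ in range(horizon)]
    scores = [a - p for a in actions]  # d/dtheta log pi(a) for this sigmoid policy
    rewards = [float(a) for a in actions]  # toy assumption: reward 1 for action 1

    # Estimator 1: every score term is multiplied by the whole episode return.
    full = sum(scores) * sum(rewards)

    # Estimator 2: each score term is multiplied only by rewards from step t onward.
    reward_to_go = 0.0
    tail = 0.0
    for score, reward in zip(reversed(scores), reversed(rewards)):
        tail += reward
        reward_to_go += score * tail
    return full, reward_to_go


def compare_estimators(num_episodes=20000, horizon=10, theta=0.0, seed=0):
    """Return the sample mean and variance of both gradient estimators."""
    rng = random.Random(seed)
    samples = [sample_estimates(horizon, theta, rng) for _ in range(num_episodes)]
    full = [s[0] for s in samples]
    rtg = [s[1] for s in samples]
    return {
        "full_mean": statistics.fmean(full),
        "full_var": statistics.variance(full),
        "rtg_mean": statistics.fmean(rtg),
        "rtg_var": statistics.variance(rtg),
    }


if __name__ == "__main__":
    print(compare_estimators())
```

In this toy, both estimators have the same expectation, `horizon * p * (1 - p)`, because the dropped products pair a score at step t with rewards from earlier steps, and those products have zero mean. They still add noise, so the reward-to-go version should show the smaller sample variance. That suggests the answer is roughly "yes, fewer terms", with the caveat that the removed terms are specifically zero-mean ones.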
