"Improving Exploration in Actor-Critic With Weakly Pessimistic Value ..."
Fan Li et al. (2024)
- Fan Li, Mingsheng Fu, Wenyu Chen, Fan Zhang, Haixian Zhang, Hong Qu, Zhang Yi:
Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8783-8796 (2024)