default search action
Ziyu Wang 0001
Person information
- affiliation: Google Deepmind, London, UK
- affiliation: University of Oxford, Computer Science Department, UK
- affiliation: University of British Columbia, Department of Computer Science, Vancouver, BC, Canada
Other persons with the same name
- Ziyu Wang (aka: Zi-Yu Wang) — disambiguation page
- Ziyu Wang 0002 — Peking University, State Key Laboratory on Advanced Optical Communication Systems & Networks, Beijing, China
- Ziyu Wang 0003 — University College London, Department of Statistical Science, UK
- Ziyu Wang 0004 — Yidu Central Hospital of Weifang, Department of Radiology, Qingzhou, China
- Ziyu Wang 0005 — Inha University, Department of Computer and Information Engineering, Incheon, Republic of Korea
- Ziyu Wang 0006 — Department of Computer Science and Technology, Tsinghua University, Beijing, China
- Ziyu Wang 0007 — Institute for Network Sciences and Cyberspace, Tsinghua University, Beijing, China
- Ziyu Wang 0008 — New York University Shanghai, Shanghai, China
- Ziyu Wang 0009 — Beihang University, Beijing, China
- Ziyu Wang 0010 — Shanghai Jiao Tong University, Shanghai, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2021
- [c20]Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine:
Benchmarks for Deep Off-Policy Evaluation. ICLR 2021 - [c19]Michael R. Zhang, Thomas Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi:
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization. ICLR 2021 - [i29]Çaglar Gülçehre, Sergio Gómez Colmenarejo, Ziyu Wang, Jakub Sygnowski, Thomas Paine, Konrad Zolna, Yutian Chen, Matthew W. Hoffman, Razvan Pascanu, Nando de Freitas:
Regularized Behavior Value Estimation. CoRR abs/2103.09575 (2021) - [i28]Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine:
Benchmarks for Deep Off-Policy Evaluation. CoRR abs/2103.16596 (2021) - [i27]Michael R. Zhang, Tom Le Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi:
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization. CoRR abs/2104.13877 (2021) - 2020
- [c18]Konrad Zolna, Scott E. Reed, Alexander Novikov, Sergio Gómez Colmenarejo, David Budden, Serkan Cabi, Misha Denil, Nando de Freitas, Ziyu Wang:
Task-Relevant Adversarial Imitation Learning. CoRL 2020: 247-263 - [c17]Çaglar Gülçehre, Tom Le Paine, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil C. Rabinowitz, Duncan Williams, Gabriel Barth-Maron, Ziyu Wang, Nando de Freitas, Worlds Team:
Making Efficient Use of Demonstrations to Solve Hard Exploration Problems. ICLR 2020 - [c16]Ziyu Wang, Alexander Novikov, Konrad Zolna, Josh Merel, Jost Tobias Springenberg, Scott E. Reed, Bobak Shahriari, Noah Y. Siegel, Çaglar Gülçehre, Nicolas Heess, Nando de Freitas:
Critic Regularized Regression. NeurIPS 2020 - [c15]Çaglar Gülçehre, Ziyu Wang, Alexander Novikov, Thomas Paine, Sergio Gómez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel J. Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matthew Hoffman, Nicolas Heess, Nando de Freitas:
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning. NeurIPS 2020 - [c14]Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott E. Reed, Rae Jeong, Konrad Zolna, Yusuf Aytar, David Budden, Mel Vecerík, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang:
Scaling data-driven robotics with reward sketching and batch reinforcement learning. Robotics: Science and Systems 2020 - [i26]Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal M. P. Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alexander Novikov, Sergio Gómez Colmenarejo, Serkan Cabi, Çaglar Gülçehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas:
Acme: A Research Framework for Distributed Reinforcement Learning. CoRR abs/2006.00979 (2020) - [i25]Çaglar Gülçehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gómez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel J. Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas:
RL Unplugged: Benchmarks for Offline Reinforcement Learning. CoRR abs/2006.13888 (2020) - [i24]Ziyu Wang, Alexander Novikov, Konrad Zolna, Jost Tobias Springenberg, Scott E. Reed, Bobak Shahriari, Noah Y. Siegel, Josh Merel, Çaglar Gülçehre, Nicolas Heess, Nando de Freitas:
Critic Regularized Regression. CoRR abs/2006.15134 (2020) - [i23]Tom Le Paine, Cosmin Paduraru, Andrea Michi, Çaglar Gülçehre, Konrad Zolna, Alexander Novikov, Ziyu Wang, Nando de Freitas:
Hyperparameter Selection for Offline Reinforcement Learning. CoRR abs/2007.09055 (2020) - [i22]Konrad Zolna, Alexander Novikov, Ksenia Konyushkova, Çaglar Gülçehre, Ziyu Wang, Yusuf Aytar, Misha Denil, Nando de Freitas, Scott E. Reed:
Offline Learning from Demonstrations and Unlabeled Experience. CoRR abs/2011.13885 (2020)
2010 – 2019
- 2019
- [j4]Oriol Vinyals, Igor Babuschkin, Wojciech M. Czarnecki, Michaël Mathieu, Andrew Dudzik, Junyoung Chung, David H. Choi, Richard Powell, Timo Ewalds, Petko Georgiev, Junhyuk Oh, Dan Horgan, Manuel Kroiss, Ivo Danihelka, Aja Huang, Laurent Sifre, Trevor Cai, John P. Agapiou, Max Jaderberg, Alexander Sasha Vezhnevets, Rémi Leblond, Tobias Pohlen, Valentin Dalibard, David Budden, Yury Sulsky, James Molloy, Tom Le Paine, Çaglar Gülçehre, Ziyu Wang, Tobias Pfaff, Yuhuai Wu, Roman Ring, Dani Yogatama, Dario Wünsch, Katrina McKinney, Oliver Smith, Tom Schaul, Timothy P. Lillicrap, Koray Kavukcuoglu, Demis Hassabis, Chris Apps, David Silver:
Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nat. 575(7782): 350-354 (2019) - [i21]Tom Le Paine, Çaglar Gülçehre, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil C. Rabinowitz, Duncan Williams, Gabriel Barth-Maron, Ziyu Wang, Nando de Freitas, Worlds Team:
Making Efficient Use of Demonstrations to Solve Hard Exploration Problems. CoRR abs/1909.01387 (2019) - [i20]Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott E. Reed, Rae Jeong, Konrad Zolna, Yusuf Aytar, David Budden, Mel Vecerík, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang:
A Framework for Data-Driven Robotics. CoRR abs/1909.12200 (2019) - [i19]Konrad Zolna, Scott E. Reed, Alexander Novikov, Sergio Gomez Colmenarejo, David Budden, Serkan Cabi, Misha Denil, Nando de Freitas, Ziyu Wang:
Task-Relevant Adversarial Imitation Learning. CoRR abs/1910.01077 (2019) - 2018
- [c13]Karol Hausman, Jost Tobias Springenberg, Ziyu Wang, Nicolas Heess, Martin A. Riedmiller:
Learning an Embedding Space for Transferable Robot Skills. ICLR (Poster) 2018 - [c12]Yusuf Aytar, Tobias Pfaff, David Budden, Tom Le Paine, Ziyu Wang, Nando de Freitas:
Playing hard exploration games by watching YouTube. NeurIPS 2018: 2935-2945 - [c11]Yuke Zhu, Ziyu Wang, Josh Merel, Andrei A. Rusu, Tom Erez, Serkan Cabi, Saran Tunyasuvunakool, János Kramár, Raia Hadsell, Nando de Freitas, Nicolas Heess:
Reinforcement and Imitation Learning for Diverse Visuomotor Skills. Robotics: Science and Systems 2018 - [i18]Yuke Zhu, Ziyu Wang, Josh Merel, Andrei A. Rusu, Tom Erez, Serkan Cabi, Saran Tunyasuvunakool, János Kramár, Raia Hadsell, Nando de Freitas, Nicolas Heess:
Reinforcement and Imitation Learning for Diverse Visuomotor Skills. CoRR abs/1802.09564 (2018) - [i17]Yusuf Aytar, Tobias Pfaff, David Budden, Tom Le Paine, Ziyu Wang, Nando de Freitas:
Playing hard exploration games by watching YouTube. CoRR abs/1805.11592 (2018) - [i16]Tom Le Paine, Sergio Gomez Colmenarejo, Ziyu Wang, Scott E. Reed, Yusuf Aytar, Tobias Pfaff, Matthew W. Hoffman, Gabriel Barth-Maron, Serkan Cabi, David Budden, Nando de Freitas:
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL. CoRR abs/1810.05017 (2018) - [i15]Yutian Chen, Aja Huang, Ziyu Wang, Ioannis Antonoglou, Julian Schrittwieser, David Silver, Nando de Freitas:
Bayesian Optimization in AlphaGo. CoRR abs/1812.06855 (2018) - 2017
- [c10]Serkan Cabi, Sergio Gomez Colmenarejo, Matthew W. Hoffman, Misha Denil, Ziyu Wang, Nando de Freitas:
The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously. CoRL 2017: 207-216 - [c9]Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Rémi Munos, Koray Kavukcuoglu, Nando de Freitas:
Sample Efficient Actor-Critic with Experience Replay. ICLR (Poster) 2017 - [c8]Scott E. Reed, Aäron van den Oord, Nal Kalchbrenner, Sergio Gomez Colmenarejo, Ziyu Wang, Yutian Chen, Dan Belov, Nando de Freitas:
Parallel Multiscale Autoregressive Density Estimation. ICML 2017: 2912-2921 - [c7]Ziyu Wang, Josh Merel, Scott E. Reed, Nando de Freitas, Gregory Wayne, Nicolas Heess:
Robust Imitation of Diverse Behaviors. NIPS 2017: 5320-5329 - [i14]Scott E. Reed, Aäron van den Oord, Nal Kalchbrenner, Sergio Gomez Colmenarejo, Ziyu Wang, Dan Belov, Nando de Freitas:
Parallel Multiscale Autoregressive Density Estimation. CoRR abs/1703.03664 (2017) - [i13]Josh Merel, Yuval Tassa, Dhruva TB, Sriram Srinivasan, Jay Lemmon, Ziyu Wang, Greg Wayne, Nicolas Heess:
Learning human behaviors from motion capture by adversarial imitation. CoRR abs/1707.02201 (2017) - [i12]Nicolas Heess, Dhruva TB, Srinivasan Sriram, Jay Lemmon, Josh Merel, Greg Wayne, Yuval Tassa, Tom Erez, Ziyu Wang, S. M. Ali Eslami, Martin A. Riedmiller, David Silver:
Emergence of Locomotion Behaviours in Rich Environments. CoRR abs/1707.02286 (2017) - [i11]Ziyu Wang, Josh Merel, Scott E. Reed, Greg Wayne, Nando de Freitas, Nicolas Heess:
Robust Imitation of Diverse Behaviors. CoRR abs/1707.02747 (2017) - [i10]Serkan Cabi, Sergio Gomez Colmenarejo, Matthew W. Hoffman, Misha Denil, Ziyu Wang, Nando de Freitas:
The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously. CoRR abs/1707.03300 (2017) - 2016
- [j3]Ziyu Wang, Frank Hutter, Masrour Zoghi, David Matheson, Nando de Freitas:
Bayesian Optimization in a Billion Dimensions via Random Embeddings. J. Artif. Intell. Res. 55: 361-387 (2016) - [j2]Bobak Shahriari, Kevin Swersky, Ziyu Wang, Ryan P. Adams, Nando de Freitas:
Taking the Human Out of the Loop: A Review of Bayesian Optimization. Proc. IEEE 104(1): 148-175 (2016) - [c6]Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas:
Dueling Network Architectures for Deep Reinforcement Learning. ICML 2016: 1995-2003 - [i9]Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Rémi Munos, Koray Kavukcuoglu, Nando de Freitas:
Sample Efficient Actor-Critic with Experience Replay. CoRR abs/1611.01224 (2016) - 2015
- [c5]Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alexander J. Smola, Le Song, Ziyu Wang:
Deep Fried Convnets. ICCV 2015: 1476-1483 - [i8]Ziyu Wang, Nando de Freitas, Marc Lanctot:
Dueling Network Architectures for Deep Reinforcement Learning. CoRR abs/1511.06581 (2015) - 2014
- [c4]Ziyu Wang, Babak Shakibi, Lin Jin, Nando de Freitas:
Bayesian Multi-Scale Optimistic Optimization. AISTATS 2014: 1005-1014 - [i7]Ziyu Wang, Babak Shakibi, Lin Jin, Nando de Freitas:
Bayesian Multi-Scale Optimistic Optimization. CoRR abs/1402.7005 (2014) - [i6]Bobak Shahriari, Ziyu Wang, Matthew W. Hoffman, Alexandre Bouchard-Côté, Nando de Freitas:
An Entropy Search Portfolio for Bayesian Optimization. CoRR abs/1406.4625 (2014) - [i5]Ziyu Wang, Nando de Freitas:
Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper-Parameters. CoRR abs/1406.7758 (2014) - [i4]John-Alexander M. Assael, Ziyu Wang, Nando de Freitas:
Heteroscedastic Treed Bayesian Optimisation. CoRR abs/1410.7172 (2014) - [i3]Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alexander J. Smola, Le Song, Ziyu Wang:
Deep Fried Convnets. CoRR abs/1412.7149 (2014) - [i2]Yishu Miao, Ziyu Wang, Phil Blunsom:
Bayesian Optimisation for Machine Translation. CoRR abs/1412.7180 (2014) - 2013
- [j1]Firas Hamze, Ziyu Wang, Nando de Freitas:
Self-Avoiding Random Dynamics on Integer Complex Systems. ACM Trans. Model. Comput. Simul. 23(1): 9:1-9:25 (2013) - [c3]Ziyu Wang, Shakir Mohamed, Nando de Freitas:
Adaptive Hamiltonian and Riemann Manifold Monte Carlo. ICML (3) 2013: 1462-1470 - [c2]Ziyu Wang, Masrour Zoghi, Frank Hutter, David Matheson, Nando de Freitas:
Bayesian Optimization in High Dimensions via Random Embeddings. IJCAI 2013: 1778-1784 - [i1]Ziyu Wang, Masrour Zoghi, Frank Hutter, David Matheson, Nando de Freitas:
Bayesian Optimization in a Billion Dimensions via Random Embeddings. CoRR abs/1301.1942 (2013) - 2012
- [c1]Nimalan Mahendran, Ziyu Wang, Firas Hamze, Nando de Freitas:
Adaptive MCMC with Bayesian Optimization. AISTATS 2012: 751-760
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:36 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint