Shenao Zhang
2020 – today
- 2024
- [c8] Feng Gao, Liangzhi Shi, Shenao Zhang, Zhaoran Wang, Yi Wu:
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations. ICML 2024
- [c7] Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang:
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents. ICML 2024
- [i11] Shenao Zhang, Sirui Zheng, Shuqi Ke, Zhihan Liu, Wanxin Jin, Jianbo Yuan, Yingxiang Yang, Hongxia Yang, Zhaoran Wang:
How Can LLM Guide RL? A Value-Based Approach. CoRR abs/2402.16181 (2024)
- [i10] Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose H. Blanchet, Zhaoran Wang:
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer. CoRR abs/2405.16436 (2024)
- [i9] Shenao Zhang, Donghan Yu, Hiteshi Sharma, Ziyi Yang, Shuohang Wang, Hany Hassan, Zhaoran Wang:
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment. CoRR abs/2405.19332 (2024)
- [i8] Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang:
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs. CoRR abs/2410.08067 (2024)
- 2023
- [c6] Shenao Zhang, Li Shen, Lei Han, Li Shen:
Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning. CoLLAs 2023: 292-317
- [c5] Shenao Zhang, Wanxin Jin, Zhaoran Wang:
Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics. ICML 2023: 41219-41243
- [c4] Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang:
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration. NeurIPS 2023
- [c3] Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao:
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms. NeurIPS 2023
- [i7] Xiaoyu Chen, Shenao Zhang, Pushi Zhang, Li Zhao, Jianyu Chen:
Asking Before Action: Gather Information in Embodied Decision Making with Language Models. CoRR abs/2305.15695 (2023)
- [i6] Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang:
One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration. CoRR abs/2305.18258 (2023)
- [i5] Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang:
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency. CoRR abs/2309.17382 (2023)
- [i4] Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao:
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms. CoRR abs/2310.19927 (2023)
- 2022
- [c2] Shenao Zhang:
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning. NeurIPS 2022
- [i3] Shenao Zhang:
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning. CoRR abs/2209.07676 (2022)
- 2021
- [i2] Shenao Zhang, Li Shen, Zhifeng Li, Wei Liu:
Structure-Regularized Attention for Deformable Object Representation. CoRR abs/2106.06672 (2021)
- [i1] Shenao Zhang, Li Shen, Lei Han, Li Shen:
Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning. CoRR abs/2108.12988 (2021)
2010 – 2019
- 2019
- [c1] Dazheng Hu, Huabiao Qin, Hongmei Liu, Shenao Zhang:
Gaze Tracking Algorithm Based on Projective Mapping Correction and Gaze Point Compensation in Natural Light. ICCA 2019: 1150-1155
last updated on 2024-11-20 21:00 CET by the dblp team
all metadata released as open data under CC0 1.0 license