default search action
Chenfei Wu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c23]Kun Yan, Lei Ji, Chenfei Wu, Jian Liang, Ming Zhou, Nan Duan, Shuai Ma:
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis. AAAI 2024: 6431-6439 - [c22]Minheng Ni, Chenfei Wu, Xiaodong Wang, Shengming Yin, Lijuan Wang, Zicheng Liu, Nan Duan:
ORES: Open-Vocabulary Responsible Visual Synthesis. AAAI 2024: 21473-21481 - [c21]Yiduo Guo, Yaobo Liang, Chenfei Wu, Wenshan Wu, Dongyan Zhao, Nan Duan:
Learning to Plan by Updating Natural Language. EMNLP (Findings) 2024: 10062-10098 - [c20]Zecheng Tang, Chenfei Wu, Juntao Li, Nan Duan:
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models. ICLR 2024 - [c19]Jun Cen, Chenfei Wu, Xiao Liu, Shengming Yin, Yixuan Pei, Jinglong Yang, Qifeng Chen, Nan Duan, Jianguo Zhang:
Using Left and Right Brains Together: Towards Vision and Language Planning. ICML 2024 - [c18]Zecheng Tang, Chenfei Wu, Zekai Zhang, Minheng Ni, Shengming Yin, Yu Liu, Zhengyuan Yang, Lijuan Wang, Zicheng Liu, Juntao Li, Nan Duan:
StrokeNUWA - Tokenizing Strokes for Vector Graphic Synthesis. ICML 2024 - [c17]Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Nan Duan, Furu Wei:
Low-code LLM: Graphical User Interface over Large Language Models. NAACL (Demonstrations) 2024: 12-25 - [i30]Zecheng Tang, Chenfei Wu, Zekai Zhang, Mingheng Ni, Shengming Yin, Yu Liu, Zhengyuan Yang, Lijuan Wang, Zicheng Liu, Juntao Li, Nan Duan:
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis. CoRR abs/2401.17093 (2024) - [i29]Jun Cen, Chenfei Wu, Xiao Liu, Shengming Yin, Yixuan Pei, Jinglong Yang, Qifeng Chen, Nan Duan, Jianguo Zhang:
Using Left and Right Brains Together: Towards Vision and Language Planning. CoRR abs/2402.10534 (2024) - [i28]Gabriela Ben Melech Stan, Raanan Y. Yehezkel Rohekar, Yaniv Gurwicz, Matthew Lyle Olson, Anahita Bhiwandiwalla, Estelle Aflalo, Chenfei Wu, Nan Duan, Shao-Yen Tseng, Vasudev Lal:
LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models. CoRR abs/2404.03118 (2024) - [i27]Gexin Huang, Chenfei Wu, Mingjie Li, Xiaojun Chang, Ling Chen, Ying Sun, Shen Zhao, Xiaodan Liang, Liang Lin:
Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification. CoRR abs/2406.02990 (2024) - [i26]Minheng Ni, Chenfei Wu, Huaying Yuan, Zhengyuan Yang, Ming Gong, Lijuan Wang, Zicheng Liu, Wangmeng Zuo, Nan Duan:
AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition. CoRR abs/2408.11564 (2024) - 2023
- [c16]Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan:
BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning. AAAI 2023: 10637-10647 - [c15]Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Ming Gong, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan:
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. ACL (1) 2023: 1309-1320 - [c14]Xiao Xu, Bei Li, Chenfei Wu, Shao-Yen Tseng, Anahita Bhiwandiwalla, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan:
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning. ACL (1) 2023: 14507-14525 - [c13]Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang:
ReCo: Region-Controlled Text-to-Image Generation. CVPR 2023: 14246-14255 - [c12]Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. IJCAI 2023: 1506-1514 - [i25]Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. CoRR abs/2302.10781 (2023) - [i24]Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan:
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models. CoRR abs/2303.04671 (2023) - [i23]Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Gong Ming, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan:
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. CoRR abs/2303.12346 (2023) - [i22]Yaobo Liang, Chenfei Wu, Ting Song, Wenshan Wu, Yan Xia, Yu Liu, Yang Ou, Shuai Lu, Lei Ji, Shaoguang Mao, Yun Wang, Linjun Shou, Ming Gong, Nan Duan:
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs. CoRR abs/2303.16434 (2023) - [i21]Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Jonathan Tien, Nan Duan:
Low-code LLM: Visual Programming over LLMs. CoRR abs/2304.08103 (2023) - [i20]Yiduo Guo, Yaobo Liang, Chenfei Wu, Wenshan Wu, Dongyan Zhao, Nan Duan:
Learning to Program with Natural Language. CoRR abs/2304.10464 (2023) - [i19]Bingqian Lin, Zicong Chen, Mingjie Li, Haokun Lin, Hang Xu, Yi Zhu, Jianzhuang Liu, Wenjia Cai, Lei Yang, Shen Zhao, Chenfei Wu, Ling Chen, Xiaojun Chang, Yi Yang, Lei Xing, Xiaodan Liang:
Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining. CoRR abs/2304.14204 (2023) - [i18]Xiao Xu, Bei Li, Chenfei Wu, Shao-Yen Tseng, Anahita Bhiwandiwalla, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan:
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning. CoRR abs/2306.00103 (2023) - [i17]Shengming Yin, Chenfei Wu, Jian Liang, Jie Shi, Houqiang Li, Gong Ming, Nan Duan:
DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory. CoRR abs/2308.08089 (2023) - [i16]Dan Qiao, Chenfei Wu, Yaobo Liang, Juntao Li, Nan Duan:
GameEval: Evaluating LLMs on Conversational Games. CoRR abs/2308.10032 (2023) - [i15]Minheng Ni, Chenfei Wu, Xiaodong Wang, Shengming Yin, Lijuan Wang, Zicheng Liu, Nan Duan:
ORES: Open-vocabulary Responsible Visual Synthesis. CoRR abs/2308.13785 (2023) - [i14]Zecheng Tang, Chenfei Wu, Juntao Li, Nan Duan:
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models. CoRR abs/2309.09506 (2023) - [i13]Wang You, Wenshan Wu, Yaobo Liang, Shaoguang Mao, Chenfei Wu, Maosong Cao, Yuzhe Cai, Yiduo Guo, Yan Xia, Furu Wei, Nan Duan:
EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation. CoRR abs/2310.08185 (2023) - 2022
- [c11]Estelle Aflalo, Meng Du, Shao-Yen Tseng, Yongfei Liu, Chenfei Wu, Nan Duan, Vasudev Lal:
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers. CVPR 2022: 21374-21383 - [c10]Kun Yan, Lei Ji, Chenfei Wu, Jianmin Bao, Ming Zhou, Nan Duan, Shuai Ma:
Trace Controlled Text to Image Generation. ECCV (36) 2022: 59-75 - [c9]Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan:
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion. ECCV (16) 2022: 720-736 - [c8]Yongfei Liu, Chenfei Wu, Shao-Yen Tseng, Vasudev Lal, Xuming He, Nan Duan:
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. NAACL-HLT (Findings) 2022: 1589-1600 - [c7]Jian Liang, Chenfei Wu, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis. NeurIPS 2022 - [c6]Lei Ji, Chenfei Wu, Daisy Zhou, Kun Yan, Edward Cui, Xilin Chen, Nan Duan:
Learning Temporal Video Procedure Segmentation from an Automatically Collected Large Dataset. WACV 2022: 2733-2742 - [i12]Minheng Ni, Chenfei Wu, Haoyang Huang, Daxin Jiang, Wangmeng Zuo, Nan Duan:
NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN. CoRR abs/2202.05009 (2022) - [i11]Estelle Aflalo, Meng Du, Shao-Yen Tseng, Yongfei Liu, Chenfei Wu, Nan Duan, Vasudev Lal:
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers. CoRR abs/2203.17247 (2022) - [i10]Jie Shi, Chenfei Wu, Jian Liang, Xiang Liu, Nan Duan:
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder. CoRR abs/2206.00386 (2022) - [i9]Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Nan Duan:
Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning. CoRR abs/2206.08657 (2022) - [i8]Chenfei Wu, Jian Liang, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis. CoRR abs/2207.09814 (2022) - [i7]Kun Yan, Lei Ji, Chenfei Wu, Jian Liang, Ming Zhou, Nan Duan, Shuai Ma:
HORIZON: A High-Resolution Panorama Synthesis Framework. CoRR abs/2210.04522 (2022) - [i6]Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang:
ReCo: Region-Controlled Text-to-Image Generation. CoRR abs/2211.15518 (2022) - 2021
- [c5]Lin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti, Arun Sacheti:
GEM: A General Evaluation Benchmark for Multimodal Tasks. ACL/IJCNLP (Findings) 2021: 2594-2603 - [i5]Chenfei Wu, Lun Huang, Qianxi Zhang, Binyang Li, Lei Ji, Fan Yang, Guillermo Sapiro, Nan Duan:
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions. CoRR abs/2104.14806 (2021) - [i4]Lin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti, Arun Sacheti:
GEM: A General Evaluation Benchmark for Multimodal Tasks. CoRR abs/2106.09889 (2021) - [i3]Yongfei Liu, Chenfei Wu, Shao-Yen Tseng, Vasudev Lal, Xuming He, Nan Duan:
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. CoRR abs/2109.10504 (2021) - [i2]Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan:
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion. CoRR abs/2111.12417 (2021)
2010 – 2019
- 2019
- [c4]Chenfei Wu, Jinlai Liu, Xiaojie Wang, Ruifan Li:
Differential Networks for Visual Question Answering. AAAI 2019: 8997-9004 - [i1]Chenfei Wu, Yanzhao Zhou, Gen Li, Nan Duan, Duyu Tang, Xiaojie Wang:
Deep Reason: A Strong Baseline for Real-World Visual Reasoning. CoRR abs/1905.10226 (2019) - 2018
- [c3]Jinlai Liu, Chenfei Wu, Xiaojie Wang, Xuan Dong:
Sequential Visual Reasoning for Visual Question Answering. CCIS 2018: 410-415 - [c2]Chenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong:
Object-Difference Attention: A Simple Relational Attention for Visual Question Answering. ACM Multimedia 2018: 519-527 - [c1]Chenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong:
Chain of Reasoning for Visual Question Answering. NeurIPS 2018: 273-283
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-12 20:57 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint