default search action
Yuhang Zang
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c11]Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Alpha-CLIP: A CLIP Model Focusing on Wherever you Want. CVPR 2024: 13019-13029 - [c10]Tianqi Liu, Guangcong Wang, Shoukang Hu, Liao Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo. ECCV (18) 2024: 37-53 - [c9]Beichen Zhang, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Jiaqi Wang:
Long-CLIP: Unlocking the Long-Text Capability of CLIP. ECCV (51) 2024: 310-325 - [c8]Yuhang Zang, Hanlin Goh, Joshua M. Susskind, Chen Huang:
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization. ICLR 2024 - [c7]Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models. ACM Multimedia 2024: 11198-11201 - [i35]Yuhang Zang, Hanlin Goh, Josh M. Susskind, Chen Huang:
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization. CoRR abs/2401.15914 (2024) - [i34]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024) - [i33]Ziyu Liu, Zeyi Sun, Yuhang Zang, Wei Li, Pan Zhang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition. CoRR abs/2403.13805 (2024) - [i32]Beichen Zhang, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Jiaqi Wang:
Long-CLIP: Unlocking the Long-Text Capability of CLIP. CoRR abs/2403.15378 (2024) - [i31]Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? CoRR abs/2403.20330 (2024) - [i30]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024) - [i29]Tao Chu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Qiong Liu, Jiaqi Wang:
Unified Scene Representation and Reconstruction for 3D Large Language Models. CoRR abs/2404.13044 (2024) - [i28]Tianqi Liu, Guangcong Wang, Shoukang Hu, Li Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo. CoRR abs/2405.12218 (2024) - [i27]Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang:
Streaming Long Video Understanding with Large Language Models. CoRR abs/2405.16009 (2024) - [i26]Zeyi Sun, Tong Wu, Pan Zhang, Yuhang Zang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Bootstrap3D: Improving 3D Content Creation with Synthetic Data. CoRR abs/2406.00093 (2024) - [i25]Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. CoRR abs/2406.04325 (2024) - [i24]Pengyang Ling, Jiazi Bu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Tong Wu, Huaian Chen, Jiaqi Wang, Yi Jin:
MotionClone: Training-Free Motion Cloning for Controllable Video Generation. CoRR abs/2406.05338 (2024) - [i23]Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Jiaqi Huang, Zunnan Xu, Xiu Li, Kehong Yuan, Yanyan Zu, Jiayao Ha, Qiong Gao, Licheng Jiao:
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results. CoRR abs/2406.11739 (2024) - [i22]Ziyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang, Zijian Liang, Yuanjun Xiong, Yu Qiao, Dahua Lin, Jiaqi Wang:
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs. CoRR abs/2406.11833 (2024) - [i21]Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun:
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations. CoRR abs/2407.01523 (2024) - [i20]Zihao Huang, Shoukang Hu, Guangcong Wang, Tianqi Liu, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation. CoRR abs/2407.02165 (2024) - [i19]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024) - [i18]Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models. CoRR abs/2407.11691 (2024) - [i17]Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way. CoRR abs/2410.06241 (2024) - [i16]Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate. CoRR abs/2410.07167 (2024) - [i15]Shuangrui Ding, Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Yuwei Guo, Dahua Lin, Jiaqi Wang:
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree. CoRR abs/2410.16268 (2024) - [i14]Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, Dahua Lin:
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction. CoRR abs/2410.17247 (2024) - [i13]Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. CoRR abs/2410.17637 (2024) - 2023
- [j1]Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch. Int. J. Comput. Vis. 131(4): 987-1001 (2023) - [i12]Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch. CoRR abs/2305.14813 (2023) - [i11]Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy:
Contextual Object Detection with Multimodal Large Language Models. CoRR abs/2305.18279 (2023) - [i10]Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want. CoRR abs/2312.03818 (2023) - 2022
- [c6]Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Open-Vocabulary DETR with Conditional Matching. ECCV (9) 2022: 106-122 - [i9]Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Open-Vocabulary DETR with Conditional Matching. CoRR abs/2203.11876 (2022) - [i8]Kaiyang Zhou, Yuanhan Zhang, Yuhang Zang, Jingkang Yang, Chen Change Loy, Ziwei Liu:
On-Device Domain Generalization. CoRR abs/2209.07521 (2022) - [i7]Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Unified Vision and Language Prompt Learning. CoRR abs/2210.07225 (2022) - 2021
- [c5]Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CVPR 2021: 9695-9704 - [c4]Yuhang Zang, Chen Huang, Chen Change Loy:
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation. ICCV 2021: 3437-3446 - [i6]Yuhang Zang, Chen Huang, Chen Change Loy:
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation. CoRR abs/2102.12867 (2021) - 2020
- [c3]Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng Yuan:
KPNet: Towards Minimal Face Detector. AAAI 2020: 12015-12022 - [i5]Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng Yuan:
KPNet: Towards Minimal Face Detector. CoRR abs/2003.07543 (2020) - [i4]Yu Liu, Guanglu Song, Yuhang Zang, Yan Gao, Enze Xie, Junjie Yan, Chen Change Loy, Xiaogang Wang:
1st Place Solutions for OpenImage2019 - Object Detection and Instance Segmentation. CoRR abs/2003.07557 (2020) - [i3]Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CoRR abs/2008.10032 (2020)
2010 – 2019
- 2019
- [c2]Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li:
Scene Text Detection with Supervised Pyramid Context Network. AAAI 2019: 9038-9045 - [c1]Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen:
Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network. ICCV 2019: 8439-8448 - [i2]Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen:
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network. CoRR abs/1908.05900 (2019) - 2018
- [i1]Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li:
Scene Text Detection with Supervised Pyramid Context Network. CoRR abs/1811.08605 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-28 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint