default search action
Qi Dai 0001
Person information
- affiliation: Microsoft Research Asia, Beijing, China
- affiliation (PhD 2017): Fudan University, School of Computer Science, Shanghai, China
Other persons with the same name
- Qi Dai — disambiguation page
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c37]Yuxuan Zhou, Xudong Yan, Zhi-Qi Cheng, Yan Yan, Qi Dai, Xian-Sheng Hua:
BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition. CVPR 2024: 2049-2058 - [c36]Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong:
ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models. CVPR Workshops 2024: 7395-7405 - [c35]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CVPR 2024: 7827-7839 - [c34]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CVPR 2024: 7882-7891 - [c33]Yanhui Wang, Jianmin Bao, Wenming Weng, Ruoyu Feng, Dacheng Yin, Tao Yang, Jingxu Zhang, Qi Dai, Zhiyuan Zhao, Chunyu Wang, Kai Qiu, Yuhui Yuan, Xiaoyan Sun, Chong Luo, Baining Guo:
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation. CVPR 2024: 8414-8424 - [i32]Shuyuan Tu, Qi Dai, Zihao Zhang, Sicheng Xie, Zhi-Qi Cheng, Chong Luo, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion. CoRR abs/2405.20325 (2024) - [i31]Zhen Xing, Qi Dai, Zejia Weng, Zuxuan Wu, Yu-Gang Jiang:
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction. CoRR abs/2406.06465 (2024) - [i30]Miaosen Zhang, Yixuan Wei, Zhen Xing, Yifei Ma, Zuxuan Wu, Ji Li, Zheng Zhang, Qi Dai, Chong Luo, Xin Geng, Baining Guo:
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms. CoRR abs/2406.09397 (2024) - [i29]Minghan Li, Heng Li, Zhi-Qi Cheng, Yifei Dong, Yuxuan Zhou, Jun-Yan He, Qi Dai, Teruko Mitamura, Alexander G. Hauptmann:
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions. CoRR abs/2406.19236 (2024) - 2023
- [j7]Dayan Wu, Qi Dai, Bo Li, Weiping Wang:
Deep Uncoupled Discrete Hashing via Similarity Matrix Decomposition. ACM Trans. Multim. Comput. Commun. Appl. 19(1): 22:1-22:22 (2023) - [c32]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu:
On Data Scaling in Masked Image Modeling. CVPR 2023: 10365-10374 - [c31]Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
SVFormer: Semi-supervised Video Transformer for Action Recognition. CVPR 2023: 18816-18826 - [c30]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CVPR 2023: 22721-22731 - [c29]Yan Liu, Xiaokang Chen, Qi Dai:
Parallel Sentence-Level Explanation Generation for Real-World Low-Resource Scenarios. ICASSP 2023: 1-5 - [c28]Jia Ning, Chen Li, Zheng Zhang, Chunyu Wang, Zigang Geng, Qi Dai, Kun He, Han Hu:
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token. ICCV 2023: 19843-19853 - [c27]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. ICCV 2023: 19879-19890 - [c26]Zhi-Qi Cheng, Qi Dai, Alexander G. Hauptmann:
ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules. ICCV 2023: 22145-22156 - [c25]Xiaosong Zhang, Yunjie Tian, Lingxi Xie, Wei Huang, Qi Dai, Qixiang Ye, Qi Tian:
HiViT: A Simpler and More Efficient Design of Hierarchical Vision Transformer. ICLR 2023 - [i28]Jia Ning, Chen Li, Zheng Zhang, Zigang Geng, Qi Dai, Kun He, Han Hu:
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token. CoRR abs/2301.02229 (2023) - [i27]Yan Liu, Xiaokang Chen, Qi Dai:
Parallel Sentence-Level Explanation Generation for Real-World Low-Resource Scenarios. CoRR abs/2302.10707 (2023) - [i26]Zhi-Qi Cheng, Qi Dai, Siyao Li, Jingdong Sun, Teruko Mitamura, Alexander G. Hauptmann:
ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules. CoRR abs/2304.02173 (2023) - [i25]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. CoRR abs/2304.10465 (2023) - [i24]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CoRR abs/2308.09710 (2023) - [i23]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. CoRR abs/2310.10647 (2023) - [i22]Yanhui Wang, Jianmin Bao, Wenming Weng, Ruoyu Feng, Dacheng Yin, Tao Yang, Jingxu Zhang, Qi Dai, Zhiyuan Zhao, Chunyu Wang, Kai Qiu, Yuhui Yuan, Xiaoyan Sun, Chong Luo, Baining Guo:
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation. CoRR abs/2311.18829 (2023) - [i21]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CoRR abs/2311.18830 (2023) - [i20]Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong:
ART·V: Auto-Regressive Text-to-Video Generation with Diffusion Models. CoRR abs/2311.18834 (2023) - [i19]Zhen Xing, Qi Dai, Zihao Zhang, Hui Zhang, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models. CoRR abs/2311.18837 (2023) - 2022
- [c24]Yan Liu, Sanyuan Chen, Yazheng Yang, Qi Dai:
MPII: Multi-Level Mutual Promotion for Inference and Interpretation. ACL (1) 2022: 7074-7084 - [c23]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, Han Hu:
SimMIM: a Simple Framework for Masked Image Modeling. CVPR 2022: 9643-9653 - [c22]Zhi-Qi Cheng, Qi Dai, Hong Li, Jingkuan Song, Xiao Wu, Alexander G. Hauptmann:
Rethinking Spatial Invariance of Convolutional Networks for Object Counting. CVPR 2022: 19606-19616 - [c21]Qi Han, Zejia Fan, Qi Dai, Lei Sun, Ming-Ming Cheng, Jiaying Liu, Jingdong Wang:
On the Connection between Local Attention and Dynamic Depth-wise Convolution. ICLR 2022 - [c20]Zhi-Qi Cheng, Qi Dai, Siyao Li, Teruko Mitamura, Alexander Hauptmann:
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement. ACM Multimedia 2022: 3272-3281 - [i18]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu-Gang Jiang:
Deeper Insights into ViTs Robustness towards Common Corruptions. CoRR abs/2204.12143 (2022) - [i17]Xiaosong Zhang, Yunjie Tian, Wei Huang, Qixiang Ye, Qi Dai, Lingxi Xie, Qi Tian:
HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling. CoRR abs/2205.14949 (2022) - [i16]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu:
On Data Scaling in Masked Image Modeling. CoRR abs/2206.04664 (2022) - [i15]Zhi-Qi Cheng, Qi Dai, Hong Li, Jingkuan Song, Xiao Wu, Alexander G. Hauptmann:
Rethinking Spatial Invariance of Convolutional Networks for Object Counting. CoRR abs/2206.05253 (2022) - [i14]Zhi-Qi Cheng, Qi Dai, Siyao Li, Teruko Mitamura, Alexander Hauptmann:
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement. CoRR abs/2208.08965 (2022) - [i13]Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
SVFormer: Semi-supervised Video Transformer for Action Recognition. CoRR abs/2211.13222 (2022) - [i12]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CoRR abs/2212.00776 (2022) - 2021
- [j6]Qi He, Qi Dai, Xiao Wu, Jun-Yan He:
A novel class restriction loss for unsupervised domain adaptation. Neurocomputing 461: 254-265 (2021) - [j5]Xingbo Liu, Xiushan Nie, Qi Dai, Yupan Huang, Li Lian, Yilong Yin:
Reinforced Short-Length Hashing. IEEE Trans. Circuits Syst. Video Technol. 31(9): 3655-3668 (2021) - [c19]Baifeng Shi, Qi Dai, Judy Hoffman, Kate Saenko, Trevor Darrell, Huijuan Xu:
Temporal Action Detection with Multi-level Supervision. ICCV 2021: 8002-8012 - [i11]Zhenda Xie, Yutong Lin, Zhuliang Yao, Zheng Zhang, Qi Dai, Yue Cao, Han Hu:
Self-Supervised Learning with Swin Transformers. CoRR abs/2105.04553 (2021) - [i10]Qi Han, Zejia Fan, Qi Dai, Lei Sun, Ming-Ming Cheng, Jiaying Liu, Jingdong Wang:
Demystifying Local Vision Transformer: Sparse Connectivity, Weight Sharing, and Dynamic Weight. CoRR abs/2106.04263 (2021) - [i9]Shaobo Min, Qi Dai, Hongtao Xie, Chuang Gan, Yongdong Zhang, Jingdong Wang:
Cross-Modal Attention Consistency for Video-Audio Unsupervised Learning. CoRR abs/2106.06939 (2021) - [i8]Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, Han Hu:
SimMIM: A Simple Framework for Masked Image Modeling. CoRR abs/2111.09886 (2021) - 2020
- [c18]Baifeng Shi, Qi Dai, Yadong Mu, Jingdong Wang:
Weakly-Supervised Action Localization by Generative Attention Modeling. CVPR 2020: 1006-1016 - [c17]Baifeng Shi, Dinghuai Zhang, Qi Dai, Zhanxing Zhu, Yadong Mu, Jingdong Wang:
Informative Dropout for Robust Representation Learning: A Shape-bias Perspective. ICML 2020: 8828-8839 - [i7]Baifeng Shi, Qi Dai, Yadong Mu, Jingdong Wang:
Weakly-Supervised Action Localization by Generative Attention Modeling. CoRR abs/2003.12424 (2020) - [i6]Xingbo Liu, Xiushan Nie, Qi Dai, Yupan Huang, Yilong Yin:
Reinforcing Short-Length Hashing. CoRR abs/2004.11511 (2020) - [i5]Baifeng Shi, Dinghuai Zhang, Qi Dai, Zhanxing Zhu, Yadong Mu, Jingdong Wang:
Informative Dropout for Robust Representation Learning: A Shape-bias Perspective. CoRR abs/2008.04254 (2020) - [i4]Baifeng Shi, Qi Dai, Judy Hoffman, Kate Saenko, Trevor Darrell, Huijuan Xu:
Temporal Action Detection with Multi-level Supervision. CoRR abs/2011.11893 (2020)
2010 – 2019
- 2019
- [c16]Dayan Wu, Qi Dai, Jing Liu, Bo Li, Weiping Wang:
Deep Incremental Hashing Network for Efficient Image Retrieval. CVPR 2019: 9069-9077 - [c15]Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander G. Hauptmann:
Learning Spatial Awareness to Improve Crowd Counting. ICCV 2019: 6151-6160 - [c14]Yupan Huang, Qi Dai, Yutong Lu:
Decoupling Localization and Classification in Single Shot Temporal Action Detection. ICME 2019: 1288-1293 - [c13]Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Jun-Yan He, Alexander G. Hauptmann:
Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting. ACM Multimedia 2019: 1897-1906 - [i3]Yupan Huang, Qi Dai, Yutong Lu:
Decoupling Localization and Classification in Single Shot Temporal Action Detection. CoRR abs/1904.07442 (2019) - [i2]Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander G. Hauptmann:
Learning Spatial Awareness to Improve Crowd Counting. CoRR abs/1909.07057 (2019) - [i1]Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Jun-Yan He, Alexander G. Hauptmann:
Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting. CoRR abs/1909.07608 (2019) - 2018
- [c12]Dong Li, Zhaofan Qiu, Qi Dai, Ting Yao, Tao Mei:
Recurrent Tubelet Proposal and Recognition Networks for Action Detection. ECCV (6) 2018: 306-322 - [c11]Fuchen Long, Ting Yao, Qi Dai, Xinmei Tian, Jiebo Luo, Tao Mei:
Deep Domain Adaptation Hashing with Adversarial Learning. SIGIR 2018: 725-734 - 2016
- [j4]Qi Dai, Jianguo Li, Jun Wang, Yurong Chen, Yu-Gang Jiang:
A Bayesian Hashing approach and its application to face recognition. Neurocomputing 213: 5-13 (2016) - [c10]Qi Dai, Jianguo Li, Jingdong Wang, Yu-Gang Jiang:
Binary Optimized Hashing. ACM Multimedia 2016: 1247-1256 - 2015
- [j3]Yu-Gang Jiang, Qi Dai, Wei Liu, Xiangyang Xue, Chong-Wah Ngo:
Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling. IEEE Trans. Image Process. 24(11): 3781-3795 (2015) - [j2]Yu-Gang Jiang, Qi Dai, Tao Mei, Yong Rui, Shih-Fu Chang:
Super Fast Event Recognition in Internet Videos. IEEE Trans. Multim. 17(8): 1174-1186 (2015) - [c9]Qi Dai, Jianguo Li, Jun Wang, Yurong Chen, Yu-Gang Jiang:
Optimal Bayesian Hashing for Efficient Face Recognition. IJCAI 2015: 3430-3437 - [c8]Qi Dai, Rui-Wei Zhao, Zuxuan Wu, Xi Wang, Zichen Gu, Wenhai Wu, Yu-Gang Jiang:
Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning. MediaEval 2015 - 2014
- [c7]Jian Tu, Zuxuan Wu, Qi Dai, Yu-Gang Jiang, Xiangyang Xue:
Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation. ICME Workshops 2014: 1-6 - [c6]Qi Dai, Zuxuan Wu, Yu-Gang Jiang, Xiangyang Xue, Jinhui Tang:
Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks. MediaEval 2014 - 2013
- [c5]Qi Dai, Jian Tu, Ziqiang Shi, Yu-Gang Jiang, Xiangyang Xue:
Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes. MediaEval 2013 - [c4]Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang:
Beauty is here: evaluating aesthetics in videos using multimodal features and free training data. ACM Multimedia 2013: 369-372 - 2012
- [j1]Yu-Gang Jiang, Qi Dai, Jun Wang, Chong-Wah Ngo, Xiangyang Xue, Shih-Fu Chang:
Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation. IEEE Trans. Image Process. 21(6): 3080-3091 (2012) - [c3]Yu-Gang Jiang, Qi Dai, Xiangyang Xue, Wei Liu, Chong-Wah Ngo:
Trajectory-Based Modeling of Human Actions with Motion Reference Points. ECCV (5) 2012: 425-438 - [c2]Yu-Gang Jiang, Qi Dai, Chun Chet Tan, Xiangyang Xue, Chong-Wah Ngo:
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features. MediaEval 2012 - [c1]Yu-Gang Jiang, Qi Dai, Yingbin Zheng, Xiangyang Xue, Jie Liu, Dong Wang:
A fast video event recognition system and its application to video search. ACM Multimedia 2012: 1347-1348
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint