18th ECCV 2024: Milan, Italy - Part LVI
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LVI. Lecture Notes in Computer Science 15114, Springer 2025, ISBN 978-3-031-72991-1
- Nina Shvetsova, Anna Kukleva, Xudong Hong, Christian Rupprecht, Bernt Schiele, Hilde Kuehne:
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale. 1-18
- Sanmin Kim, Youngseok Kim, Sihwan Hwang, Hyeonjun Jeong, Dongsuk Kum:
LabelDistill: Label-Guided Cross-Modal Knowledge Distillation for Camera-Based 3D Object Detection. 19-37
- Hyeong-Seok Jeon, Sanmin Kim, Abi Rahman Syamil, Junsoo Kim, Dongsuk Kum:
Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction. 38-53
- Hasan Abed Al Kader Hammoud, Tuhin Das, Fabio Pizzati, Philip H. S. Torr, Adel Bibi, Bernard Ghanem:
On Pretraining Data Diversity for Self-Supervised Learning. 54-71
- Gianluca Scarpellini, Stefano Rosa, Pietro Morerio, Lorenzo Natale, Alessio Del Bue:
Look Around and Learn: Self-training Object Detection by Exploration. 72-88
- Ozan Unal, Christos Sakaridis, Luc Van Gool:
Bayesian Self-training for Semi-supervised 3D Segmentation. 89-107
- Zhongyang Ren, Bangyan Liao, Delei Kong, Jinghang Li, Peidong Liu, Laurent Kneip, Guillermo Gallego, Yi Zhou:
Motion and Structure from Event-Based Normal Flow. 108-125
- Qiran Zou, Shangyuan Yuan, Shian Du, Yu Wang, Chang Liu, Yi Xu, Jie Chen, Xiangyang Ji:
ParCo: Part-Coordinating Text-to-Motion Synthesis. 126-143
- Zheng Zhang, Wenjie Ai, Kevin Wells, David Rosewarne, Thanh-Toan Do, Gustavo Carneiro:
Learning to Complement and to Defer to Multiple Users. 144-162
- Qingyuan Wang, Barry Cardiff, Antoine Frappé, Benoit Larras, Deepu John:
Tiny Models are the Computational Saver for Large Models. 163-182
- Yufan Deng, Ruida Wang, Yuhao Zhang, Yu-Wing Tai, Chi-Keung Tang:
DragVideo: Interactive Drag-Style Video Editing. 183-199
- Zeqian Li, Qirui Chen, Tengda Han, Ya Zhang, Yanfeng Wang, Weidi Xie:
Multi-sentence Grounding for Long-Term Instructional Video. 200-216
- Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Ayan Kumar Bhunia, Yi-Zhe Song:
Do Generalised Classifiers Really Work on Human Drawn Sketches? 217-235
- Zhihao Xu, Shengjie Gong, Jiapeng Tang, Lingyu Liang, Yining Huang, Haojie Li, Shuangping Huang:
KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding. 236-253
- Yuxiao He, Yiyu Zhuang, Yanwen Wang, Yao Yao, Siyu Zhu, Xiaoyu Li, Qi Zhang, Xun Cao, Hao Zhu:
Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°. 254-272
- Rui Zhao, Yuchao Gu, Jay Zhangjie Wu, David Junhao Zhang, Jia-Wei Liu, Weijia Wu, Jussi Keppo, Mike Zheng Shou:
MotionDirector: Motion Customization of Text-to-Video Diffusion Models. 273-290
- Yang Wu, Kaihua Zhang, Jianjun Qian, Jin Xie, Jian Yang:
Text2LiDAR: Text-Guided LiDAR Point Cloud Generation via Equirectangular Transformer. 291-310
- Sungjune Kim, Hadam Baek, Seunggwan Lee, Hyung-Gun Chi, Hyerin Lim, Jinkyu Kim, Sangpil Kim:
Enhanced Motion Forecasting with Visual Relation Reasoning. 311-328
- Jinming Liu, Ruoyu Feng, Yunpeng Qi, Qiuyu Chen, Zhibo Chen, Wenjun Zeng, Xin Jin:
Rate-Distortion-Cognition Controllable Versatile Neural Image Compression. 329-348
- Zixuan Fu, Lanqing Guo, Chong Wang, Yufei Wang, Zhihao Li, Bihan Wen:
Temporal As a Plugin: Unsupervised Video Denoising with Pre-trained Image Denoisers. 349-367
- Yujeong Chae, Hyeonseong Kim, Changgyoon Oh, Minseok Kim, Kuk-Jin Yoon:
LiDAR-Based All-Weather 3D Object Detection via Prompting and Distilling 4D Radar. 368-385
- Xin Liu, Yichen Zhu, Jindong Gu, Yunshi Lan, Chao Yang, Yu Qiao:
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models. 386-403
- Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang, Wenwu Zhu:
Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models. 404-420
- Eric Brachmann, Jamie Wynn, Shuai Chen, Tommaso Cavallari, Áron Monszpart, Daniyar Turmukhambetov, Victor Adrian Prisacariu:
Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer. 421-440
- Ruicheng Wang, Jianfeng Xiang, Jiaolong Yang, Xin Tong:
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-trained Diffusion Priors. 441-458
- Xinyu Yang, Hossein Rahmani, Sue Black, Bryan M. Williams:
Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation. 459-478
- Ming Tao, Bing-Kun Bao, Hao Tang, Yaowei Wang, Changsheng Xu:
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion. 479-495