default search action
18th ECCV 2024: Milan, Italy - Part XII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XII. Lecture Notes in Computer Science 15070, Springer 2025, ISBN 978-3-031-73253-9 - Ziyue Huang, Yongchao Feng, Qingjie Liu, Yunhong Wang:
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection. 1-17 - Minlong Lu, Yichen Lu, Siwei Nie, Xudong Yang, Xiaobo Zhang:
Self-supervised Video Copy Localization with Regional Token Representation. 18-35 - Claudio Rota, Marco Buzzelli, Joost van de Weijer:
Enhancing Perceptual Quality in Video Super-Resolution Through Temporally-Consistent Detail Synthesis Using Diffusion Models. 36-53 - Sibi Catley-Chandar, Richard Shaw, Gregory G. Slabaugh, Eduardo Pérez-Pellitero:
RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF. 54-71 - ShahRukh Athar, Shunsuke Saito, Zhengyu Yang, Stanislav Pidhorskyi, Chen Cao:
Bridging the Gap: Studio-Like Avatar Creation from a Monocular Phone Capture. 72-88 - Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Ziheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang:
ControlLLM: Augment Language Models with Tools by Searching on Graphs. 89-105 - Lan Feng, Mohammadhossein Bahari, Kaouther Messaoud Ben Amor, Éloi Zablocki, Matthieu Cord, Alexandre Alahi:
UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction. 106-123 - Zizheng Yan, Jiapeng Zhou, Fanpeng Meng, Yushuang Wu, Lingteng Qiu, Zisheng Ye, Shuguang Cui, Guanying Chen, Xiaoguang Han:
DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors. 124-141 - Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun:
Vamos: Versatile Action Models for Video Understanding. 142-160 - Xinyu Sun, Lizhao Liu, Hongyan Zhi, Ronghe Qiu, Junwei Liang:
Prioritized Semantic Learning for Zero-Shot Instance Navigation. 161-178 - Zhongxing Ma, Shuang Liang, Yongkun Wen, Weixin Lu, Guowei Wan:
RoadPainter: Points Are Ideal Navigators for Topology TransformER. 179-195 - Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li:
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis. 196-212 - Jiahui Liu, Xin Wen, Shizhen Zhao, Yingxian Chen, Xiaojuan Qi:
Can OOD Object Detectors Learn from Foundation Models? 213-231 - Xiang Fan, Anand Bhattad, Ranjay Krishna:
VIDEOSHOP: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion. 232-250 - Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman:
MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo. 251-269 - Qiangqiang Wu, Yan Xia, Jia Wan, Antoni B. Chan:
Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training. 270-288 - Junsung Lee, Minsoo Kang, Bohyung Han:
Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation. 289-304 - Siqi Yang, Zhaojun Huang, Yakun Chang, Bin Fan, Zhaofei Yu, Boxin Shi:
Real-Data-Driven 2000 FPS Color Video from Mosaicked Chromatic Spikes. 305-321 - Peirong Liu, Oula Puonti, Xiaoling Hu, Daniel C. Alexander, Juan Eugenio Iglesias:
Brain-ID: Learning Contrast-Agnostic Anatomical Representations for Brain Imaging. 322-340 - Youssef Mansour, Xuyang Zhong, Serdar I. Caglar, Reinhard Heckel:
TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts. 341-357 - Fernando Pérez-García, Sam Bond-Taylor, Pedro P. Sanchez, Boris van Breugel, Daniel C. Castro, Harshita Sharma, Valentina Salvatelli, Maria T. A. Wetscherek, Hannah Richardson, Matthew P. Lungren, Aditya V. Nori, Javier Alvarez-Valle, Ozan Oktay, Maximilian Ilse:
RadEdit: Stress-Testing Biomedical Vision Models via Diffusion Image Editing. 358-376 - Orcun Cetintas, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé:
SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow. 377-395 - Yuanting Fan, Chengxu Liu, Nengzhong Yin, Changlong Gao, Xueming Qian:
AdaDiffSR: Adaptive Region-Aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution. 396-413 - Hang Xu, Chen Long, Wenxiao Zhang, Yuan Liu, Zhen Cao, Zhen Dong, Bisheng Yang:
Explicitly Guided Information Interaction Network for Cross-Modal Point Cloud Completion. 414-432 - Taewoo Kim, Jaeseok Jeong, Hoonhee Cho, Yuhwan Jeong, Kuk-Jin Yoon:
Towards Real-World Event-Guided Low-Light Video Enhancement and Deblurring. 433-451 - Zixin Zhu, Xuelu Feng, Dongdong Chen, Junsong Yuan, Chunming Qiao, Gang Hua:
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation. 452-469 - Jinjie Mai, Wenxuan Zhu, Sara Rojas, Jesus Zarzar, Abdullah Hamdi, Guocheng Qian, Bing Li, Silvio Giancola, Bernard Ghanem:
TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks. 470-489
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.