default search action
18th ECCV 2024: Milan, Italy - Part L
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part L. Lecture Notes in Computer Science 15108, Springer 2025, ISBN 978-3-031-72972-0 - Xinpeng Liu, Haowen Hou, Yanchao Yang, Yong-Lu Li, Cewu Lu:
Revisit Human-Scene Interaction via Space Occupancy. 1-19 - Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu:
Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control. 20-36 - Haisheng Fu, Jie Liang, Zhenman Fang, Jingning Han, Feng Liang, Guohe Zhang:
WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model. 37-53 - Pengyu Li, Tianchu Guo, Biao Wang, Xian-Sheng Hua:
Grid-Attention: Enhancing Computational Efficiency of Large Vision Models Without Fine-Tuning. 54-70 - Gilhan Park, WonJun Moon, SuBeen Lee, Tae-Young Kim, Jae-Pil Heo:
Mitigating Background Shift in Class-Incremental Semantic Segmentation. 71-88 - Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen, Xuguang Lan:
Relation DETR: Exploring Explicit Position Relation Prior for Object Detection. 89-105 - Zekai Xu, Kang You, Qinghai Guo, Xiang Wang, Zhezhi He:
BKDSNN: Enhancing the Performance of Learning-Based Spiking Neural Networks Training with Blurred Knowledge Distillation. 106-123 - Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang:
Agent Attention: On the Integration of Softmax and Linear Attention. 124-140 - Quoc-Huy Tran, Muhammad Ahmed, Murad Popattia, M. Hassan Ahmed, Andrey Konin, M. Zeeshan Zia:
Learning by Aligning 2D Skeleton Sequences and Multi-modality Fusion. 141-161 - Kohei Ashida, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita:
Resolving Scale Ambiguity in Multi-view 3D Reconstruction Using Dual-Pixel Sensors. 162-178 - Shibin Mei, Bingbing Ni, Hang Wang, Chenglong Zhao, Fengfa Hu, Zhiming Pi, Bilian Ke:
Object-Oriented Anchoring and Modal Alignment in Multimodal Learning. 179-196 - Jiabao Wang, Qiang Meng, Guochao Liu, Liujiang Yan, Ke Wang, Ming-Ming Cheng, Qibin Hou:
Towards Stable 3D Object Detection. 197-213 - Byunggwan Son, Youngmin Oh, Donghyeon Baek, Bumsub Ham:
FYI: Flip Your Images for Dataset Distillation. 214-230 - Hyeonseong Kim, Sung-Hoon Yoon, Minseok Kim, Kuk-Jin Yoon:
On-the-Fly Category Discovery for LiDAR Semantic Segmentation. 231-249 - Renlong Wu, Zhilu Zhang, Yu Yang, Wangmeng Zuo:
Dual-Camera Smooth Zoom on Mobile Phones. 250-269 - Xumin Yu, Yanbo Wang, Jie Zhou, Jiwen Lu:
ProtoComp: Diverse Point Cloud Completion with Controllable Prototype. 270-286 - Long Li, Nian Liu, Dingwen Zhang, Zhongyu Li, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal, Junwei Han, Fahad Shahbaz Khan:
CONDA: Condensed Deep Association Learning for Co-salient Object Detection. 287-303 - Ge Wu, Xin Zhang, Zheng Li, Zhaowei Chen, Jiajun Liang, Jian Yang, Xiang Li:
Cascade Prompt Learning for Vision-Language Model Adaptation. 304-321 - Yuzhou Liu, Lingjie Zhu, Xiaodong Ma, Hanqiao Ye, Xiang Gao, Xianwei Zheng, Shuhan Shen:
PolyRoom: Room-Aware Transformer for Floorplan Reconstruction. 322-339 - Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Yaohang Li, Xing Luo, Chenyu Yi, Alex C. Kot:
BenchLMM: Benchmarking Cross-Style Visual Capability of Large Multimodal Models. 340-358 - Mingjun Zheng, Long Sun, Jiangxin Dong, Jinshan Pan:
SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution. 359-375 - Zhongyu Xia, Zhiwei Lin, Xinhao Wang, Yongtao Wang, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang:
HENet: Hybrid Encoding for End-to-End Multi-task 3D Perception from Multi-view Cameras. 376-392 - Bowei Xing, Xianghua Ying, Ruibin Wang, Ruohao Guo, Ji Shi, Wenzhen Yue:
Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation. 393-409 - Jian Jin, Yang Shen, Zhenyong Fu, Jian Yang:
Customized Generation Reimagined: Fidelity and Editability Harmonized. 410-426 - Kaishen Yuan, Zitong Yu, Xin Liu, Weicheng Xie, Huanjing Yue, Jingyu Yang:
AUFormer: Vision Transformers Are Parameter-Efficient Facial Action Unit Detectors. 427-445 - Yikang Zhou, Tao Zhang, Shunping Ji, Shuicheng Yan, Xiangtai Li:
Improving Video Segmentation via Dynamic Anchor Queries. 446-463 - Shunqi Mao, Chaoyi Zhang, Hang Su, Hwanjun Song, Igor Shalyminov, Weidong Cai:
Controllable Contextualized Image Captioning: Directing the Visual Narrative Through User-Defined Highlights. 464-481
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.