18th ECCV 2024: Milan, Italy - Part XXXVIII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol: Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXXVIII. Lecture Notes in Computer Science 15096, Springer 2025, ISBN 978-3-031-72919-5
- Luchuan Song, Pinxin Liu, Lele Chen, Guojun Yin, Chenliang Xu: Tri²-plane: Thinking Head Avatar via Feature Pyramid. 1-20
- Yuzhong Zhao, Yue Liu, Zonghao Guo, Weijia Wu, Chen Gong, Qixiang Ye, Fang Wan: ControlCap: Controllable Region-Level Captioning. 21-38
- Jilong Wang, Saihui Hou, Yan Huang, Chunshui Cao, Xu Liu, Yongzhen Huang, Tianzhu Zhang, Liang Wang: Free Lunch for Gait Recognition: A Novel Relation Descriptor. 39-56
- Weitai Kang, Gaowen Liu, Mubarak Shah, Yan Yan: SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. 57-75
- Xiaoran Zhang, John C. Stendahl, Lawrence H. Staib, Albert J. Sinusas, Alex Wong, James S. Duncan: Adaptive Correspondence Scoring for Unsupervised Medical Image Registration. 76-92
- Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M. Patel: MaxFusion: Plug&Play Multi-modal Generation in Text-to-Image Diffusion Models. 93-110
- Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski: Watch Your Steps: Local Image and Scene Editing by Text Instructions. 111-129
- Hritam Basak, Zhaozheng Yin: Forget More to Learn More: Domain-Specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation. 130-148
- Anh Thai, Weiyao Wang, Hao Tang, Stefan Stojanov, James M. Rehg, Matt Feiszli: 3×2: 3D Object Part Segmentation by 2D Semantic Correspondences. 149-166
- Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang: Idea2Img: Iterative Self-refinement with GPT-4V for Automatic Image Design and Generation. 167-184
- Gustavo Pérez, Daniel Sheldon, Grant Van Horn, Subhransu Maji: Human-in-the-Loop Visual Re-ID for Population Size Estimation. 185-202
- Lingchen Meng, Shiyi Lan, Hengduo Li, José M. Álvarez, Zuxuan Wu, Yu-Gang Jiang: SegIC: Unleashing the Emergent Correspondence for In-Context Segmentation. 203-220
- Weiwei Sun, Eduard Trulls, Yang-Che Tseng, Sneha Sambandam, Gopal Sharma, Andrea Tagliasacchi, Kwang Moo Yi: PointNeRF++: A Multi-scale, Point-Based Neural Radiance Field. 221-238
- Junfei Xiao, Ziqi Zhou, Wenxuan Li, Shiyi Lan, Jieru Mei, Zhiding Yu, Bingchen Zhao, Alan L. Yuille, Yuyin Zhou, Cihang Xie: A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties. 239-258
- Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang: UMG-CLIP: A Unified Multi-granularity Vision Generalist for Open-World Understanding. 259-277
- Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen, Simon Niklaus, Jianming Zhang, Jia-Bin Huang, Feng Liu: Fast View Synthesis of Casual Videos with Soup-of-Planes. 278-296
- Neerja Thakkar, Karttikeya Mangalam, Andrea Bajcsy, Jitendra Malik: Adaptive Human Trajectory Prediction via Latent Corridors. 297-314
- Rohan Choudhury, Koichiro Niinuma, Kris M. Kitani, László A. Jeni: Video Question Answering with Procedural Programs. 315-332
- Wenhui Zhu, Xiwen Chen, Peijie Qiu, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang: DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification. 333-351
- Dong Huo, Zixin Guo, Xinxin Zuo, Zhihao Shi, Juwei Lu, Peng Dai, Songcen Xu, Li Cheng, Yee-Hong Yang: TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling. 352-368
- Rongchang Li, Zhenhua Feng, Tianyang Xu, Linze Li, Xiaojun Wu, Muhammad Awais, Sara Atito Ali Ahmed, Josef Kittler: C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition. 369-388
- Bin Xia, Shiyin Wang, Yingfan Tao, Yitong Wang, Jiaya Jia: LLMGA: Multimodal Large Language Model Based Generation Assistant. 389-406
- Mi Luo, Zihui Xue, Alex Dimakis, Kristen Grauman: Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos. 407-425
- Sriram Narayanan, Mani Ramanagopal, Mark Sheinin, Aswin C. Sankaranarayanan, Srinivasa G. Narasimhan: Shape from Heat Conduction. 426-444
- Moritz Heep, Eduard Zell: An Adaptive Screen-Space Meshing Approach for Normal Integration. 445-461
- Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang: Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation. 462-478
- Eugene Valassakis, Guillermo Garcia-Hernando: HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning. 479-496