default search action
18th ECCV 2024: Milan, Italy - Part XXIX
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXIX. Lecture Notes in Computer Science 15087, Springer 2025, ISBN 978-3-031-73396-3 - Sohyun Lee, Namyup Kim, Sungyeon Kim, Suha Kwak:
FREST: Feature RESToration for Semantic Segmentation Under Multiple Adverse Conditions. 1-18 - Federico Nocentini, Thomas Besnier, Claudio Ferrari, Sylvain Arguillère, Stefano Berretti, Mohamed Daoudi:
ScanTalk: 3D Talking Heads from Unregistered Scans. 19-36 - Xianghao Kong, Jinyu Chen, Wenguan Wang, Hang Su, Xiaolin Hu, Yi Yang, Si Liu:
Controllable Navigation Instruction Generation with Chain of Thought Prompting. 37-54 - Haiyang Wang, Hao Tang, Li Jiang, Shaoshuai Shi, Muhammad Ferjad Naeem, Hongsheng Li, Bernt Schiele, Liwei Wang:
GiT: Towards Generalist Vision Transformer Through Universal Language Interface. 55-73 - Chenhang He, Ruihuang Li, Guowen Zhang, Lei Zhang:
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention. 74-92 - Chao Dai, Yang Wang, Chaolin Huang, Jiakai Zhou, Qilin Xu, Minpeng Xu:
A Cephalometric Landmark Regression Method Based on Dual-Encoder for High-Resolution X-Ray Image. 93-109 - Jikai Zheng, Mingjiang Liang, Shaoli Huang, Jifeng Ning:
Exploring the Feature Extraction and Relation Modeling For Light-Weight Transformer Tracking. 110-126 - Yiming Ren, Xiao Han, Yichen Yao, Xiaoxiao Long, Yujing Sun, Yuexin Ma:
LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment. 127-144 - Mehdi Noroozi, Isma Hadji, Brais Martínez, Adrian Bulat, Georgios Tzimiropoulos:
You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation. 145-161 - Mingqiao Ye, Martin Danelljan, Fisher Yu, Lei Ke:
Gaussian Grouping: Segment and Edit Anything in 3D Scenes. 162-179 - Yiming Huang, Weilin Wan, Yue Yang, Chris Callison-Burch, Mark Yatskar, Lingjie Liu:
CoMo: Controllable Motion Generation Through Language Guided Pose Code Editing. 180-196 - Joseph Tung, Gene Chou, Ruojin Cai, Guandao Yang, Kai Zhang, Gordon Wetzstein, Bharath Hariharan, Noah Snavely:
MegaScenes: Scene-Level View Synthesis at Scale. 197-214 - Yuan Shen, Duygu Ceylan, Paul Guerrero, Zexiang Xu, Niloy J. Mitra, Shenlong Wang, Anna Frühstück:
SUPERGAUSSIAN: Repurposing Video Models for 3D Super Resolution. 215-233 - Jun-Yeong Moon, Jung Uk Kim, Gyeong-Moon Park:
Towards Model-Agnostic Dataset Condensation by Heterogeneous Models. 234-250 - Kirolos Ataallah, Xiaoqian Shen, Eslam Abdelrahman, Essam Sleiman, Mingchen Zhuge, Jian Ding, Deyao Zhu, Jürgen Schmidhuber, Mohamed Elhoseiny:
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos. 251-267 - Mihir Mahajan, Florian Hofherr, Daniel Cremers:
MeshFeat: Multi-resolution Features for Neural Fields on Meshes. 268-285 - Yi Wang, Conrad M. Albrecht, Nassim Ait Ali Braham, Chenying Liu, Zhitong Xiong, Xiao Xiang Zhu:
Decoupling Common and Unique Representations for Multimodal Self-supervised Learning. 286-303 - Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang:
MM1: Methods, Analysis and Insights from Multimodal LLM Pre-training. 304-323 - Yixiao Wang, Chen Tang, Lingfeng Sun, Simone Rossi, Yichen Xie, Chensheng Peng, Thomas Hannagan, Stefano Sabatini, Nicola Poerio, Masayoshi Tomizuka, Wei Zhan:
Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation. 324-341 - Atsuya Nakata, Takao Yamanaka:
2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction. 342-356 - Xiaoyu Zhu, Hao Zhou, Pengfei Xing, Long Zhao, Hao Xu, Junwei Liang, Alexander Hauptmann, Ting Liu, Andrew C. Gallagher:
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models. 357-375 - Bowen Fu, Gu Wang, Chenyangguang Zhang, Yan Di, Ziqin Huang, Zhiying Leng, Fabian Manhardt, Xiangyang Ji, Federico Tombari:
D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction. 376-394 - Lan Yao, Chaofeng Chen, Xiaoming Li, Zifei Yan, Wangmeng Zuo:
Combining Generative and Geometry Priors for Wide-Angle Portrait Correction. 395-411 - Yuehan Zhang, Angela Yao:
RealViformer: Investigating Attention for Real-World Video Super-Resolution. 412-428 - Yuehan Zhang, Seungjun Lee, Angela Yao:
Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution. 429-446 - Zhe Zhao, Mengshi Qi, Huadong Ma:
Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation. 447-463 - Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo:
UniFS: Universal Few-Shot Instance Perception with Point Representations. 464-483
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.