default search action
Computer Vision and Image Understanding, Volume 248
Volume 248, 2024
- Yumeng Kang, Lu Zhang, Ping Hu, Yu Liu, Huchuan Lu, You He:
Learning depth-aware decomposition for single image dehazing. 104069 - Xiaobao Yang, Xi Tian, Junsheng Wu, Xiaochun Yang, Sugang Ma, Xinman Qi, Zhiqiang Hou:
LLAFN-Generator: Learnable linear-attention with fast-normalization for large-scale image captioning. 104088 - Clément Hardy, Yvain Quéau, David Tschumperlé:
Uni MS-PS: A multi-scale encoder-decoder transformer for universal photometric stereo. 104093 - Xinyu Zhang, Hefei Huang, Xu Jia, Dong Wang, Lihe Zhang, Bolun Zheng, Wei Zhou, Huchuan Lu:
Neural image re-exposure. 104094 - Kun Gao, Haoyang Zhang, Xiaolong Liu, Xinyi Wang, Liang Xie, Bowen Ji, Ye Yan, Erwei Yin:
Challenges and solutions for vision-based hand gesture interpretation: A review. 104095 - Xiaochen Lu, Yuting Pan, Yuan Liu, Lei Zhang, Yajun Li:
Multi-dimensional attention-aided transposed ConvBiLSTM network for hyperspectral image super-resolution. 104096 - Weina Zhou, Linhui Ye:
UC-former: A multi-scale image deraining network using enhanced transformer. 104097 - Yong Xu, Shaohui Pan, Ruotao Xu, Haibin Ling:
View-aligned pixel-level feature aggregation for 3D shape classification. 104098 - Jue Wang, Yuxiang Lin, Qi Zhao, Dong Luo, Shuaibao Chen, Wei Chen, Xiaojiang Peng:
Invisible gas detection: An RGB-thermal cross attention network and a new benchmark. 104099 - Fatima Haimour, Rizik M. H. Al-Sayyed, Waleed Mahafza, Omar Sultan Al-Kadi:
Bidirectional brain image translation using transfer learning from generic pre-trained models. 104100 - Hanwei Zhang, Felipe Torres, Ronan Sicre, Yannis Avrithis, Stéphane Ayache:
Opti-CAM: Optimizing saliency maps for interpretability. 104101 - Ronny Velastegui, Maxim Tatarchenko, Sezer Karaoglu, Theo Gevers:
Image semantic segmentation of indoor scenes: A survey. 104102 - Abhijeet M. Pimpale, Kishor M. Bhurchandi:
Cascaded UNet for progressive noise residual prediction for structure-preserving video denoising. 104103 - Dandan Fan, Kaibing Zhang, Hui Li, Longgang Ren, Guang Shi:
MFCT: Multi-Frequency Cascade Transformers for no-reference SR-IQA. 104104 - Haiying Xia, Zhuolin Gong, Yumei Tan, Shuxiang Song:
Joint pyramidal perceptual attention and hierarchical consistency constraint for gaze estimation. 104105 - Hang Chen, Chufeng Tang, Xiaolin Hu:
DHS-DETR: Efficient DETRs with dynamic head switching. 104106 - Qi Wu, Sanping Zhou, Le Wang, Liushuai Shi, Yonghao Dong, Gang Hua:
End-to-end pedestrian trajectory prediction via Efficient Multi-modal Predictors. 104107 - Ziyu Zhao, Leilei Gan, Tao Shen, Kun Kuang, Fei Wu:
Deconfounded hierarchical multi-granularity classification. 104108 - Lin Chen, Jing Zhang, Yian Zhang, Junpeng Kang, Li Zhuo:
MKP-Net: Memory knowledge propagation network for point-supervised temporal action localization in livestreaming. 104109 - Liyan Wang, Qinyu Yang, Cong Wang, Wei Wang, Zhixun Su:
Coarse-to-fine mechanisms mitigate diffusion limitations on image restoration. 104118 - Iman Hosseini, Md. Zakir Hossain, Yuhao Zhang, Shafin Rahman:
Deep learning model for simultaneous recognition of quantitative and qualitative emotion using visual and bio-sensing data. 104121 - Huanlong Zhang, Mengdan Liu, Xiaohui Song, Yong Wang, Guanglu Yang, Rui Qi:
Spatial attention inference model for cascaded siamese tracking with dynamic residual update strategy. 104125 - Yaming Wang, Jiatong Chen, Xian Fang, Mingfeng Jiang, Jianhua Ma:
Dual cross perception network with texture and boundary guidance for camouflaged object detection. 104131 - Jin Liu, Yang Yang, Biyun Xu, Hao Yu, Yaozong Zhang, Qian Li, Zhenghua Huang:
RSTC: Residual Swin Transformer Cascade to approximate Taylor expansion for image denoising. 104132 - Damien Mariyanayagam, Adrien Bartoli:
The shading isophotes: Model and methods for Lambertian planes and a point light. 104135 - Hongbo Bi, Yuyu Tong, Pan Zhang, Jiayuan Zhang, Cong Zhang:
Dual cross-enhancement network for highly accurate dichotomous image segmentation. 104122 - Peng Zhang, Xinlei Zhao, Lijia Dong, Weimin Lei, Wei Zhang, Zhaonan Lin:
A framework for detecting fighting behavior based on key points of human skeletal posture. 104123 - Qiang Zhang, Hongyuan Guo, Guanghe Li, Tianlu Zhang, Qiang Jiao:
Deep unsupervised shadow detection with curriculum learning and self-training. 104124 - Qian Ye, Masanori Suganuma, Takayuki Okatani:
Improved high dynamic range imaging using multi-scale feature flows balanced between task-orientedness and accuracy. 104126 - Kejun Wu, Zhenxing Li, You Yang, Qiong Liu:
Deep video compression based on Long-range Temporal Context Learning. 104127 - Yinan Wang, Sansitha Panchadsaram, Rezvan Sherkati, James J. Clark:
An egocentric video and eye-tracking dataset for visual search in convenience stores. 104129 - Zhichao Cui, Zeqi Chen, Chi Zhang, Gaofeng Meng, Yuehu Liu, Xiangmo Zhao:
DDGPnP: Differential degree graph based PnP solution to handle outliers. 104130 - Yujia Wang, Hua Huang:
Audio-visual deepfake detection using articulatory representation learning. 104133 - Quanwei Yang, Lingyun Yu, Fengyuan Liu, Yun Song, Meng Shao, Guoqing Jin, Hongtao Xie:
Symmetrical Siamese Network for pose-guided person synthesis. 104134 - Qiuxia Wu, Kunming Su:
URINet: Unsupervised point cloud rotation invariant representation learning via semantic and structural reasoning. 104136 - Yingda Lyu, Zhehao Liu, Yingxin Zhang, Haipeng Chen, Zhimin Xu:
CRML-Net: Cross-Modal Reasoning and Multi-Task Learning Network for tooth image segmentation. 104138 - Simona Tiribelli, Benedetta Giovanola, Rocco Pietrini, Emanuele Frontoni, Marina Paolanti:
Embedding AI ethics into the design and use of computer vision technology for consumer's behaviour understanding. 104142
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.