ICME 2024: Niagara Falls, ON, Canada - Workshops
- IEEE International Conference on Multimedia and Expo, ICME 2024 - Workshops, Niagara Falls, ON, Canada, July 15-19, 2024. IEEE 2024, ISBN 979-8-3503-7981-5
- Jiaying Lin, Jiajun Wen, Mengyuan Liu, L. Yue, Jinfu Liu, Baiqiao Yin:
SFMVIT: Slowfast Meet VIT in Chaotic World. 1-6
- Pantid Chantangphol, Sattaya Singkul, Thanawat Lodkaew, Nattasit Maharattamalai, Atthakorn Petchsod, Theerat Sakdejayont, Tawunrat Chalothorn:
An Enhanced Multimodal Negative Feedback Detection Framework with Target Retrieval in Thai Spoken Audio. 1-7
- Liman Wang, Hanyang Zhong:
LLM-SAP: Large Language Models Situational Awareness-Based Planning. 1-6
- Duc-Quang Vu, Ngan Le, Jia-Ching Wang:
Self-Supervised Learning via Multi-Transformation Classification for Action Recognition. 1-6
- Olgierd Stankiewicz, Tomasz Grajek, Slawomir Mackowiak, Jakub Stankowski, Slawomir Rózek, Mateusz Lorkiewicz, Maciej Wawrzyniak, Marek Domanski:
Region-of-Interest-Based Video Coding for Machines. 1-6
- Hung-Min Hsu, Zhongwei Cheng, Xinyu Yuan, Lin Chen:
Learning to Learn Multiview Detection by Camera-Aware Attention. 1-4
- Tianrun Chen, Runlong Cao, Ankang Lu, Tao Xu, Xiaoling Zhang, Papa Mao, Min Zhang, Lingyun Sun, Ying Zang:
High-Fidelity 3D Model Generation with Relightable Appearance from Single Freehand Sketches and Text Guidance. 1-6
- Jiaying Fu, Tianyue Gong, Jialin Gu, Tiange Zhou:
"AI Life" and Human Fear: from Phenomenological Insights to Digital Creation. 1-6
- Muyang Yi, Zhaozhi Xie, Yuwen Yang, Chang Liu, Yue Ding, Hongtao Lu:
Decoupling Classification and Localization of CLIP. 1-6
- Huiwen Ren, Zhao Wang, Jiexi Wang, H. Yuwen, M. Siwei, Li Zhang, Wen Gao:
Rate Control Optimizing Model for Constraining Over-Saturated Live Streaming Quality. 1-6
- Luntian Mou, Yihan Sun, Yunhan Tian, Ruichen He, Feng Gao, Zijin Li, Ramesh C. Jain:
MemoMusic 4.0: Personalized Emotion Music Generation Conditioned by Valence and Arousal as Virtual Tokens. 1-6
- Nazia Hossain, M. Manzur Murshed, Mohammad Awrangjeb, Singarayer Florentine, Marc Irvin, Shyh Wei Teng:
Automatic Malleefowl Mound Detection Using LiDAR-based Ground and Habitat Features with Planar Terrain Modelling. 1-6
- Vinay Kashyap, Nilesh A. Ahuja, Omesh Tickoo:
Compression without Compromise: Optimizing Point Cloud Object Detection with Bottleneck Architectures For Split Computing. 1-6
- Shifu Xiong, Li-Rong Dai:
Exploring Semi-Supervised, Subcategory Classification and Subwords Alignment for Visual Wake Word Spotting. 1-6
- Shangbo Mao, Dongyun Lin, Aiyuan Guo, Yiqun Li:
Partclip: How Does Clip Assist Mechanical Part Image Retrieval? 1-5
- Hao Ni, Yuke Li, Ping Lai, Pengpeng Zeng, Hangyu Guo, Lianli Gao:
Attribute Vision Transformer for UAV-Human Re-Identification. 1-6
- Chengpeng Xiong, Zhengxuan Chen, Nuoer Long, Kin-Seong Un, Zhuolin Li, Shaobin Chen, Tao Tan, Chan-Tong Lam, Yue Sun:
Enhancing Video Grounding with Dual-Path Modality Fusion on Animal Kingdom Datasets. 1-6
- Kian Eng Ong, Sivaji Retta, Ramarajulu Srinivasan, Shawn Tan, Jun Liu:
MTYOLO: A Multi-Task Model to Concurrently Obtain the Vital Characteristics of Individuals or Animals. 1-4
- Chen-Yue Zhang, Hang Chen, Jun Du, Sabato Marco Siniscalchi, Ya Jiang, Chin-Hui Lee:
Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge. 1-6
- Haoxu Wang, Cancan Li, Fei Su, Juan Liu, Hongbin Suo, Ming Li:
The Whu Wake Word Lipreading System for the 2024 Chat-Scenario Chinese Lipreading Challenge. 1-6
- Pu Ching, Wen-Cheng Chen, Min-Chun Hu:
I3FNET: Instance-Aware Feature Fusion for Few-Shot Point Cloud Generation from Single Image. 1-6
- Shuo Wang, L. Xiaobing, Qingwen Zhou, Yun Tie, Yan Gao, Xinran Zhang:
Intelligent Music Chord Recognition and Evaluation Based on Convolution and Attention. 1-6
- Yeming Li, Junrong Song, David Kei-Man Yip:
AI-Assisted Content Creation of Naked-Eye 3D Effects on Curved LED Screen: Enhancing Artistic Expression and Creativity. 1-5
- Michael Neri, Marco Carli:
Semi-Supervised Acoustic Scene Classification Under Domain Shift Using an Attention Module and Angular Loss. 1-6
- Yixiong Liu, Qihua Chen, Xuejin Chen:
Neuproofreader: An Interactive Proofreading System with Suggestive Prompts for Connectomics. 1-2
- Ji-Jia Wu:
Efficient Facial Landmark Detection for Embedded Systems. 1-6
- Lian Chen, Zehai Niu, Qingyuan Liu, Jinbao Wang, Jian Xue, Ke Lu:
Anatomically-Informed Vector Quantization Variational Auto-Encoder for Text to Motion Generation. 1-6
- Manuel Ladron de Guevara, Matt Fisher, Aaron Hertzmann:
Segmentation-Based Parametric Painting. 1-6
- Yuchen Wang, Ruimin Lyu:
Characteristics of Visual Complexity: Calligraphic Fonts vs. Printed Fonts. 1-6
- Jian Ding, Linze Li, L. Rongchang, W. Cong, X. Tianyang, Xiaojun Wu:
Robust Person Re-Identification Approach with Deep Learning and Optimized Feature Extraction. 1-6
- Zixuan Tang, Youjun Zhao, Yuhang Wen, Mengyuan Liu:
A Survey on Backbones for Deep Video Action Recognition. 1-6
- Xinrui Shan, Kejun Zhang, Lyukesheng Shen, Bolin Wang:
The WuShu Database for Cursive Script Character and Style Recognition. 1-6
- Chia-Chun Yen, Show-Po Guo, Tsì-Uí Ik:
Assistant Referee System in Da-Qiang(Pike) Competition. 1-8
- Zhikai Liu, Zhidao Zhou, Fan Liang, Wei Sun:
Lightweight Texture-Guided Fast Partition Method for Luma and Chroma Intra Coding in VVC. 1-6
- L. Yuanhang, Qi Mao, Libiao Jin:
Beyond Aligned Target Face: StyleGAN-Based Face-Swapping via Inverted Identity Learning. 1-6
- Yundi Zhang, Xin Wang, Ziyi Zhang, Xueying Wang, Xiaohan Ma, Yingying Wu, Han-Wu-Shuang Baao, Xiyang Zhang:
Using Large Language Models to Understand Leadership Perception and Expectation. 1-7
- Tsung-Han Tsai, Chun-Yu Chen:
An SoC Based Hardware Accelerator for Blind Assistive System. 1-2
- Xuandong Huang, Shangfei Wang, Jinghao Yan, Kai Tang, Pengfei Hu:
Enhancing Visual Wake Word Spotting with Pretrained Model and Feature Balance Scaling. 1-6
- Yu-Chen Sun, Jie Dong, Ahmed Fouad, Jian Zhou, Roger Zhou, Shyam Sadhwani:
Low-Complexity Video PSNR Measurement in Real-Time Communication Products. 1-4
- Peilin Xiao, Yueyi Zhang, Dachun Kai, Yansong Peng, Zheyu Zhang, Xiaoyan Sun:
A Micro-Expression Recognition System with Event Cameras. 1-2
- He Wang, Pengcheng Guo, Xucheng Wan, Huan Zhou, Lei Xie:
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder. 1-6
- Qinhua Xie, Weicong Liu, Fan Yuan, Jifan Shi, Ziyu Liu, Yanbing Zhang:
VidBot: Intelligent Video Learning Tool for Content Mining and Playback Traffic Statistics. 1-3
- Yuan Zhang, Hanming Wang, Yunlong Li, Lu Yu:
Afc: Asymmetrical Feature Coding for Multi-Task Machine Intelligence. 1-6
- Qing Wang, Guirui Zhong, Hengyi Hong, Lei Wang, Mingqi Cai, Xin Fang, Ya Jiang, Jun Du:
The NERCSLIP-USTC System for Semi-Supervised Acoustic Scene Classification of ICME 2024 Grand Challenge. 1-4
- Yilin Guo, Ruoke Yan, Yaqiang Wu, Siwei Ma:
Styleself: Style-Controllable High-Fidelity Conversational Virtual Avatars Generation. 1-6
- Jinfu Liu, Baiqiao Yin, Jiaying Lin, Jiajun Wen, Yue Li, Mengyuan Liu:
HDBN: A Novel Hybrid Dual-Branch Network for Robust Skeleton-Based Action Recognition. 1-6
- Ayse B. Demir, Mervegul Parlak, Zafer Gurel, Deniz Ugur, Ali C. Begen:
Impact of Prioritized HTTP/3 Transport on Low-Latency Live Streaming. 1-6
- Haopeng Lu, Wenkang Shan, Yuhuai Zhang, Li Song, Xinfeng Zhang, Siwei Ma, Liuxin Zhang, Wen Gao:
LFCAVE: Interactive 3D Space with Multiple Light Field Displays. 1-2
- Tsung-Han Tsai, Chun-Lin Lee:
Equipped with Monocular Depth Estimation and Intelligent Wake-Up Vision Based Tracking System for a Human-Following Mobile Robot. 1-2
- Han Wang, Xinning Chai, Yiwen Wang, YuHong Zhang, Rong Xie, Li Song:
Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior. 1-6
- Ziming He, Xiaomin Zou, Pengfei Wu, Ling Fan, Xiaomei Li:
Creating and Experiencing 3D Immersion Using Generative 2D Diffusion: An Integrated Framework. 1-6
- Xinda Wu, Jiaming Wang, Jiaxing Yu, Tieyao Zhang, Kejun Zhang:
Popular Hooks: A Multimodal Dataset of Musical Hooks for Music Understanding and Generation. 1-6
- Wenkang Shan, Haopeng Lu, Chuanmin Jia, Xinfeng Zhang, Siwei Ma, Yaqiang Wu, Wen Gao:
Real-Time Human Motion Transfer System for Holographic Displays. 1-2
- Jason Gerard, David C. Bonilla, Abdelhak Bentaleb, Sandra Céspedes:
Optimizing Quality and Energy Efficiency in Webrtc with ML-Powered Adaptive FEC. 1-6
- Hao Ni, Ping Lai, Yuke Li, Pengpeng Zeng, Haonan Zhang, Jingkuan Song:
Pedestrian Attributes Recognition for UAV-Human. 1-5
- Shuo Chen, Wu Liu, Binbin Yan, Xinzhu Sang, Alicia Li, Xiangcheng Yi:
Blender-NeRF: A Monocular Dynamic Human Body Explicit Reconstruction and Rendering Method. 1-6
- Wen Huang, Anbai Jiang, Bing Han, Xinhu Zheng, Yihong Qiu, Wenxi Chen, Yuzhe Liang, Pingyi Fan, Wei-Qiang Zhang, Cheng Lu, Xie Chen, Jia Liu, Yanmin Qian:
Semi-Supervised Acoustic Scene Classification with Test-Time Adaptation. 1-5
- Yanjun Wang, Wenjia Wang, Jun Ling, Rong Xie, Li Song:
Visibility-Aware Human Mesh Recovery via Balancing Dense Correspondence and Probability Model. 1-6
- Xin Jin, Jinyu Wang, Wenbo Yuan, B. Yihang, Heng Huang, Yiran Zhang, Bao Peng, X. Peng, Xin Song, Hanbing Yang:
Aesthetic Assessment of Movie Still Frame for Various Field of Views. 1-6
- Linze Li, Youwei Zhou, Jiannan Hu, Cong Wu, Tianyang Xu, Xiaojun Wu:
A Hybrid Multi-Perspective Complementary Model for Human Skeleton-Based Action Recognition. 1-6
- Rui Li, Yifan Wei, Haopeng Lu, Siwei Ma, Zhenyu Liu, Hui Liu, Qianying Wang, Yaqiang Wu, Jianrong Tan:
Chinese Ancient Painting Figure Face Restoration and its Application in a Q&A Interaction System. 1-6
- Jiahe Liu, Dandan Zhu, Sajid Javed:
Visual-Language Alignment for Background Subtraction. 1-7
- Keke Chen, Zhewei Tu, Xiangbo Shu:
Leveraging Multimodal Knowledge for Spatio-Temporal Action Localization. 1-5
- Yuyang Wu, Liang Xie, Shangkun Sun, Wei Gao, Yiqiang Yan:
Adaptive Intra Period Size for Deep Learning-Based Screen Content Video Coding. 1-6
- Zicheng Zhang, Haoning Wu, Zhongpeng Ji, Chunyi Li, Erli Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Fengyu Sun, Shangling Jui, Weisi Lin, Guangtao Zhai:
Q-Boost: On Visual Quality Assessment Ability of Low-Level Multi-Modality Foundation Models. 1-6
- An Yu, Jeremy Varghese, Ferhat Demirkiran, Peter Buonaiuto, Xin Li, Ming-Ching Chang:
Dual-Phase Msqnet for Species-Specific Animal Activity Recognition. 1-6
- Yu-Hsi Chen, I-Hsuan Tai:
Optimizing Facial Landmark Estimation for Embedded Systems Through Iterative Autolabeling and Model Pruning. 1-6
- Yu-Shu Ni, Han-Chun Chen, Chia-Chi Tsai, Chih-Cheng Chen, Po-Yu Chen, Hsien-Kai Kuo, Jun-Ying Hunag, Po-Chi Hu, Jenq-Neng Hwang, Jiun-In Guo:
Summary of the 2024 Low-Power Efficient and Accurate Facial-Landmark Detection for Embedded Systems. 1-6
- Mingjie Wang, Song Yuan, Zhuohang Li, Longlong Zhu, Eric Buys, Minglun Gong:
Language-Guided Zero-Shot Object Counting. 1-6
- Yongpeng Yan, Wuyang Liu, Yi Chai, Yanzhen Ren:
Semi-Supervised Acoustic Scene Classification under Domain Shift with MixMatch and Information Bottleneck Optimization. 1-4
- Yuewei Zhang, Huanbin Zou, Jie Zhu:
An Intra- and Inter-Frame Sequence Model with Discrete Cosine Transform for Streaming Speech Enhancement. 1-4
- Genshun Wan, Zhongfu Ye:
Multi-Modal Knowledge Transfer for Target Speaker Lipreading with Improved Audio-Visual Pretraining and Cross-Lingual Fine-Tuning. 1-6
- Yuan Ouyang, Ping Wang, Lijun He, Fan Li:
An End-to-End Channel-Adaptive Feature Compression Approach in Device-Edge Co-Inference Systems. 1-6
- Nuoer Long, Kin-Seong Un, Chengpeng Xiong, Zhuolin Li, Shaobin Chen, Tao Tan, Chan-Tong Lam, Yue Sun:
A Multimodal Behavior Recognition Network with Interconnected Architectures. 1-6
- Shaofan Sun, Jiahang Zhang, Guo Tang, Chuanmin Jia, Jiaying Liu:
Learning Discriminative and Robust Representations for UAV-View Skeleton-Based Action Recognition. 1-6
- X. Yue, Kaizhi Yang, Kai Cheng, Jiebo Luo, Xuejin Chen:
Dual Attribute-Spatial Relation Alignment for 3D Visual Grounding. 1-6
- Austin Kaburia Kibaara, Joan Kabura, Antony Gitau, Ciira Maina:
AJA-Pose: A Framework for Animal Pose Estimation Based on VHR Network Architecture. 1-6
- Yifan Wei, Wenkang Shan, Qi Zhang, Liuxin Zhang, Jian Zhang, Siwei Ma:
Real-Time Interaction with Animated Human Figures in Chinese Ancient Paintings. 1-6
- Dristi Datta, Manoranjan Paul, M. Manzur Murshed, Shyh Wei Teng, Leigh M. Schmidtke:
Unveiling Soil-Vegetation Interactions: Reflection Relationships and an Attention-Based Deep Learning Approach for Carbon Estimation. 1-6
- Zesen Wu, Mang Ye, Shuoyi Chen, Bo Du:
Attribute-Aware Network for Pedestrian Attribute Recognition. 1-6
- Yue Li, Baiqiao Yin, Jinfu Liu, Jiajun Wen, Jiaying Lin, Mengyuan Liu:
SEMIPL: A Semi-Supervised Method for Event Sound Source Localization. 1-6
- Anderson de Andrade, Ivan V. Bajic:
Towards Task-Compatible Compressible Representations. 1-6
- Saeed Ranjbar Alvar, Ivan V. Bajic:
Compressive Feature Selection for Remote Visual Multi-Task Inference. 1-6
- Yuzhe Liang, Wenxi Chen, Anbai Jiang, Yihong Qiu, Xinhu Zheng, Wen Huang, Bing Han, Yanmin Qian, Pingyi Fan, Wei-Qiang Zhang, L. Cheng, Jia Liu, Xie Chen:
Improving Acoustic Scene Classification via Self-Supervised and Semi-Supervised Learning with Efficient Audio Transformer. 1-6
- Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liu, Rong Wei:
3DMIT: 3D Multi-Modal Instruction Tuning for Scene Understanding. 1-5
- Hao Liu, Lijun He, Jiaxi Liang:
Joint Modal Circular Complementary Attention for Multimodal Aspect-Based Sentiment Analysis. 1-6
- Anastasia Henkel, Benjamin Bross, Jens Brandenburg, Adam Wieckowski, Detlev Marpe, Andoni Morales, Sergio Sanchez:
Optimizing an Open VVC Encoder for Low Delay Remote Desktop Applications. 1-6
- Zhuokai Zhao, Harish Palani, Tianyi Liu, Lena Evans, Ruth Toner:
Multimodal Guidance Network for Missing-Modality Inference in Content Moderation. 1-4
- Jiyong Rao, Tianyang Xu, Xiaoning Song, Zhenhua Feng, Xiaojun Wu:
Body-Part Guided Animal Pose Estimation. 1-6