default search action
IISWC 2024: Vancouver, BC, Canada
- IEEE International Symposium on Workload Characterization, IISWC 2024, Vancouver, BC, Canada, September 15-17, 2024. IEEE 2024, ISBN 979-8-3503-5603-8
- Junrui Pan, Timothy G. Rogers:
CRISP: Concurrent Rendering and Compute Simulation Platform for GPUs. 1-14 - Jaehong Cho, Minsu Kim, Hyunmin Choi, Guseul Heo, Jongse Park:
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale. 15-29 - Rajveer Bachkaniwala, Harshith Lanka, Kexin Rong, Ada Gavrilovska:
Lotus: Characterization of Machine Learning Preprocessing Pipelines via Framework and Hardware Profiling. 30-43 - Seung Hun Choi, Myung Jae Chung, Young Geun Kim, Sung Woo Chung:
Mediator: Characterizing and Optimizing Multi-DNN Inference for Energy Efficient Edge Intelligence. 44-56 - Joyjit Kundu, Wenzhe Guo, Ali BanaGozar, Udari De Alwis, Sourav Sengupta, Puneet Gupta, Arindam Mallik:
Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference. 57-67 - José Morgado, Leonel Sousa, Aleksandar Ilic:
CARM Tool: Cache-Aware Roofline Model Automatic Benchmarking and Application Analysis. 68-81 - Viyom Mittal, Pedro Bruel, Michalis Faloutsos, Dejan S. Milojicic, Eitan Frachtenberg:
SHARP: A Distribution-Based Framework for Reproducible Performance Evaluation. 82-93 - Georgia Antoniou, Haris Volos, Yiannakis Sazeides:
Taming Performance Variability caused by Client-Side Hardware Configuration. 94-107 - Xinquan Lin, Haobo Xu, Yinhe Han, Yiming Gan:
HEX-SIM: Evaluating Multi-modal Large Language Models on Multi-chiplet NPUs. 108-120 - Shmeelok Chakraborty, Yuewen Hou, Ang Chen, Gokul Subramanian Ravi:
Empowering the Quantum Cloud User with QRIO. 121-131 - Tersiteab Adem, Andrew McCrabb, Vidushi Goyal, Valeria Bertacco:
Evergreen: Comprehensive Carbon Model for Performance-Emission Tradeoffs. 132-143 - Saichand Samudrala, Jiawen Wu, Chen Chen, Haoxuan Shan, Jonathan Ku, Yiran Chen, Jeyavijayan Rajendran:
Performance Analysis of Zero-Knowledge Proofs. 144-155 - Alexander Hankin, Abdulrahman Mahmoud, Mark Hempstead, David Brooks, Gu-Yeon Wei:
VelociTI: An Architecture-level Performance Modeling Framework for Trapped Ion Quantum Computers. 156-168 - Seonjin Na, Geonhwa Jeong, Byung Hoon Ahn, Jeffrey Young, Tushar Krishna, Hyesoon Kim:
Understanding Performance Implications of LLM Inference on CPUs. 169-180 - Cheng Chen, Christina Giannoula, Andreas Moshovos:
Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models. 181-193 - Chakshu Moar, Faraz Tahmasebi, Michael Pellauer, Hyoukjun Kwon:
Characterizing the Accuracy-Efficiency Trade-off of Low-rank Decomposition in Language Models. 194-209 - Yuchen Xia, Jiho Kim, Yuhan Chen, Haojie Ye, Souvik Kundu, Cong Callie Hao, Nishil Talati:
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning. 210-223 - Kailash Gogineni, Yongsheng Mei, Karthikeya Gogineni, Peng Wei, Tian Lan, Guru Venkataramani:
Characterizing and Optimizing the End-to-End Performance of Multi-Agent Reinforcement Learning Systems. 224-235 - Nick Lindsay, Abhishek Bhattacharjee:
Understanding Address Translation Scaling Behaviours Using Hardware Performance Counters. 236-246 - Farzana Ahmed Siddique, Deyuan Guo, Zhenxing Fan, MohammadHosein Gholamrezaei, Morteza Baradaran, Alif Ahmed, Hugo Abbot, Kyle Durrer, Kumaresh Nandagopal, Ethan Ermovick, Khyati Kiyawat, Beenish Gul, Abdullah Mughrabi, Ashish Venkat, Kevin Skadron:
Architectural Modeling and Benchmarking for Digital DRAM PIM. 247-261 - K. P. Arun, Debadatta Mishra:
Kindle: A Comprehensive Framework for Exploring OS-Architecture Interplay in Hybrid Memory Systems. 262-272 - Anoop Mysore Nataraja, Ricardo Fernández Pascual, Alberto Ros:
Enhanced System-Level Coherence for Heterogeneous Unified Memory Architectures. 273-283 - Michael Wu, Sibren Isaacman, Abhishek Bhattacharjee:
Characterizing Emerging Page Replacement Policies for Memory-Intensive Applications. 284-294 - Brandon Alexander Burtchell, Martin Burtscher:
Characterizing CUDA and OpenMP Synchronization Primitives. 295-308 - Demirhan Sevim, Baturalp Bilgin, Ismail Akturk:
Evaluating Performance and Energy Efficiency of Parallel Programming Models in Heterogeneous Computing Systems. 309-319 - Yiqian Liu, Avery Vanausdal, Martin Burtscher:
Performance Impact of Removing Data Races from GPU Graph Analytics Programs. 320-331
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.