default search action
25th CLUSTER 2023: Santa Fe, NM, USA
- IEEE International Conference on Cluster Computing, CLUSTER 2023, Santa Fe, NM, USA, October 31 - Nov. 3, 2023. IEEE 2023, ISBN 979-8-3503-0792-4
- Sahil Tyagi, Martin Swany:
Accelerating Distributed ML Training via Selective Synchronization. 1-12 - Kevin Assogba, Eduardo Lima, M. Mustafa Rafique, Minseok Kwon:
PredictDDL: Reusable Workload Performance Prediction for Distributed Deep Learning. 13-24 - Frank Wanye, Vitaliy Gleyzer, Edward K. Kao, Wu-Chun Feng:
Exact Distributed Stochastic Block Partitioning. 25-36 - Taehoon Kim, Kwangwon Koh, Changdae Kim, Eunji Pak, Yeonjeong Jeong, Sang-Hoon Kim:
DEHype: Retrofitting Hypervisors for a Resource-Disaggregated Environment. 37-48 - Xinying Wang, Lipeng Wan, Scott Klasky, Dongfang Zhao, Feng Yan:
SciLance: Mitigate Load Imbalance for Parallel Scientific Applications in Cloud Environments. 49-59 - Michael Wilkins, Hanming Wang, Peizhi Liu, Bangyen Pham, Yanfei Guo, Rajeev Thakur, Peter A. Dinda, Nikos Hardavellas:
Generalized Collective Algorithms for the Exascale Era. 60-71 - Melvin Chelli, Cèdric Prigent, René Schubotz, Alexandru Costan, Gabriel Antoniu, Loïc Cudennec, Philipp Slusallek:
FedGuard: Selective Parameter Aggregation for Poisoning Attack Mitigation in Federated Learning. 72-81 - Wei Wang, Zhiquan Lai, Shengwei Li, Weijie Liu, Keshi Ge, Yujie Liu, Ao Shen, Dongsheng Li:
Prophet: Fine-grained Load Balancing for Parallel Training of Large-scale MoE Models. 82-94 - Turja Kundu, Tong Shu:
HIOS: Hierarchical Inter-Operator Scheduler for Real-Time Inference of DAG-Structured Deep Learning Models on Multiple GPUs. 95-106 - Yuzuo Zhang, Xinyuan Tu, Lin Wang, Yuchong Hu, Fang Wang, Ye Wang:
FullRepair: Towards Optimal Repair Pipelining in Erasure-Coded Clustered Storage Systems. 107-117 - Krijn Doekemeijer, Nick Tehrany, Balakrishnan Chandrasekaran, Matias Bjørling, Animesh Trivedi:
Performance Characterization of NVMe Flash Devices with Zoned Namespaces (ZNS). 118-131 - Inhyuk Park, Qing Zheng, Dominic Manno, Soonyeal Yang, Jason Lee, David Bonnie, Bradley W. Settlemyer, Youngjae Kim, Woosuk Chung, Gary Grider:
KV-CSD: A Hardware-Accelerated Key-Value Store for Data-Intensive Applications. 132-144 - Xingguo Jia, Xingzi Yu, Yun Wang, Senhao Yu, Zhengwei Qi:
Rethinking Virtual Machines Live Migration for Memory Disaggregation. 145-157 - George Michelogiannakis, Yehia Arafa, Brandon Cook, Liang Yuan Dai, Abdel-Hameed A. Badawy, Madeleine Glick, Yuyang Wang, Keren Bergman, John Shalf:
Efficient Intra-Rack Resource Disaggregation for HPC Using Co-Packaged DWDM Photonics. 158-172 - Hongliang Li, Hairui Zhao, Zhewen Xu, Xiang Li, Haixiao Xu:
ExplSched: Maximizing Deep Learning Cluster Efficiency for Exploratory Jobs. 173-184 - Urvij Saroliya, Eishi Arima, Dai Liu, Martin Schulz:
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach. 185-196 - Yihao Sun, Sidharth Kumar, Thomas Gilray, Kristopher K. Micinski:
Communication-Avoiding Recursive Aggregation. 197-208 - Wenxuan Li, Helin Cheng, Zhengyang Lu, Yuechen Lu, Weifeng Liu:
HASpMV: Heterogeneity-Aware Sparse Matrix-Vector Multiplication on Modern Asymmetric Multicore Processors. 209-220 - Daniel Rosendo, Marta Mattoso, Alexandru Costan, Renan Souza, Débora B. Pina, Patrick Valduriez, Gabriel Antoniu:
ProvLight: Efficient Workflow Provenance Capture on the Edge-to-Cloud Continuum. 221-233 - Zhangyu Liu, Cheng Zhang, Huijun Wu, Jianbin Fang, Lin Peng, Guixin Ye, Zhanyong Tang:
Optimizing HPC I/O Performance with Regression Analysis and Ensemble Learning. 234-246 - Arkaprabha Ganguli, Robert Underwood, Julie Bessac, David Krasowska, Jon C. Calhoun, Sheng Di, Franck Cappello:
A Lightweight, Effective Compressibility Estimation Method for Error-bounded Lossy Compression. 247-258 - Yiltan Hassan Temuçin, Scott Levy, Whit Schonbein, Ryan E. Grant, Ahmad Afsahi:
A Dynamic Network-Native MPI Partitioned Aggregation Over InfiniBand Verbs. 259-270 - Yiwen Zhang, Guokuan Li, Jiguang Wan, Junyue Wang, Jun Li, Ting Yao, Huatao Wu, Daohui Wang:
DoW-KV: A DPU-offloaded and Write-optimized Key-Value Store on Disaggregated Persistent Memory. 271-283 - Jesper Larsson Träff, Sascha Hunold, Ioannis Vardas, Nikolaus Manes Funk:
Uniform Algorithms for Reduce-scatter and (most) other Collectives for MPI. 284-294 - Wenhai Lin, Jingchang Qin, Yiquan Chen, Zhen Jin, Jiexiong Xu, Yuzhong Zhang, Shishun Cai, Lirong Fu, Yi Chen, Wenzhi Chen:
JACO: JAva Code Layout Optimizer Enabling Continuous Optimization without Pausing Application Services. 295-306 - Zhenyu Xu, Miaoxiang Yu, Jillian Cai, Qing Yang, Tao Wei:
A Finite-Difference Time-Domain (FDTD) solver with linearly scalable performance in an FPGA cluster. 307-317 - Hengquan Mei, Huaizhi Qu, Jingwei Sun, Yanjie Gao, Haoxiang Lin, Guangzhong Sun:
GPU Occupancy Prediction of Deep Learning Models Using Graph Neural Network. 318-329 - Qinglei Cao, Sameh Abdulah, Hatem Ltaief, Marc G. Genton, David E. Keyes, George Bosilca:
Reducing Data Motion and Energy Consumption of Geospatial Modeling Applications Using Automated Precision Conversion. 330-342 - Zixuan Chen, Zhigao Zhao, Zijian Li, Jiang Shao, Sen Liu, Yang Xu:
SDT: A Low-cost and Topology-reconfigurable Testbed for Network Research. 343-353 - Jiajun Huang, Kaiming Ouyang, Yujia Zhai, Jinyang Liu, Min Si, Ken Raffenetti, Hui Zhou, Atsushi Hori, Zizhong Chen, Yanfei Guo, Rajeev Thakur:
PiP-MColl: Process-in-Process-based Multi-object MPI Collectives. 354-364 - Olamide Timothy Tawose, Lei Yang, Dongfang Zhao:
TopoCommit: A Topological Commit Protocol for Cross-Ledger Transactions in Scientific Computing. 365-375
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.