34th IPDPS 2020: New Orleans, LA, USA
- 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, LA, USA, May 18-22, 2020. IEEE 2020, ISBN 978-1-7281-6876-0
- Mark Clark, Yingping Chen, Avinash Karanth, Dongsheng Brian Ma, Ahmed Louri:
DozzNoC: Reducing Static and Dynamic Energy in NoCs with Low-latency Voltage Regulators using Machine Learning. 1-11
- Vidushi Goyal, Xiaowei Wang, Valeria Bertacco, Reetuparna Das:
Neksus: An Interconnect for Heterogeneous System-In-Package Architectures. 12-21
- Yunfan Li, Lizhong Chen:
Accelerated Reply Injection for Removing NoC Bottleneck in GPGPUs. 22-31
- Jahanzeb Maqbool Hashmi, Shulei Xu, Bharath Ramesh, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures. 32-41
- Zhirong Shen, Jiwu Shu, Zhijie Huang, Yingxun Fu:
ClusterSR: Cluster-Aware Scattered Repair in Erasure-Coded Storage. 42-51
- Jay F. Lofstead, John Mitchell, Enze Chen:
Stitch It Up: Using Progressive Data Storage to Scale Science. 52-61
- Hariharan Devarajan, Anthony Kougkas, Xian-He Sun:
HFetch: Hierarchical Data Prefetching for Scientific Workflows in Multi-Tiered Storage Environments. 62-72
- Michael R. Wyatt II, Stephen Herbein, Kathleen Shoga, Todd Gamblin, Michela Taufer:
CanarIO: Sounding the Alarm on IO-Related Performance Degradation. 73-83
- Vishwesh Jatala, Roshan Dathathri, Gurbinder Gill, Loc Hoang, V. Krishna Nandivada, Keshav Pingali:
A Study of Graph Analytics for Massive Datasets on Distributed Multi-GPUs. 84-94
- Hang Cao, Liang Yuan, He Zhang, Baodong Wu, Shigang Li, Pengqi Lu, Yunquan Zhang, Yongjun Xu, Minghua Zhang:
A Highly Efficient Dynamical Core of Atmospheric General Circulation Model based on Leap-Format. 95-104
- Sian Jin, Pascal Grosset, Christopher M. Biwer, Jesus Pulido, Jiannan Tian, Dingwen Tao, James P. Ahrens:
Understanding GPU-Based Lossy Compression for Extreme-Scale Cosmological Simulations. 105-115
- Oguz Selvitopi, Md Taufique Hussain, Ariful Azad, Aydin Buluç:
Optimizing High Performance Markov Clustering for Pre-Exascale Architectures. 116-126
- Yukun Cheng, Xiaotie Deng, Yuhao Li:
Tightening Up the Incentive Ratio for Resource Sharing Over the Rings. 127-136
- Timo Bingmann, Peter Sanders, Matthias Schimek:
Communication-Efficient String Sorting. 137-147
- Tianchen Ding, Shiyou Qian, Jian Cao, Guangtao Xue, Minglu Li:
SCSL: Optimizing Matching Algorithms to Improve Real-time for Content-based Pub/Sub Systems. 148-157
- John Augustine, Keerti Choudhary, Avi Cohen, David Peleg, Sumathi Sivasubramaniam, Suman Sourav:
Distributed Graph Realizations †. 158-167
- Sang Wook Stephen Do, Michel Dubois:
Transaction-Based Core Reliability. 168-179
- Seung-Hwan Lim, Ross G. Miller, Sudharshan S. Vazhkudai:
Understanding the Interplay between Hardware Errors and User Job Characteristics on the Titan Supercomputer. 180-190
- Han Qiu, Chentao Wu, Jie Li, Minyi Guo, Tong Liu, Xubin He, Yuanyuan Dong, Yafei Zhao:
EC-Fusion: An Efficient Hybrid Erasure Coding Framework to Improve Both Application and Recovery Performance in Cloud Storage Systems. 191-201
- Tang Liu, Baijun Wu, Wenzheng Xu, Xianbo Cao, Jian Peng, Hongyi Wu:
Learning an Effective Charging Scheme for Mobile Devices. 202-211
- Cong Wang, Xin Wei, Pengzhan Zhou:
Optimize Scheduling of Federated Learning on Battery-powered Mobile Devices. 212-221
- Evangelos Georganas, Kunal Banerjee, Dhiraj D. Kalamkar, Sasikanth Avancha, Anand Venkat, Michael J. Anderson, Greg Henry, Hans Pabst, Alexander Heinecke:
Harnessing Deep Learning via a Single Building Block. 222-233
- Yufeng Zhan, Peng Li, Song Guo:
Experience-Driven Computational Resource Allocation of Federated Learning by Deep Reinforcement Learning. 234-243
- Jiepeng Zhang, Jingwei Sun, Wenju Zhou, Guangzhong Sun:
An Active Learning Method for Empirical Modeling in Performance Tuning. 244-253
- Bin Dong, Verónica Rodríguez Tribaldos, Xin Xing, Suren Byna, Jonathan Ajo-Franklin, Kesheng Wu:
DASSA: Parallel DAS Data Storage and Analysis for Subsurface Event Detection. 254-263
- Mahesh Balasubramanian, Trevor D. Ruiz, Brandon Cook, Prabhat, Sharmodeep Bhattacharyya, Aviral Shrivastava, Kristofer E. Bouchard:
Scaling of Union of Intersections for Inference of Granger Causal Networks from Observational Data. 264-273
- Xiaodong Yu, Fengguo Wei, Xinming Ou, Michela Becchi, Tekin Bicer, Danfeng Daphne Yao:
GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting. 274-284
- Dongyu Lu, Yuben Qu, Fan Wu, Haipeng Dai, Chao Dong, Guihai Chen:
Robust Server Placement for Edge Computing. 285-294
- Yoonsung Nam, Yongjun Choi, Byeonghun Yoo, Hyeonsang Eom, Yongseok Son:
EdgeIso: Effective Performance Isolation for Edge Devices. 295-305
- Runtian Ren, Xueyan Tang:
Busy-Time Scheduling on Heterogeneous Machines. 306-315
- Evripidis Bampis, Konstantinos Dogeas, Alexander V. Kononov, Giorgio Lucarelli, Fanny Pascual:
Scheduling Malleable Jobs Under Topological Constraints. 316-325
- Cheng Li, Abdul Dakkak, Jinjun Xiong, Wei Wei, Lingjie Xu, Wen-Mei Hwu:
XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs. 326-327
- Ricardo Nobre, Aleksandar Ilic, Sergio Santander-Jiménez, Leonel Sousa:
Exploring the Binary Precision Capabilities of Tensor Cores for Epistasis Detection. 338-347
- Pantea Zardoshti, Michael F. Spear, Aida Vosoughi, Garret Swart:
Understanding and Improving Persistent Transactions on Optane™ DC Memory. 348-357
- Mengqian Zhang, Jichen Li, Zhaohua Chen, Hongyin Chen, Xiaotie Deng:
CycLedger: A Scalable and Secure Parallel Protocol for Distributed Ledger via Sharding. 358-367
- Jianshu Liu, Shungeng Zhang, Qingyang Wang, Jinpeng Wei:
Mitigating Large Response Time Fluctuations through Fast Concurrency Adapting in Clouds. 368-377
- Yinggen Xu, Liu Liu, Zhijun Ding:
DAG-Aware Joint Task Scheduling and Cache Management in Spark Clusters. 378-387
- Tim Shaffer, Nicholas L. Hazekamp, Jakob Blomer, Douglas Thain:
Solving the Container Explosion Problem for Distributed High Throughput Computing. 388-398
- Zijun Li, Quan Chen, Shuai Xue, Tao Ma, Yong Yang, Zhuo Song, Minyi Guo:
Amoeba: QoS-Awareness and Reduced Resource Usage of Microservices with Serverless Computing. 399-408
- Zhao Zhang, Lei Huang, J. Gregory Pauloski, Ian T. Foster:
Efficient I/O for Neural Network Training with Compressed Data. 409-418
- Jun Yi, Chengliang Zhang, Wei Wang, Cheng Li, Feng Yan:
Not All Explorations Are Equal: Harnessing Heterogeneous Profiling Cost for Efficient MLaaS Training. 419-428
- Saeed Soori, Bugra Can, Mert Gürbüzbalaban, Maryam Mehri Dehnavi:
ASYNC: A Cloud Engine with Asynchrony and History for Distributed Machine Learning. 429-439
- Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-Mei Hwu:
Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs. 440-450
- Debashis Ganguly, Ziyu Zhang, Jun Yang, Rami G. Melhem:
Adaptive Page Migration for Irregular Data-intensive Applications under GPU Memory Oversubscription. 451-461
- Alberto Zeni, Giulia Guidi, Marquita Ellis, Nan Ding, Marco D. Santambrogio, Steven A. Hofmeyr, Aydin Buluç, Leonid Oliker, Katherine A. Yelick:
LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment. 462-471
- Qi Yu, Bruce R. Childers, Libo Huang, Cheng Qian, Hui Guo, Zhiying Wang:
Coordinated Page Prefetch and Eviction for Memory Oversubscription Management in GPUs. 472-482
- Lingqi Zhang, Mohamed Wahib, Haoyu Zhang, Satoshi Matsuoka:
A Study of Single and Multi-device Synchronization Methods in Nvidia GPUs. 483-493
- Lili Gao, Fangyu Zheng, Niall Emmart, Jiankuo Dong, Jingqiang Lin, Charles C. Weems:
DPF-ECC: Accelerating Elliptic Curve Cryptography with Floating-Point Computing Power of GPUs. 494-504
- François-Henry Rouet, Cleve Ashcraft, Jef Dawson, Roger Grimes, Erman Guleryuz, Seid Koric, Robert F. Lucas, James S. Ong, Todd A. Simons, Ting-Ting Zhu:
Scalability Challenges of an Industrial Implicit Finite Element Code. 505-514
- Gregory D. Abram, Vignesh Adhinarayanan, Wu-chun Feng, David H. Rogers, James P. Ahrens:
ETH: An Architecture for Exploring the Design Space of In-situ Scientific Visualization. 515-526
- Alexander van der Grinten, Henning Meyerhenke:
Scaling Betweenness Approximation to Billions of Edges by MPI-based Adaptive Sampling. 527-535
- Haoyu Wang, Haiying Shen, Charles Reiss, Arnim Jain, Yunqiao Zhang:
Improved Intermediate Data Management for MapReduce Frameworks. 536-545
- David Gureya, João Neto, Reza Karimi, João Barreto, Pramod Bhatotia, Vivien Quéma, Rodrigo Rodrigues, Paolo Romano, Vladimir Vlassov:
Bandwidth-Aware Page Placement in NUMA. 546-556
- Hariharan Devarajan, Anthony Kougkas, Luke Logan, Xian-He Sun:
HCompress: Hierarchical Data Compression for Multi-Tiered Storage Environments. 557-566
- Robert Underwood, Sheng Di, Jon C. Calhoun, Franck Cappello:
FRaZ: A Generic High-Fidelity Fixed-Ratio Lossy Compression Framework for Scientific Floating-point Data. 567-577
- Nadja Holtryd, Madhavan Manivannan, Per Stenström, Miquel Pericàs:
DELTA: Distributed Locality-Aware Cache Partitioning for Tile-based Chip Multiprocessors. 578-589
- Mehrzad Nejat, Madhavan Manivannan, Miquel Pericàs, Per Stenström:
Coordinated Management of Processor Configuration and Cache Partitioning to Optimize Energy under QoS Constraints. 590-601
- Wenjie Liu, Ping Huang, Xubin He:
StragglerHelper: Alleviating Straggling in Computing Clusters via Sharing Memory Access Patterns. 602-611
- Nicholas Buoncristiani, Sanjana Shah, David Donofrio, John Shalf:
Evaluating the Numerical Stability of Posit Arithmetic. 612-621
- Ignacio Laguna:
Varity: Quantifying Floating-Point Variations in HPC Systems Through Randomized Testing. 622-633
- Da Yan, Wei Wang, Xiaowen Chu:
Demystifying Tensor Cores to Optimize Half-Precision Matrix Multiply. 634-643
- Yuchen Li, Weifa Liang, Wenzheng Xu, Xiaohua Jia:
Data Collection of IoT Devices Using an Energy-Constrained UAV. 644-653
- Qian Zhou, Omkant Pandey, Fan Ye:
Argus: Multi-Level Service Visibility Scoping for Internet-of-Things in Enterprise Environments. 654-663
- Laphou Lao, Xiaohai Dai, Bin Xiao, Songtao Guo:
G-PBFT: A Location-based and Scalable Consensus Protocol for IoT-Blockchain Applications. 664-673
- Giuseppe Antonio Di Luna, Emmanuelle Anceaume, Leonardo Querzoni:
Byzantine Generalized Lattice Agreement. 674-683
- Yu Huang, Long Zheng, Pengcheng Yao, Jieshan Zhao, Xiaofei Liao, Hai Jin, Jingling Xue:
A Heterogeneous PIM Hardware-Software Co-Design for Energy-Efficient Graph Processing. 684-695
- Long Zheng, Jieshan Zhao, Yu Huang, Qinggang Wang, Zhen Zeng, Jingling Xue, Xiaofei Liao, Hai Jin:
Spara: An Energy-Efficient ReRAM-Based Accelerator for Sparse Graph Analytics Applications. 696-707
- Zhijie Huang, Hong Jiang, Zhirong Shen, Hao Che, Nong Xiao, Ning Li:
Optimal Encoding and Decoding Algorithms for the RAID-6 Liberation Codes. 708-717
- Pu Pang, Quan Chen, Deze Zeng, Chao Li, Jingwen Leng, Wenli Zheng, Minyi Guo:
Sturgeon: Preference-aware Co-location for Improving Utilization of Power Constrained Computers. 718-727
- Yu-Hang Tang, Oguz Selvitopi, Doru-Thom Popovici, Aydin Buluç:
A High-Throughput Solver for Marginalized Graph Kernels on GPU. 728-738
- Muhammad A. Awad, Saman Ashkiani, Serban D. Porumbescu, John D. Owens:
Dynamic Graphs on the GPU. 739-748
- Lucas Erlandson, Difeng Cai, Yuanzhe Xi, Edmond Chow:
Accelerating Parallel Hierarchical Matrix-Vector Products via Data-Driven Sampling. 749-758
- Changyong Hu, Vijay K. Garg:
NC Algorithms for Popular Matchings in One-Sided Preference Systems and Related Problems. 759-768
- Jiechao Gao, Haoyu Wang, Haiying Shen:
Smartly Handling Renewable Energy Instability in Supporting A Cloud Datacenter. 769-778
- Vinodh Kumaran Jayakumar, Jaewoo Lee, In Kee Kim, Wei Wang:
A Self-Optimized Generic Workload Prediction Framework for Cloud Computing. 779-788
- Ivana Marincic, Venkatram Vishwanath, Henry Hoffmann:
SeeSAw: Optimizing Performance of In-Situ Analytics Applications under Power Constraints. 789-798
- Tirthak Patel, Adam Wagenhäuser, Christopher Eibel, Timo Hönig, Thomas Zeiser, Devesh Tiwari:
What does Power Consumption Behavior of HPC Jobs Reveal? : Demystifying, Quantifying, and Predicting Power Consumption Characteristics. 799-809
- Jie Yang, Satish Puri:
Efficient Parallel and Adaptive Partitioning for Load-balancing in Spatial Join. 810-820
- Xin Wang, Misbah Mubarak, Yao Kang, Robert B. Ross, Zhiling Lan:
Union: An Automatic Workload Manager for Accelerating Network Simulation. 821-830
- Harshitha Menon, Abhinav Bhatele, Todd Gamblin:
Auto-tuning Parameter Choices in HPC Applications using Bayesian Optimization. 831-840
- Zhihui Du, Xinning Hui, Yurui Wang, Jun Jiang, Jason Liu, Baokun Lu, Chongyu Wang:
Inter-Job Scheduling of High-Throughput Material Screening Applications. 841-852
- Ana Gainaru, Brice Goglin, Valentin Honoré, Guillaume Pallez Aupy, Padma Raghavan, Yves Robert, Hongyang Sun:
Reservation and Checkpointing Strategies for Stochastic Jobs. 853-863
- Shikha Singh, Sergey Madaminov, Michael A. Bender, Michael Ferdman, Ryan Johnson, Benjamin Moseley, Hung Q. Ngo, Dung Nguyen, Soeren Olesen, Kurt Stirewalt, Geoffrey Washburn:
A Scheduling Approach to Incremental Maintenance of Datalog Programs. 864-873
- Costas Busch, Maurice Herlihy, Miroslav Popovic, Gokarna Sharma:
Dynamic Scheduling in Distributed Transactional Memory. 874-883
- Marcus Ritter, Alexandru Calotoiu, Sebastian Rinke, Thorsten Reimann, Torsten Hoefler, Felix Wolf:
Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling. 884-895
- Abhinav Bhatele, Jayaraman J. Thiagarajan, Taylor L. Groves, Rushil Anirudh, Staci A. Smith, Brandon Cook, David K. Lowenthal:
The Case of Performance Variability on Dragonfly-based Systems. 896-905
- Donghe Kang, Oliver Rübel, Suren Byna, Spyros Blanas:
Predicting and Comparing the Performance of Array Management Libraries. 906-915
- Ivy Bo Peng, Kai Wu, Jie Ren, Dong Li, Maya B. Gokhale:
Demystifying the Performance of HPC Scientific Applications on NVM-based Memory Systems. 916-925
- Rui Xia, Haipeng Dai, Jiaqi Zheng, Hong Xu, Meng Li, Guihai Chen:
Packet-in Request Redirection for Minimizing Control Plane Response Time. 926-935
- Chao Tian, Lingxiao Ma, Zhi Yang, Yafei Dai:
PCGCN: Partition-Centric Processing for Accelerating Graph Convolutional Network. 936-945
- Guiyan Liu, Songtao Guo, Pan Li, Liang Liu:
ConMidbox: Consolidated Middleboxes Selection and Routing in SDN/NFV-Enabled Networks. 946-955
- Gustavo Chávez, Yang Liu, Pieter Ghysels, Xiaoye Sherry Li, Elizaveta Rebrova:
Scalable and Memory-Efficient Kernel Ridge Regression. 956-965
- Renping Liu, Xianzhang Chen, Yujuan Tan, Runyu Zhang, Liang Liang, Duo Liu:
SSDKeeper: Self-Adapting Channel Allocation to Improve the Performance of SSD Devices. 966-975
- Madhurima Ray, Krishna Kant, Peng Li, Sanjeev Trika:
FlashKey: A High-Performance Flash Friendly Key-Value Store. 976-985
- Yubo Liu, Yutong Lu, Zhiguang Chen, Ming Zhao:
Pacon: Improving Scalability and Efficiency of Metadata Service through Partial Consistency. 986-996
- Peter Pirkelbauer, Pei-Hung Lin, Tristan Vanderbruggen, Chunhua Liao:
XPlacer: Automatic Analysis of Data Access Patterns on Heterogeneous CPU/GPU Systems. 997-1007
- João P. L. de Carvalho, Bruno C. Honorio, Alexandro Baldassin, Guido Araujo:
Improving Transactional Code Generation via Variable Annotation and Barrier Elision. 1008-1017
- Hancheng Wu, Michela Becchi:
Evaluating Thread Coarsening and Low-cost Synchronization on Intel Xeon Phi. 1018-1029
- André Müller, Bertil Schmidt, Andreas Hildebrandt, Richard Membarth, Roland Leißa, Matthis Kruse, Sebastian Hack:
AnySeq: A High Performance Sequence Alignment Library based on Partial Evaluation. 1030-1040
- Lionel Eyraud-Dubois, Suraj Kumar:
Analysis of a List Scheduling Algorithm for Task Graphs on Two Types of Resources. 1041-1050
- Rory Hector, Ramachandran Vaidyanathan, Gokarna Sharma, Jerry L. Trahan:
Optimal Convex Hull Formation on a Grid by Asynchronous Robots with Lights. 1051-1060
- Alberto Marchetti-Spaccamela, Nicole Megow, Jens Schlöter, Martin Skutella, Leen Stougie:
On the Complexity of Conditional DAG Scheduling in Multiprocessor Systems. 1061-1070
- Xin Sunny Huang, Yiting Xia, T. S. Eugene Ng:
Weaver: Efficient Coflow Scheduling in Heterogeneous Parallel Networks. 1071-1081
- Diyu Zhou, Yuval Tamir:
Fault-Tolerant Containers Using NiLiCon. 1082-1091
- Anwesha Das, Frank Mueller, Barry Rountree:
Aarohi: Making Real-Time Node Failure Prediction Feasible. 1092-1101
- Pinchao Liu, Hailu Xu, Dilma Da Silva, Qingyang Wang, Sarker Tanzir Ahmed, Liting Hu:
FP4S: Fragment-based Parallel State Recovery for Stateful Stream Applications. 1102-1111
- Maxime France-Pillois, Jérôme Martin, Frédéric Rousseau:
Implementation and Evaluation of a Hardware Decentralized Synchronization Lock for MPSoCs. 1112-1121
- Maciej Besta, Raghavendra Kanakagiri, Harun Mustafa, Mikhail Karasikov, Gunnar Rätsch, Torsten Hoefler, Edgar Solomonik:
Communication-Efficient Jaccard similarity for High-Performance Distributed Genome Comparisons. 1122-1132
- Kyle Berney, Nodari Sitchinava:
Engineering Worst-Case Inputs for Pairwise Merge Sort on GPUs. 1133-1142
- Karolos Antoniadis, Diego Didona, Rachid Guerraoui, Willy Zwaenepoel:
The Impossibility of Fast Transactions. 1143-1154