default search action
27th PACT 2018: Limassol, Cyprus
- Skevos Evripidou, Per Stenström, Michael F. P. O'Boyle:
Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, PACT 2018, Limassol, Cyprus, November 01-04, 2018. ACM 2018
Emerging applications & systems
- Vasileios Zois, Divya Gupta, Vassilis J. Tsotras, Walid A. Najjar, Jean-François Roy:
Massively parallel skyline computation for processing-in-memory architectures. 1:1-1:12 - Wanling Gao, Jianfeng Zhan, Lei Wang, Chunjie Luo, Daoyi Zheng, Fei Tang, Biwei Xie, Chen Zheng, Xu Wen, Xiwen He, Hainan Ye, Rui Ren:
Data motifs: a lens towards fully understanding big data and AI workloads. 2:1-2:14 - Sanchit Misra, Tony C. Pan, Kanak Mahadik, George Powley, Priya N. Vaidya, Md. Vasimuddin, Srinivas Aluru:
Performance extraction and suitability analysis of multi- and many-core architectures for next generation sequencing secondary analysis. 3:1-3:14
Memory systems
- Biswabandan Panda, André Seznec:
Synergistic cache layout for reuse and compression. 4:1-4:13 - Jinsu Park, Seongbeom Park, Myeonggyun Han, Jihoon Hyun, Woongki Baek:
Hypart: a hybrid technique for practical memory bandwidth partitioning on commodity servers. 5:1-5:14 - Jeongkyu Hong, Hyeonggyu Kim, Soontae Kim:
EAR: ECC-aided refresh reduction through 2-D zero compression. 6:1-6:11
Graph processing
- Maciej Besta, Dimitri Stanojevic, Tijana Zivic, Jagpreet Singh, Maurice Hoerold, Torsten Hoefler:
Log(graph): a near-optimal high-performance graph representation. 7:1-7:13 - Pengcheng Yao, Long Zheng, Xiaofei Liao, Hai Jin, Bingsheng He:
An efficient graph accelerator with parallel data conflict management. 8:1-8:12 - Zhen Peng, Alexander Powell, Bo Wu, Tekin Bicer, Bin Ren:
Graphphi: efficient parallel graph processing on emerging throughput-oriented architectures. 9:1-9:14
Compiler optimization
- Peng Jiang, Linchuan Chen, Gagan Agrawal:
Revealing parallel scans and reductions in recurrences through function reconstruction. 10:1-10:13 - Sooraj Puthoor, Mikko H. Lipasti:
Compiler assisted coalescing. 11:1-11:11 - Vasileios Porpodas, Rodrigo C. O. Rocha, Luís F. W. Góes:
VW-SLP: auto-vectorization with adaptive vector width. 12:1-12:15
Parallelization management
- Adrià Armejach, Helena Caminal, Juan M. Cebrian, Rekai González-Alberquilla, Chris Adeniyi-Jones, Mateo Valero, Marc Casas, Miquel Moretó:
Stencil codes on a vector length agnostic architecture. 13:1-13:12 - Younghyun Cho, Camilo A. Celis Guzman, Bernhard Egger:
Maximizing system utilization via parallelism management for co-located parallel applications. 14:1-14:14 - Prakash Prabhu, Stephen R. Beard, Sotiris Apostolakis, Ayal Zaks, David I. August:
MemoDyn: exploiting weakly consistent data structures for dynamic parallel memoization. 15:1-15:12
Machine learning architectures
- Animesh Jain, Michael A. Laurenzano, Gilles A. Pokam, Jason Mars, Lingjia Tang:
Architectural support for convolutional neural networks on modern CPUs. 16:1-16:13 - Young H. Oh, Quan Quan, Daeyeon Kim, Seonghak Kim, Jun Heo, Sungjun Jung, Jaeyoung Jang, Jae W. Lee:
A portable, automatic data qantizer for deep neural networks. 17:1-17:14 - Franyell Silfa, Gem Dot, José-María Arnau, Antonio González:
E-PUR: an energy-efficient processing unit for recurrent neural networks. 18:1-18:12
Runtime
- Francisco Romero, Christina Delimitrou:
Mage: online and interference-aware scheduling for multi-scale heterogeneous systems. 19:1-19:13 - Bang Di, Jianhua Sun, Dong Li, Hao Chen, Zhe Quan:
GMOD: a dynamic GPU memory overflow detector. 20:1-20:13 - Younghyun Cho, Florian Negele, Seohong Park, Bernhard Egger, Thomas R. Gross:
On-the-fly workload partitioning for integrated CPU/GPU architectures. 21:1-21:13
Storage systems
- Sukhan Lee, Kiwon Lee, Min Chul Sung, Mohammad Alian, Chankyung Kim, Wooyeong Cho, Reum Oh, Seongil O, Jung Ho Ahn, Nam Sung Kim:
3D-Xpath: high-density managed DRAM architecture with cost-effective alternative paths for memory transactions. 22:1-22:12 - Jiang Zhou, Yong Chen, Weiping Wang:
Atributed consistent hashing for heterogeneous storage systems. 23:1-23:12 - Wei Zhang, Houjun Tang, Suren Byna, Yong Chen:
DART: distributed adaptive radix tree for efficient affix-based keyword search on HPC systems. 24:1-24:12
Best paper session
- Arthur Perais, André Seznec:
Cost effective speculation with the omnipredictor. 25:1-25:13 - Long Zheng, Xiaofei Liao, Hai Jin, Bingsheng He, Jingling Xue, Haikun Liu:
Towards concurrency race debugging: an integrated approach for constraint solving and dynamic slicing. 26:1-26:13 - Arun Thangamani, V. Krishna Nandivada:
Optimizing remote data transfers in X10. 27:1-27:15 - Wim Heirman, Kristof Du Bois, Yves Vandriessche, Stijn Eyerman, Ibrahim Hur:
Near-side prefetch throttling: adaptive prefetching for high-performance many-core processors. 28:1-28:11
Programming models & compilers
- Michael LeBeane, Khaled Hamidouche, Brad Benton, Maurício Breternitz, Steven K. Reinhardt, Lizy K. John:
ComP-net: command processor networking for efficient intra-kernel communications on GPUs. 29:1-29:13 - Vladimir Kiriansky, Haoran Xu, Martin C. Rinard, Saman P. Amarasinghe:
Cimple: instruction and memory level parallelism: a DSL for uncovering ILP and MLP. 30:1-30:16 - Pedro Ramos, Gleison Souza Diniz Mendonca, Divino Soares, Guido Araújo, Fernando Magno Quintão Pereira:
Automatic annotation of tasks in structured code. 31:1-31:13 - Prithayan Barua, Jun Shirako, Vivek Sarkar:
Cost-driven thread coarsening for GPU kernels. 32:1-32:14
Memory and acceleration
- Sunjae Park, Christopher J. Hughes, Milos Prvulovic:
Transactional pre-abort handlers in hardware transactional memory. 33:1-33:11 - Amir Yazdanbakhsh, Choungki Song, Jacob Sacks, Pejman Lotfi-Kamran, Hadi Esmaeilzadeh, Nam Sung Kim:
In-DRAM near-data approximate acceleration for GPUs. 34:1-34:14 - Jiho Choi, Thomas Shull, Josep Torrellas:
Biased reference counting: minimizing atomic operations in garbage collection. 35:1-35:12 - Tony Nowatzki, Newsha Ardalani, Karthikeyan Sankaralingam, Jian Weng:
Hybrid optimization/heuristic instruction scheduling for programmable accelerator codesign. 36:1-36:15
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.