default search action
Parallel Computing, Volume 35
Volume 35, Number 1, January 2009
- Xiandong Meng, Vipin Chaudhary:
Boosting data throughput for sequence database similarity searches on FPGAs using an adaptive buffering scheme. 1-11 - Ricardo C. Corrêa, Valmir Carneiro Barbosa:
Partially ordered distributed computations on asynchronous point-to-point networks. 12-28 - Lih-Yuan Deng, Huajiang Li, Jyh-Jen Horng Shiau:
Scalable parallel multiple recursive generators of large order. 29-37 - Alfredo Buttari, Julien Langou, Jakub Kurzak, Jack J. Dongarra:
A class of parallel tiled linear algebra algorithms for multicore architectures. 38-53
Volume 35, Number 2, February 2009
- Fabrício Alves Barbosa da Silva, Hermes Senger:
Improving scalability of Bag-of-Tasks applications running on master-slave platforms. 57-71 - Yuh-Rau Wang:
A novel O(1) time algorithm for 3D block-based medial axis transform by peeling corner shells. 72-82 - Anne Benoit, Mourad Hakem, Yves Robert:
Contention awareness and fault-tolerant scheduling for precedence constrained tasks in heterogeneous systems. 83-108 - Lars K. S. Daldorff, Bengt Eliasson:
Parallelization of a Vlasov-Maxwell solver in four-dimensional phase space. 109-115
Volume 35, Number 3, March 2009
- Rupak Biswas, Leonid Oliker, Jeffrey S. Vetter:
Revolutionary technologies for acceleration of emerging petascale applications. 117-118 - David A. Bader, Virat Agarwal, Seunghwa Kang:
Computing discrete transforms on the Cell Broadband Engine. 119-137 - Jakub Kurzak, Wesley Alvaro, Jack J. Dongarra:
Optimizing matrix multiplication for a short-vector SIMD architecture - CELL processor. 138-150 - Jeremy S. Meredith, Gonzalo Alvarez, Thomas A. Maier, Thomas C. Schulthess, Jeffrey S. Vetter:
Accuracy and performance of graphics processors: A Quantum Monte Carlo application case study. 151-163 - David J. Hardy, John E. Stone, Klaus Schulten:
Multilevel summation of electrostatic potentials using graphics processing units. 164-177 - Samuel Williams, Leonid Oliker, Richard W. Vuduc, John Shalf, Katherine A. Yelick, James Demmel:
Optimization of sparse matrix-vector multiplication on emerging multicore platforms. 178-194
Volume 35, Number 4, April 2009
- Suresh Behara, Sanjay Mittal:
Parallel finite element computation of incompressible flows. 195-212 - Arquimedes Canedo, Ben A. Abderazek, Masahiro Sowa:
Efficient compilation for queue size constrained queue processors. 213-225 - Tien-Yien Li, Chih-Hsiung Tsai:
HOM4PS-2.0para: Parallelization of HOM4PS-2.0 for solving polynomial systems. 226-238 - Sid Ahmed Ali Touati, Zsolt Mathe:
Periodic register saturation in innermost loops. 239-254
Volume 35, Number 5, May 2009
- Won Woo Ro, Jean-Luc Gaudiot:
A complexity-effective microprocessor design with decoupled dispatch queues and prefetching. 255-268 - Yaohang Li, Michael Mascagni, Andrey Gorin:
A decentralized parallel implementation for parallel tempering algorithm. 269-283 - Leopold Grinberg, Dmitry Pekurovsky, Spencer J. Sherwin, George E. Karniadakis:
Parallel performance of the coarse space linear vertex solver and low energy basis preconditioner for spectral/hp elements. 284-304 - Antonio Robles-Gómez, Aurelio Bermúdez, Rafael Casado, Åshild Grønstad Solheim:
A dynamic distributed mechanism for reconfiguring high-performance networks. 305-312
Volume 35, Number 6, June 2009
- Ching-Wen Chen, Chuan-Chi Weng, Chang-Jung Ku:
An overlapping and pipelining data transmission MAC protocol with multiple channels in ad hoc networks. 313-330 - Taro Konda, Yoshimasa Nakamura:
A new algorithm for singular value decomposition and its parallelization. 331-344 - Gerold Jäger, Clemens Wagner:
Efficient parallelizations of Hermite and Smith normal form algorithms. 345-357 - Julian Borrill, Leonid Oliker, John Shalf, Hongzhang Shan, Andrew Uselton:
HPC global file system performance analysis using a scientific-application derived benchmark. 358-373
Volume 35, Number 7, July 2009
- Markus Geimer, Felix Wolf, Brian J. N. Wylie, Bernd Mohr:
A scalable tool architecture for diagnosing wait states in massively parallel applications. 375-388 - Jay Smith, Vladimir Shestak, Howard Jay Siegel, Suzy Price, Larry Teklits, Prasanna Sugavanam:
Robust resource allocation in a cluster based imaging system. 389-400 - Yang Wang, Ming Zhu, Hua Li:
A distributed Key Message algorithm to optimize the communication in clusters. 401-415 - Hatem Ltaief, Marc Garbey:
A parallel Aitken-additive Schwarz waveform relaxation suitable for the grid. 416-428
Volume 35, Numbers 8-9, August - September 2009
- Cole Trapnell, Michael C. Schatz:
Optimizing data intensive GPGPU computations for DNA sequence alignment. 429-440 - Tz-Liang Kueng, Cheng-Kuan Lin, Tyne Liang, Jimmy J. M. Tan, Lih-Hsing Hsu:
Embedding paths of variable lengths into hypercubes with conditional link-faults. 441-454 - Arturo González-Escribano, Arjan J. C. van Gemund, Valentín Cardeñoso-Payo:
Performance implications of synchronization structure in parallel programming. 455-474 - Ananta Tiwari, Vahid Tabatabaee, Jeffrey K. Hollingsworth:
Tuning parallel applications in parallel. 475-492
Volume 35, Numbers 10-11, October - November 2009
- Diane Lingrand, Tristan Glatard, Johan Montagnat:
Modeling the latency on production grids with respect to the execution context. 493-511 - Anshu Dubey, Katie Antypas, Murali K. Ganapathy, Lynn B. Reid, Katherine Riley, Daniel J. Sheeler, Andrew R. Siegel, Klaus Weide:
Extensible component-based architecture for FLASH, a massively parallel, multiphysics simulation code. 512-522 - Ismael Marín Carrión, Enrique Arias-Antúnez, M. M. Artigao Castillo, Julio José Águila Guerrero, Juan José Miralles Canals:
Thread-based implementations of the false nearest neighbors method. 523-534 - Hamid Mahini, Hamid Sarbazi-Azad:
Resource placement in three-dimensional tori. 535-543 - Henning Meyerhenke, Burkhard Monien, Stefan Schamberger:
Graph partitioning and disturbed diffusion. 544-569
Volume 35, Number 12, December 2009
- Franck Cappello, Thomas Hérault, Jack J. Dongarra:
Foreword. 571
- Bin Jia:
Process cooperation in multiple message broadcast. 572-580 - Peter Sanders, Jochen Speck, Jesper Larsson Träff:
Two-tree algorithms for full bandwidth broadcast, reduction and scan. 581-594 - Daniel Becker, Rolf Rabenseifner, Felix Wolf, John C. Linford:
Scalable timestamp synchronization for event traces of message-passing applications. 595-607 - Rajeev Thakur, William Gropp:
Test suite for evaluating performance of multithreaded MPI communication. 608-617
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.