Building High Performance Computers and Clusters with GPU's

Welcome to the Building High Performance Computers and Servers with GPU's community.

 Champions:

Wen-mei W. Hwu: I am a professor at the Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign. Since 1997. My research interests include architecture, implementation, and compilation for high performance computer systems. I am currently the co-principal investigator for the Blue Waters Project., a supercomputer  which will be capable of one petaflop of performance when it goes live in 2011.

Volodymyr Kindratenko: I am a research scientist in the Innovative Systems Laboratory (ISL) at the National Center for Supercomputing Applications (NCSA), University of Illinois at Urbana-Champaign (UIUC). My research interests are mainly in the area of high-performance computing and special-purpose computing architectures, GPU clusters in particular.
Featured Forum Topics

Convergence Iterations - Conjugate Gradient CUDA


I am doing some research in the Conjugate Gradient solver and was wondering about number of iterations to converge on the GPU verses the CPU.  When I execute the CPU version of the preconditioned conjugate gradient solver, it takes more iterations to converge as compared to the GPU version.  The GPU always converges with less iterations - I use different CPUs and GPUs (one is NVIDIA Quadro fx5600 and the other is Tesla M2070) but same iteration count results.


Read More

_______________________________________________________________________________________________

Opensource Tools to create Multi GPU cluster across a network


 Hello Friends,

I have 4 Tesla C2050 cards and have hooked each of them to individual computers, i.e each forming a node. I want to run CUDA programs such that it uses the compute power of this cluster.

  • Are there any Opensource tools that can be used to establish the cluster using these 4 nodes?
  • How to write a CUDA program that is distributed across the GPU nodes?
Thanks,
MAK
 

Read More

_______________________________________________________________________________________________

Latest Stories and Papers

Title New Replies Last postsort icon
Atomic-free Irregular Computations on GPUs New 0 new 02-15-2013
A Scalable Heterogeneous Parallelization Framework for Iterative Local Searches New 0 new 02-11-2013
Data-driven versus Topology-driven Irregular Computations on GPUs New 0 new 02-11-2013
Design and Performance Measurement of a High-performance Computing Cluster (IEEE) New 0 new 01-03-2013
Participation of foreign institutions in the Project New 0 new 11-08-2012
CUDA 5 - Production Release Now Available - Many New Enabling Technologies New 0 new 10-15-2012
A Simulated Annealing Algorithm for GPU Clusters New 0 new 09-07-2012
CFP: 3rd International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS12) New 0 new 08-10-2012
Clustered principal components for precomputed radiance transfer (ACM) New 1 new 04-10-2012
Virtual Geographical Space visualization based on a high-performance graphics cluster (IEEE) New 0 new 01-23-2012
MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefit (IEEE) New 0 new 01-23-2012
Online Accelarated Implementation of the Fuzzy C-Means Algorithm with the use of the GPU Platform (IEEE) New 0 new 01-23-2012
Non-Parametric Co-Clustering of Large Scale Sparse Bipartite Networks on the GPU (IEEE) New 0 new 12-15-2011
A Dynamic Load Balance on GPU Cluster for Fork-Join Search (IEEE) New 0 new 12-15-2011
Programming-Level Power Measurement for GPU Clusters (IEEE) New 0 new 12-14-2011
Design of MILC Lattice QCD Application for GPU Clusters (IEEE) New 0 new 12-14-2011

Latest Blog Posts

Title New Replies Last postsort icon

Latest Wiki Pages

Title New Replies Last postsort icon


WIKI RSS Feed

Unread Items

Type Title Author Replies Last postsort icon