The rCUDA team is pleased to invite you to the live rCUDA demo at Mellanox Technologies booth #2722 at SC13 in Denver, CO (November 17-22, 2013). Please visit us and learn how remote GPU virtualization can help you to reduce the cost of your computing infrastructure.

The set of applications tested with rCUDA has recently increased with the addition of the GROMACS application. With this new addition, the list of applications successfully executed with rCUDA includes LAMMPS, WideLM, CUDASW++, OpenFOAM, HOOMD-blue, mCUDA-MEME, GPU-BLAST and GROMACS.

The last developments of rCUDA, including a thorough performance analysis, were presented on September 12th at the HPC Advisory Council Spain Conference 2013, held in Barcelona (Spain) and co-hosted by the Barcelona Supercomputer Center. The slides of the presentation are available here. You can also access a video with the presentation in this link.

The rCUDA team is glad to announce that its last research results were presented on Tuesday September 24th in the CLUSTER 2013 Conference held in Indianapolis, IN (USA).The presented paper, titled "Influence of InfiniBand FDR on the performance of remote GPU virtualization", showed the benefits that the last version of the InfiniBand technology provides to GPU virtualization. The paper was awarded with the Best Paper Award.

The rCUDA remote GPU virtualization technology, whose last release also supports the ARM processor architecture, has been now leveraged to execute the LAMMPS and CUDASW++ applications in ARM-based systems. More specifically, the Tegra 3 ARM Cortex A9 quad-core CPUs (1.4 GHz) present in the NVIDIA Development Kits CARMA and KAYLA have been used to execute the CPU code of these applications, whereas the GPU code has been offloaded to an NVIDIA GeForce GTX480 “Fermi” GPU installed in a remote regular Xeon-based system. Results clearly show that by using rCUDA and remote accelerators, noticeable performance improvements are achieved over using the local NVIDIA Quadro 1000M GPU already present in the CARMA system, despite of using a traditional 1Gbps Ethernet network.