NVIDIA has launched their next generation GPU based on their Kepler Architecture. They followed it up with a rather quick update to their CUDA toolkit. Considering that we have access to 3 generations of their GTX cards (480, 580 and 680), we thought we would show case how the performance has changed over the generations. Matrix multiplication: It can be seen that the GTX 680 breaches the 1 Terraflop mark comfortably for single precision, while the GTX 580 barely scratches it. However the performance seems to peak around 2048 x 2048 and then rallies downward to match the performance of the GTX 580 at larger sizes. The high end Tesla C2070 finishes last for single precision behind the third placed …
12,288 CUDA Cores in One Computer
Kepler is here. And it’s fantastic! The news came out today that the first Kepler GPU, the GeForce GTX 680, has been launched. A single GPU has 1,536 CUDA Cores. This means that those high-end workstations with 8 PCIe slots will be able to pack 12,288 CUDA cores into a single computer. That’s some serious computational power. Current high-end Fermi cards have 512 cores, so this new Kepler architecture boasts 3X the number of computation cores. Normally we focus on the higher-end Tesla products because those more aptly fit the needs of our science, engineering, and financial computing readers. But we are excited nonetheless by this GeForce GPU. It is a major step forward in GPU technology. And this GeForce card portends …