
Introduction to GPU Programming Getting Help with CUDA
- Start with the CUDA Forums
- Register as a CUDA Developer
- File bugs, get early access to software releases
- CUDA driver
- C for CUDA toolkit
- Includes CUDA GDB Debugger and CUDA Visual Profiler
- OpenCL Drivers, Visual Profiler, SDK Code Samples and more
- Fortran for CUDA toolkit (coming soon)
- NVIDIA provided
- CUDA SDK Code samples
- CUDA BLAS library (part of toolkit)
- CUDA FFT library (part of toolkit)
- NVPP: NVIDIA Performance Primitives early access: Focuses on image and video processing
- 3rd party provided
- CUDA data parallel primitives library (cuDPP)
- CULA Tools: LAPACK on CUDA GPUs from EM Photonics
- MAGMA: LAPACK on CUDA GPUs and Multi-core CPUs from Dongarra’s Group
- Jacobi-preconditioned Conjugate Gradient
- GPULib: Library of Mathematical functions for IDL and MATLAB
- GPU VSIPL signal processing library
- Computer Vision and imaging library
- PGI’s Fortran and C to CUDA C Auto-parallelizing Compiler
- CAPS HMPP C and Fortran to CUDA C Auto-parallelizing Compiler
- MATLAB
- Comprehensive page about MATLAB on CUDA support
- Jacket CUDA pluginfor MATLAB
- MATLAB plugin for CUDA using MEX
- GPULib: Library of mathematical functions with MATLAB and IDL bindings
- Mathematica Plugin for CUDA
- Enabling GPU Computing in the R Statistical Environment
- CUDA GPU Library for LabVIEW from National Instruments
- Fortran
- PGI CUDA Fortran compiler
- Fortran to C for CUDA translator
- Fortran wrapper for CUDA
- FLAGON: Fortran 95 library for GPU Numerics
- Python wrapper for CUDA: PyCUDA
- Java wrapper
- .NET integration for CUDA
- Thrust: C++ template Library for CUDA
- CuPP : C++ framework for CUDA
- Libra: C/C++ abstraction layer for CUDA
- F# for CUDA
- Eclipse support for CUDA
- CUDA Occupancy Calculator
- Roll for Rocks cluster
- Rocks 5.0 for CUDA 2.0 on Linux 64-bit
- Rocks Rolls for CUDA from ClusterCorp