Get CUDA Basic Topics

Categories:


Separable Convolution For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample implements a separable convolution filter of a 2D signal with a gaussian kernel.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

Texture-Based Separable Convolution For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Texture-based implementation of a separable 2D convolution with a gaussian kernel. Used for performance comparison against convolutionSeparable.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Black-Scholes Option Pricing For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample evaluates fair call and put prices for a given set of European options by Black-Scholes formula.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

Bitonic Sort For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Bitonic sort is a very simple parallel sorting algorithm that is very efficient when sorting a small number of elements: http://citeseer.ist.psu.edu/blelloch98experimental.html. This implementation is based on: http://www.tools-of-computing.com/tc/CS/Sorts/bitonic_sort.htm.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Scalar Product For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample calculates scalar products of a given set of input vector pairs.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Clock For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This example shows how to use the clock function to measure the performance of kernel accurately.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Multi-GPU For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This application demonstrates how to use the CUDA API to use multiple GPUs.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Aligned Types For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

A simple test, showing huge access speed gap between aligned and misaligned structures.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

asyncAPI For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample uses CUDA streams and events to overlap execution on CPU and GPU.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

cudaOpenMP For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample shows how to use OpenMP API to write an application for multiple GPUs.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows

simpleStreams For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample uses CUDA streams to overlap kernel executions with memcopies between the device and the host.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Simple Atomics For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

A simple demonstration of global memory atomic instructions.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Device Query For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample enumerates the properties of the CUDA devices present in the system.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Simple Templates For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample is a templatized version of the template project. It also shows how to correctly templatize dynamically allocated shared memory arrays.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Bandwidth Test For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This is a simple test program to measure the memcopy bandwidth of the GPU. It currently is capable of measuring device to device copy bandwidth, host to device copy bandwidth for pageable and page-locked memory, and device to host copy bandwidth for pageable and page-locked memory.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Simple Texture (Driver Version) For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Simple example that demonstrates use of textures in CUDA using the driver API.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Simple Texture For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Simple example that demonstrates use of textures in CUDA.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Matrix Multiplication (Driver Version) For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample implements matrix multiplication using the CUDA driver API. It has been written for clarity of exposition to illustrate various CUDA programming principles, not with the goal of providing the most performant generic kernel for matrix multiplication. CUBLAS provides high-performance matrix multiplication.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Template For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

A trivial template project that can be used as a starting point to create new CUDA projects.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Simple CUFFT For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Example of using CUFFT. In this example, CUFFT is used to compute the 1D-convolution of some signal with some filter by transforming both into frequency domain, multiplying them together, and transforming the signal back to time domain.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Simple Direct3D For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Simple program which demonstrates interoperability between CUDA and Direct3D. The program modifies vertex positions with CUDA and uses Direct3D to render the geometry.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows

Simple OpenGL For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Simple program which demonstrates interoperability between CUDA and OpenGL. The program modifies vertex positions with CUDA and uses OpenGL to render the geometry.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Simple CUBLAS For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Example of using CUBLAS.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Matrix Multiplication For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample implements matrix multiplication and is exactly the same as Chapter 6 of the programming guide. It has been written for clarity of exposition to illustrate various CUDA programming principles, not with the goal of providing the most performant generic kernel for matrix multiplication. CUBLAS provides high-performance matrix multiplication.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

C++ Integration For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This example demonstrates how to integrate CUDA into an existing C++ application, i.e. the CUDA entry point on host side is only a function which is called from C++ code and only the file containing this function is compiled with nvcc. It also demonstrates that vector types can be used from cpp.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

 

© 2008 NVIDIA Corporation | Privacy Policy | Legal Info
 
NVIDIA CUDA Zone Home