Get CUDA Data-Parallel Algorithms

Categories:


256-bin Histogram For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample demonstrates efficient implementation of 256-bin histogram.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

64-bin Histogram For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample demonstrates efficient implementation of 64-bin histogram.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

Separable Convolution For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample implements a separable convolution filter of a 2D signal with a gaussian kernel.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

Texture-Based Separable Convolution For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Texture-based implementation of a separable 2D convolution with a gaussian kernel. Used for performance comparison against convolutionSeparable.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Bitonic Sort For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Bitonic sort is a very simple parallel sorting algorithm that is very efficient when sorting a small number of elements: http://citeseer.ist.psu.edu/blelloch98experimental.html. This implementation is based on: http://www.tools-of-computing.com/tc/CS/Sorts/bitonic_sort.htm.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

N-Body Simulation For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample demonstrates efficient all-pairs simulation of a gravitational n-body simulation in CUDA. This sample accompanies the GPU Gems 3 chapter "Fast N-Body Simulation with CUDA".
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

Parallel Reduction For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

A parallel sum reduction that computes the sum of large arrays of values. This sample demonstrates several important optimization stratezies for parallel algorithms like reduction.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

Mandelbrot For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This sample uses CUDA to compute and display the Mandelbrot set.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Fast Walsh Transform For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

Naturally(Hadamard)-ordered Fast Walsh Tranform for batched vectors of arbitrary eligible(power of two) lengths.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Download - Windows
Download - Linux

Scan For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This example demonstrates an efficient CUDA implementation of parallel prefix sum, also known as "scan". Given an array of numbers, scan computes a new array in which each element is the sum of all the elements before it in the input array.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

Scan of Large Arrays For a direct link to this sample, right-click and copy the URL (shortcut) of this link icon.

This example demonstrates an efficient CUDA implementation of parallel prefix sum (also known as "scan") for arbitrary-sized arrays. Given an array of numbers, scan computes a new array in which each element is the sum of all the elements before it in the input array.
GeForce® 8 Series
Quadro® FX 5600 or later
Tesla™

Whitepaper
Download - Windows
Download - Linux

 

© 2008 NVIDIA Corporation | Privacy Policy | Legal Info
 
NVIDIA CUDA Zone Home