X hits on this document

Powerpoint document

The Missouri S&T CS GPU Cluster - page 15 / 18

32 views

0 shares

0 downloads

0 comments

15 / 18

A few CUDA API functions

cudaSetDevice(int dev) - Sets the device to run the kernel.

__syncthreads() - Blocks execution of all threads within a block until they synchronize.

cudaMalloc(void** devPtr, size_t count) - Allocates count bytes in GPU memory and returns a pointer to it in the parameter *devPtr.

cudaMemcpy(void* dst, const void* src, size_t count, enum cudaMemcpyKind kind) - copies count bytes from src to dst where kind is

A complete listing of the CUDA API functions can be found in the Reference Manual.

Document info
Document views32
Page views32
Page last viewedSat Dec 03 13:54:30 UTC 2016
Pages18
Paragraphs213
Words1253

Comments