CUDA Kernels

A collection of NVIDIA CUDA kernel examples that explore parallel computation concepts and GPU programming techniques, developed with CUDA 11.5. The examples range from querying basic device properties to image processing operations.

Overview

The repository contains six example projects, each in its own directory and each focused on a specific CUDA programming concept: Device_Properties, Increment_Kernel, Threads_Indices_Kernel, Vector_Addition_Kernel, Image_Color_Manipulation_Kernel, and Occupancy.

Prerequisites

NVIDIA GPU: CUDA-capable GPU with compute capability 3.5 or higher (CUDA 11.x no longer supports compute capability 3.0 and 3.2 devices)
CUDA Toolkit: CUDA 11.5 or compatible version
Visual Studio: Visual Studio 2019 or later (for Windows development)
NVIDIA Nsight Systems: Optional, for performance profiling and analysis
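
To check whether an installed GPU meets the compute capability requirement, a short query program can help. The following is a minimal sketch rather than the repository's Device_Properties code. It uses the CUDA runtime calls `cudaGetDeviceCount` and `cudaGetDeviceProperties`. The helper `meetsMinimum` and the threshold constants are hypothetical names introduced here for illustration, and the 3.5 threshold is an assumption you should adjust to your toolkit.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper (not part of the repository): returns true when the
// capability (major, minor) is at least (minMajor, minMinor).
bool meetsMinimum(int major, int minor, int minMajor, int minMinor) {
    return major > minMajor || (major == minMajor && minor >= minMinor);
}

int main() {
    int deviceCount = 0;
    cudaError_t err = cudaGetDeviceCount(&deviceCount);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceCount failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }
    if (deviceCount == 0) {
        std::fprintf(stderr, "No CUDA-capable device found.\n");
        return 1;
    }

    // Assumed threshold: 3.5 is the lowest capability that CUDA 11.x
    // toolkits can target. Change these if your setup differs.
    const int kMinMajor = 3;
    const int kMinMinor = 5;

    for (int dev = 0; dev < deviceCount; ++dev) {
        cudaDeviceProp prop;
        err = cudaGetDeviceProperties(&prop, dev);
        if (err != cudaSuccess) {
            std::fprintf(stderr, "cudaGetDeviceProperties(%d) failed: %s\n",
                         dev, cudaGetErrorString(err));
            continue;
        }
        const bool ok = meetsMinimum(prop.major, prop.minor, kMinMajor, kMinMinor);
        std::printf("Device %d: %s, compute capability %d.%d (%s)\n",
                    dev, prop.name, prop.major, prop.minor,
                    ok ? "meets minimum" : "below minimum");
    }
    return 0;
}
```

Running `nvidia-smi` is a quicker way to confirm that the driver sees the GPU at all, but it does not report compute capability by default.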

Building and Running

Windows (Visual Studio)

1. Open the solution file (.sln) in the desired kernel directory.
2. Ensure the CUDA Toolkit is properly installed and configured in Visual Studio.
3. Build the project using Visual Studio (F7 or Build → Build Solution).
4. Run the executable from the output directory.

Command Line (nvcc)

For individual kernels, you can compile using nvcc:

nvcc kernel.cu -o output_name
./output_name
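
For readers who want something to try the commands above on, here is a minimal sketch of a self-contained `kernel.cu`. It is an illustrative vector addition, not the code in the Vector_Addition_Kernel directory. The names `vectorAdd`, `addOnGpu`, and `CUDA_CHECK` are assumptions introduced for this example, as is the block size of 256 threads.

```cuda
#include <cstdio>
#include <cstdlib>
#include <vector>
#include <cuda_runtime.h>

// Illustrative error-checking macro: abort with a message on any CUDA error.
#define CUDA_CHECK(call)                                              \
    do {                                                              \
        cudaError_t e_ = (call);                                      \
        if (e_ != cudaSuccess) {                                      \
            std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,   \
                         cudaGetErrorString(e_));                     \
            std::exit(1);                                             \
        }                                                             \
    } while (0)

// Each thread adds one pair of elements; the bounds check covers the
// final, partially filled block.
__global__ void vectorAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        c[i] = a[i] + b[i];
    }
}

// Hypothetical host wrapper. Assumes a and b have the same length.
std::vector<float> addOnGpu(const std::vector<float>& a,
                            const std::vector<float>& b) {
    const int n = static_cast<int>(a.size());
    std::vector<float> c(n);
    if (n == 0) return c;  // a launch with zero blocks is invalid

    const size_t bytes = static_cast<size_t>(n) * sizeof(float);
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    CUDA_CHECK(cudaMalloc(&dA, bytes));
    CUDA_CHECK(cudaMalloc(&dB, bytes));
    CUDA_CHECK(cudaMalloc(&dC, bytes));
    CUDA_CHECK(cudaMemcpy(dA, a.data(), bytes, cudaMemcpyHostToDevice));
    CUDA_CHECK(cudaMemcpy(dB, b.data(), bytes, cudaMemcpyHostToDevice));

    const int threads = 256;                          // assumed block size
    const int blocks = (n + threads - 1) / threads;   // round up
    vectorAdd<<<blocks, threads>>>(dA, dB, dC, n);
    CUDA_CHECK(cudaGetLastError());                   // catch launch errors

    // cudaMemcpy waits for the kernel to finish before copying.
    CUDA_CHECK(cudaMemcpy(c.data(), dC, bytes, cudaMemcpyDeviceToHost));
    CUDA_CHECK(cudaFree(dA));
    CUDA_CHECK(cudaFree(dB));
    CUDA_CHECK(cudaFree(dC));
    return c;
}

int main() {
    const int n = 1 << 20;
    std::vector<float> a(n), b(n);
    for (int i = 0; i < n; ++i) {
        a[i] = static_cast<float>(i);
        b[i] = 2.0f * static_cast<float>(i);
    }
    const std::vector<float> c = addOnGpu(a, b);

    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        if (c[i] != a[i] + b[i]) ++mismatches;
    }
    std::printf("mismatches: %d of %d\n", mismatches, n);
    return mismatches == 0 ? 0 : 1;
}
```

The grid size is rounded up so that every element gets a thread, which is why the kernel needs the `i < n` guard. On Windows the compiled program is run as `output_name.exe` rather than `./output_name`.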

Performance Profiling

To profile kernels using NVIDIA Nsight Systems:

nsys profile ./your_executable

License

This project is licensed under CC0 1.0 Universal. See the LICENSE file for details.

