GPU-acceleration routines for DifferentialEquations.jl and the broader SciML scientific machine learning ecosystem
Updated Aug 9, 2024 - Julia
CUDA implementations of the merge sort and bitonic sort algorithms for sorting large arrays in parallel on the GPU. Includes both CPU and GPU versions, along with a performance comparison.
Optimized Parallel Sum program demonstrating CPU vs GPU performance
Scaling U-Net in TensorFlow
Introduction to the concept of automatic experiment parallelization