♨️ Optimized Gaussian blur filter on CPU.
-
Updated
Dec 12, 2017 - C++
♨️ Optimized Gaussian blur filter on CPU.
Dependence-Based Code Transformation for Coarse-Grained Parallelism
Complex ray tracing algorithm optimized by using parallelization over different partitioning schemes and explore the performance gains through grain size and processing units (parameters) over sequential algorithm to render a high resolution image.
The project utilizes OpenMP to implement parallelism in a large dataset by leveraging multicore processor architectures to concurrently execute code sections, optimizing performance and scalability for efficient database processing
High throughput data-parallel GEMM implementations in Cuda using Cuda cores and Tensor cores
C++ Parallel Image Processing Tool for Dataset of 10K+ Images. Applies multiple OpenCV functions on the whole batch using both Data and Task Parallelism for faster processing.
Add a description, image, and links to the data-parallelism topic page so that developers can more easily learn about it.
To associate your repository with the data-parallelism topic, visit your repo's landing page and select "manage topics."