Stars
2
Forks
0
Language
Cuda
Last Updated
Mar 05, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Cuda | 2 | Sample codes for parallel programming using OpenMP on CPU and CUDA on GPU | Nov 16, 2022 | |
C++ | 8 | SHA-3 cpu and gpu (CUDA) calculation | May 09, 2022 | |
Cuda | 10 | GPU Parallel Computing software solution examples with CUDA | Apr 30, 2023 | |
C | 69 | GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV | May 14, 2023 | |
C++ | 51 | A Massively Parallel FFT Library for CPU/GPU | Nov 21, 2022 | |
Jupyter Notebook | 4 | GPU-enabled docker container with Jupyterlab for artificial intelligence | Mar 24, 2023 | |
Jupyter Notebook | 2 | GPU-enabled docker container with Jupyterlab for artificial intelligence | May 26, 2023 | |
Python | 12 | High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc. :sunny: | Aug 31, 2022 | |
C++ | 3 | Large hybrid CPU/GPU sorting network using CUDA and MPI | Jan 25, 2022 | |
C++ | 2 | Tests for object recognition using CUDA/GPU and with CPU | Dec 05, 2018 | |
Cuda | 3 | FIR filter implemented both on CPU and GPU with CUDA | Aug 25, 2022 | |
Julia | 31 | Parallel CPU and GPU high-performance computing - short course | Jul 27, 2022 | |
C++ | 2 | Parallel implementation of NW algorithms with NVIDIA GPU and CUDA C++ | Mar 11, 2023 | |
Cuda | 7 | Parallel Simulated annealing in GPU using CUDA (used for floorplanning problem) | Mar 08, 2023 | |
Jupyter Notebook | 9 | A notebook testing CPU speed vs GPU speed with Pytorch and CUDA | Jul 22, 2022 | |
C++ | 6 | A Turbulent CFD solver on CPU and GPU using CUDA and Vulkan | Apr 13, 2023 | |
C | 12 | GPU-accelerated image morphology written in C++/CUDA (100x faster than CPU!) | Dec 22, 2019 | |
C++ | 2 | Shows different programming techniques for parallel computing on CPU and GPU | May 16, 2020 | |
Julia | 22 | Proof of Concept: a C-callable GPU-enabled parallel 2-D heat diffusion solver written in Julia using … | Dec 20, 2022 | |
Python | 2 | An example for building CPU/GPU agnostic code in Python and C++/CUDA | Dec 22, 2023 | |
Shell | 299 | Scripts to setup a GPU / CUDA-enabled compute server with libraries for deep learning | Dec 01, 2021 | |
Shell | 2 | Scripts to setup a GPU / CUDA enabled compute server with libraries for deep learning | Mar 17, 2016 | |
C++ | 8 | Parallel, highly efficient code (CPU and GPU) for DEM and CFD-DEM simulations. | Mar 24, 2023 | |
C++ | 176 | Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, … | Aug 03, 2022 | |
Python | 11 | Fast Numba-enabled CPU and GPU computations of Earth Mover's (scipy.stats.wasserstein_distance) and Euclidean distances. | Oct 04, 2022 | |
Cuda | 3 | Machine Learning and Deep Learning in CUDA C++14 for CUDA 9, CUDNN 7 and above; … | Feb 13, 2020 | |
Cuda | 15 | optimized realtime harmonic/percussive source separation using the GPU (NVIDIA CUDA) and CPU (Intel IPP) | Mar 13, 2023 | |
Python | 10 | Tools for simple inference testing using TensorRT, CUDA and OpenVINO CPU/GPU and CPU providers. Simple … | Jun 22, 2022 | |
Cuda | 2 | Huffman encoding and decoding with simple Serial and Parallel implementations for CPU and GPU | Dec 01, 2023 | |
C++ | 1080 | Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning | Jul 18, 2022 | |
C++ | 32 | Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning | Jul 09, 2022 | |
C++ | 2 | Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning | Aug 06, 2019 | |
Python | 82 | a finite element solver based on Taichi, being parallel (CPU/GPU), portable and open-source | May 17, 2023 | |
FORTRAN | 3 | Fast Monte Carlo tool for modeling indirect x-ray detectors using CPU and GPU in parallel | Jan 28, 2020 | |
Jupyter Notebook | 7 | compare training duration of CNN with CPU (i7 8550U) vs GPU (mx150) with CUDA depending … | Oct 12, 2022 | |
C | 6 | quickLD, a large-scale, long-range Linkage Disequilibrium analysis tool utilizing parallel CPU, GPU or heterogeneous computing. | Jun 10, 2022 | |
C++ | 2 | You can test Keras training weights in CPP environment with CPU or GPU | Oct 04, 2023 | |
Cuda | 3 | Cuda-based matrix multiplication, compared with cuBLas performance. Refer to the https://github.com/NervanaSystems/maxas/wiki/SGEMM | Aug 23, 2021 | |
Cuda | 4 | Test suite for probing the numerical behavior of NVIDIA tensor cores | May 18, 2022 | |
Cuda | 4 | NTLM password cracker using TMTOs, optimized for GPU computation. | May 27, 2022 | |
Cuda | 4 | Parallel Implementation of SM4-CTR Algorithm based on General Computing Platform. 基于CUDA通用GPU平台的SM4-CTR算法并行化实现。利用本地GPU资源,进行CTR工作模式下SM4算法高速加解密的并行实现和优化方案。 | May 20, 2022 | |
Cuda | 4 | None | Jul 04, 2022 | |
Cuda | 4 | A way to compute PCA through CUDA and GPU | May 30, 2022 | |
Cuda | 4 | None | Apr 16, 2022 | |
Cuda | 4 | 3d visualizer, using raymarching technology to draw vectorized primitives | Feb 12, 2022 | |
Cuda | 4 | Introduction to Parallel Programming class code | Feb 28, 2020 | |
Cuda | 4 | RCCL Performance Benchmark Tests | Nov 16, 2021 | |
Cuda | 4 | Simple starter CMake project that uses NVBench. | Jul 30, 2022 | |
Cuda | 4 | Gridding for the square kilometer array using GPUs | Sep 27, 2021 | |
Cuda | 4 | Implementations of two types of quotient filters using GPUs | Mar 30, 2022 |