Stars
2
Forks
0
Language
Cuda
Last Updated
Feb 23, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 5 | Toy package for custom CUDA kernels + JAX | Oct 03, 2022 | |
Jupyter Notebook | 63 | CUDA kernels for generalized matrix-multiplication in PyTorch | Sep 05, 2022 | |
Python | 7 | Implement custom operators in PyTorch with cuda/c++ | Mar 16, 2023 | |
Julia | 7 | Winograd Convolution in Julia with CUDA kernels | Oct 20, 2020 | |
Cuda | 8 | An example repository of PyTorch BlockSparse implementation with custom CUDA extension | Jun 21, 2022 | |
Python | 42 | custom cuda kernel for {2, 3}d relative attention with pytorch wrapper | Jan 28, 2023 | |
None | 2 | Tutorial for building a custom CUDA function for Pytorch | May 13, 2021 | |
Python | 452 | Tutorial for building a custom CUDA function for Pytorch | May 03, 2023 | |
C | 2 | 8-bit CUDA functions for PyTorch, modified to build on Jetson Xavier | May 10, 2023 | |
Vim script | 2 | vim/nvim configuration with proper documentation | Jul 25, 2022 | |
None | 2 | How to write cuda kernels or c functions in pytorch, especially for former caffe users. | Aug 03, 2021 | |
None | 6 | Docker Images with proper dependencies for building ROMs/Kernels/Android Apps | Feb 17, 2023 | |
C++ | 2 | Vector math and other CUDA helper functions for OptiX kernels | Jan 18, 2023 | |
Python | 261 | Extending JAX with custom C++ and CUDA code | Apr 24, 2023 | |
Python | 2 | PyTorch - FID calculation with proper image resizing and quantization steps | Jan 09, 2022 | |
C++ | 7 | Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner | May 02, 2023 | |
C | 6 | Simple examples for PTX code extraction from CUDA and OpenCL kernels. | Sep 03, 2021 | |
C++ | 5 | CUDA Custom Buffers and example blocks | Aug 17, 2022 | |
Python | 3 | Template for CUDA / C++ extension writing with PyTorch | Jun 25, 2021 | |
Python | 10 | Write documentation in comments and render it with templates. | Feb 02, 2022 | |
Cuda | 85 | tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF) | Aug 30, 2022 | |
Ruby | 887 | Pretty documentation generator for Github projects with proper Readme. | Sep 04, 2022 | |
JavaScript | 3 | Pretty documentation generator for Github projects with proper Readme. | Sep 17, 2014 | |
Python | 5 | Reproduce and improve ChexNet by Python Pytorch,CUDA | Nov 05, 2021 | |
Cuda | 15 | Matrix exponential in cuda for pytorch and tensorflow | Mar 24, 2021 | |
Python | 10 | Pytorch Implementation of RetinaNet with CUDA accelerate nms operation. | Jun 30, 2022 | |
C | 69 | GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV | May 14, 2023 | |
Python | 2 | Instagram scraper with proper pagination, that can collect posts, likes, comments and a lot more. | Feb 26, 2024 | |
Jupyter Notebook | 9 | A notebook testing CPU speed vs GPU speed with Pytorch and CUDA | Jul 22, 2022 | |
Cuda | 3 | Cuda-based matrix multiplication, compared with cuBLas performance. Refer to the https://github.com/NervanaSystems/maxas/wiki/SGEMM | Aug 23, 2021 | |
Cuda | 4 | Test suite for probing the numerical behavior of NVIDIA tensor cores | May 18, 2022 | |
Cuda | 4 | NTLM password cracker using TMTOs, optimized for GPU computation. | May 27, 2022 | |
Cuda | 4 | Parallel Implementation of SM4-CTR Algorithm based on General Computing Platform. 基于CUDA通用GPU平台的SM4-CTR算法并行化实现。利用本地GPU资源,进行CTR工作模式下SM4算法高速加解密的并行实现和优化方案。 | May 20, 2022 | |
Cuda | 4 | None | Jul 04, 2022 | |
Cuda | 4 | A way to compute PCA through CUDA and GPU | May 30, 2022 | |
Cuda | 4 | None | Apr 16, 2022 | |
Cuda | 4 | 3d visualizer, using raymarching technology to draw vectorized primitives | Feb 12, 2022 | |
Cuda | 4 | Introduction to Parallel Programming class code | Feb 28, 2020 | |
Cuda | 4 | RCCL Performance Benchmark Tests | Nov 16, 2021 | |
Cuda | 4 | Simple starter CMake project that uses NVBench. | Jul 30, 2022 | |
Cuda | 4 | Gridding for the square kilometer array using GPUs | Sep 27, 2021 | |
Cuda | 4 | Implementations of two types of quotient filters using GPUs | Mar 30, 2022 | |
Cuda | 4 | A C and CUDA implementation of tabulating linear regression for an exhaustive pairwise interaction search … | Aug 18, 2020 | |
Cuda | 4 | Finding strings that get md5'ed to php type juggleable hash | Jul 13, 2022 | |
Cuda | 4 | Numerical solution for the movement of a magnetic pendulum | Feb 02, 2021 | |
Cuda | 4 | Implementation of Neuroevolution of Augmenting Topologies(NEAT) using a distributed architecture for training ANNs with Forex … | Jul 16, 2022 | |
Cuda | 4 | Playground to test different approaches for CUDA kernel performance | Aug 22, 2016 | |
Cuda | 5 | TenTrans High-Performance Inference Toolkit | Jun 10, 2022 | |
Cuda | 5 | None | Jun 19, 2022 | |
Cuda | 5 | Using CUDA to implement "Raytracing in one weekend" by Peter Shirley | Jun 11, 2022 |