杨现's Projects
A profiler to disclose and quantify hardware features on GPUs.
🔊 Text-Prompted Generative Audio Model
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Code for the 食铁兽 (feater.top) FFmpeg 4 beginner tutorial series
A series on GPU optimization that explains in detail how to optimize CUDA kernels. It covers several basic kernels, including elementwise, reduce, sgemv, and sgemm. The performance of these kernels is at or near the theoretical limit.
Set up a Milvus vector database cluster with Docker Stack
A tool for encrypting and decrypting deep learning models
Example code for NEON optimization
The minimal OpenCV for Android, iOS, ARM Linux, Windows, Linux, macOS, WebAssembly
pybind11 bindings for DALI and nvJPEG that accelerate image encoding and decoding for machine learning and deep learning.
Implementation of popular deep learning networks with TensorRT network definition API
A simple C++11 Thread Pool implementation
tvm arm gpu opencl
Config files for my GitHub profile.