royinx's Projects
Bilinear Image Resize with openmp/cuda
blog script repo
clothes segmentation
color clustering
resize image in (CUDA, python, cupy)
CUDA Templates for Linear Algebra Subroutines
CV-CUDAโข is an open-source, graphics processing unit (GPU)-accelerated library for cloud-scale image processing and computer vision.
exmaple / demo hierarchy for go
build java mini container image
Jetpack4.5 Triton server
Kernel Tuner
LFD is a big update upon LFFD. Generally, LFD is a multi-class object detector characterized by lightweight, low inference latency and superior precision. It is for real-world appilcations.
A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle detector......
LLM in Triton , Hugging Face -> Pytorch -> ONNX -> TensorRT -> Triton
MSBD5001-kaggle
Instant neural graphics primitives: lightning fast NeRF and more
Nsight Systems in Docker
Deep learning installation when new ubuntu installed
simple API , including Flask and FastAPI
blank flask template for quick start and testing
Model (ONNX, Pytorch) to TensorRT inference server
triton server ensemble model demo
tensorRT_yolov3
Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions