Yang Cao's Projects
Easily create a beautiful website using Academic and Hugo
😎 Awesome lists of papers and codes about Large Vision-Language Models
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D
awesome-semantic-segmentation
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
The Code of Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection(CVPR2019)
Deep Compression on AlexNet
Deep Learning Book Chinese Translation
Deep Watershed Transform for Instance Segmentation
The code for the paper "FakeMix and AdaptiveASPP for Transparent Object Detection"
pytorch implementation of group normalization in https://arxiv.org/abs/1803.08494
The script is modified from the old script, the later one is provided by KITTI website (http://www.cvlibs.net/datasets/kitti/raw_data.php). The old version can be easily interrupted when downloading files. The new script helps to solve this problem.
Caffe Implementation of Google's MobileNets (v1 and v2)
This model is trained on SBD dataset.
Python Toolkit
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
This is a fast caffe implementation of ShuffleNet.
Simplified implementations of deep learning related works
An Open Source Machine Learning Framework for Everyone
My personal homepage