yangcaoai

followers: 49 following: 118 repos: 25 gists: 0

Name: Yang Cao

Type: User

Company: @HKUST

Bio: Machine learning and computer vision

Twitter: yangcao_

Blog: https://yangcaoai.github.io/

Hi there, I'm Yang Cao👋

About Me

🎓 I'm currently a Ph.D. student in CSE at HKUST, supervised by Prof. Dan Xu.
🔭 Before that, I received my Master's Degree from Nankai University, supervised by Prof. Ming-Ming Cheng.
🌱 My recent research interests mainly focus on 3D vision, open-vocabulary perception and multi-modal learning.
👯 I’m looking to collaborate on related topics.
📫 How to reach me: Email: [email protected]

Yang Cao's Projects

academic-kickstart

Easily create a beautiful website using Academic and Hugo

awesome-large-vision-language-models

😎 Awesome lists of papers and codes about Large Vision-Language Models

awesome-llm-3d

Awesome-LLM-3D: a curated list of resources on Multi-modal Large Language Models in the 3D world

awesome-open-vocabulary-perception

😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D

awesome-semantic-segmentation

awesome-talking-head-generation

coda_neurips2023

Official code for the NeurIPS 2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

contrastprior

Code for "Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection" (CVPR 2019)

deep-compression-alexnet

Deep Compression on AlexNet

deeplearningbook-chinese

Deep Learning Book Chinese Translation

dwt

Deep Watershed Transform for Instance Segmentation

fanet

The code for the paper "FakeMix and AdaptiveASPP for Transparent Object Detection"

groupnormalization

PyTorch implementation of Group Normalization (https://arxiv.org/abs/1803.08494)

kitti-download-scipt

This script is modified from the original download script provided by the KITTI website (http://www.cvlibs.net/datasets/kitti/raw_data.php). The original version is easily interrupted while downloading files; this version addresses that problem.

lol_prediction

mobilenet-caffe

Caffe Implementation of Google's MobileNets (v1 and v2)

my-page

objectdetection

This model is trained on the SBD dataset.

pytools

Python Toolkit

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

shufflenet

This is a fast Caffe implementation of ShuffleNet.

simplified-deeplearning

Simplified implementations of deep learning related works

tensorflow

An Open Source Machine Learning Framework for Everyone

yangcaoai

yangcaoai.github.io

My personal homepage
