Hangbo Bao's Projects
A Jekyll-based resume template
Algorithm Course Lab
Software dummy display adapter for Apple Silicon Macs to enable custom HiDPI resolutions.
The most complete database of classical Chinese poetry: nearly 14,000 poets from the Tang and Song dynasties, with roughly 55,000 Tang poems and 260,000 Song poems, plus 1,564 ci poets and 21,050 ci poems from the Song period.
Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization
The implementation of DeBERTa
initial
Instructions for installing the NVIDIA driver, CUDA, and cuDNN
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks.
Course from https://www.coursera.org/learn/machine-learning/home/welcome
Ongoing research training transformer language models at scale, including: BERT
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training https://arxiv.org/pdf/2001.04063.pdf
A proxy IP pool for Python web crawlers
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
A simple crawler based on C#
A latent text-to-image diffusion model
TensorFlow study code
Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking the Positional Encoding in Language Pre-training".
UniLM AI - Unified "Language" Model Pre-training across Tasks, Languages, and Modalities
Official code for the Cross-Covariance Image Transformer (XCiT)