Coder Social home page Coder Social logo

Hi!👋 I am Siteng Huang (黄思腾 in Chinese). I am about to join DAMO Academy in Hangzhou.

Currently, My research has centered on multi-modal large models, especially vision-language models (VLMs). I have published 20+ papers Google Scholar at the top international AI conferences. If you are seeking any form of academic cooperation, please feel free to email me at [email protected].

I received my Ph.D. degree from Zhejiang University in June 2024, affiliated with a joint program with Westlake University at Machine Intelligence Laboratory (MiLAB) and advised by Prof. Donglin Wang. Before that, I received my B.Eng. Degree from School of Computer Science, Wuhan University in June 2019. Please refer to my full paper list at my personal homepage.

Twitter GitHub GitHub

### 📢 News
  • 2024/07/16 [MM'24] One paper (ProFD) got accepted for ACM MM 2024.
  • 2024/07/09 [Scholar'24] 2024 Scholar Metrics was released by Google Scholar. Our paper "DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting" ranked 7th of the CIKM 2019 conference according to the citations, and 13th within five years.
  • 2024/07/01 [ECCV'24] Two papers (PiTe and QUAR-VLA) got accepted for ECCV 2024.
  • 2024/06/04 [Graduation] I successfully defended my dissertation. So many thanks to my Ph.D. committee (Prof. Xiaogang Jin, Prof. Mai Xu, Prof. Changxin Gao, Prof. Fajie Yuan, Prof. Peidong Liu, Prof. Xiaofei Li) and my advisor!
  • 2024/03/29 [VALSE'24] Troika got accepted as VALSE 2024 Poster! 2024/05/05 Our Cobra was selected for VALSE 2024 Annual Progress Representation. Thanks to all the committee for the approval!
  • 2024/03/21 [Preprint] Cobra, an efficient multi-modal large language model, was released. Project page has been available. The paper has been featured by Hugging Face Daily Papers! Demo has been available!
  • 2024/03/13 [ICME'24] One paper about parameter-efficient tuning for visual grounding got accepted for ICME 2024 (Oral).
  • 2024/02/27 [Award] Awarded as Zhejiang University 2024 Outstanding Graduates!
  • 2024/02/27 [CVPR'24] Three papers (ADI, Troika, SimM) as first/co-first author got accepted for CVPR 2024. Congratulations to all collaborators!
  • 2023/12/13 [ICASSP'24] One paper (VGDiffZero) on diffusion model-based zero-shot visual grounding got accepted for ICASSP 2024. Congratulations to all collaborators!
  • 2023/12/09 [AAAI'24] One paper on VLM-based unsupervised domain adaptation got accepted for AAAI 2024.
  • 2023/04/02 [ICMR'23] One paper (RL-CZSL) about reference-limited compositional learning got accepted for ICMR 2023. Congratulations to all collaborators!
  • 2023/02/28 [CVPR'23] One paper (VoP) about parameter-efficient text-video retrieval got accepted for CVPR 2023. Congratulations to all collaborators!

Kyon Huang's Projects

actionbench icon actionbench

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation (CVPR 2024)

agam icon agam

Code for the AAAI 2021 paper "Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition".

algorithms-notes icon algorithms-notes

《算法(第4版)》笔记及代码 | 《Algorithms(Fourth Edition)》notes & code

biliinfocrawler icon biliinfocrawler

基于 Java 的 BiliBili 视频信息爬虫(可能已经失效) | BiliBili video crawler based on Java

blog icon blog

个人博客 | Personal blog

calculator icon calculator

学习编译原理写出的一个简单计算器

cs330-notes icon cs330-notes

:memo:Notes in Chinese for CS330 at Stanford: Deep Multi-Task and Meta Learning (Fall 2019)

dsanet icon dsanet

Code for the CIKM 2019 paper "DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting".

hung-yi-lee-ml-notes icon hung-yi-lee-ml-notes

:memo:《李宏毅机器学习》课程笔记(暂停更新) | Notes for Hung-yi-Lee Machine Learning Spring 2019 (Suspension)

jdbc_management icon jdbc_management

基于Spring Boot + Mysql + Vue + BootStrap 开发的教务管理系统

leetcode-everyday icon leetcode-everyday

每天一题 LeetCode | workin' everyday~Hustle everyday~LeetCode everyday~yuh yuh yuh

ml-beginning-projects icon ml-beginning-projects

基础的机器学习项目集,包含数据预处理、模型评估与选择、可视化以及分类算法等

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.