- š Iām interested in perception and prediction in computer vision, such as Multi-object Tracking. I'm also interested in the inference acceleration of neural networks in edge devices.
- š± I'm currently exploring multimodal large language models.
- š« How to reach me: zhihu
lingyvkong / vary Goto Github PK
View Code? Open in Web Editor NEWThis project forked from ucas-haoranwei/vary
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.