vhzy Goto Github PK

followers: 2.0 following: 20.0 repos: 98.0 gists: 0.0

Name: Aaron Han

Type: User

Bio: AI phd condidate

Blog: 知乎：https://www.zhihu.com/people/kai-h

Aaron Han's Projects

github-chinese-top-charts

:cn: GitHub中文排行榜，帮助你发现高分优秀中文项目、更高效地吸收国人的优秀经验成果；榜单每周更新一次，敬请关注！（提前祝贺大家春节快乐，春运一路畅通！）

glance-focus

This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)

hfut_course_report_template

合肥工业大学课程设计 LaTeX 模板

invreg

Invariant Feature Regularization for Fair Face Recognition (ICCV'23)

jaanet

ECCV 2018 "Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment"

latex-template-cn

\LaTeX 中文模版收集。

lavis

LAVIS - A One-stop Library for Language-Vision Intelligence

llava

[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.

llm-adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

llovi

Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"

lstp-chat

A Video Chat Agent with Temporal Prior

machine-learning-interview

算法工程师-机器学习面试题总结

markdown-notes

me-graphau

[IJCAI 2022] Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition, Pytorch code

minigpt-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

movie_knowledge_graph_app

电影知识图谱，主要包括实体识别、实体查询、关系查询以及智能问答等。movie knowledge graph(Entity identification, graph display, and intelligent question and answer)

musicrecommend

:star: 本科毕业设计：基于内容的音乐推荐系统设计与开发。使用了Pytorch框架构建训练模型代码，使用Django构建了前后端。

next-gqa

Can I Trust Your Answer? Visually Grounded VideoQA (Accepted to CVPR'24)

notes

prophet

Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

psvl

Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).

pythia

python_for_data_analysis_2nd_chinese_version

《利用Python进行数据分析·第2版》

pytorch-jaanet

PyTorch implementation of JAA-Net including both ECCV version and IJCV version

pytorch_learning

sam-textvqa

Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.

serve

Serve PyTorch models in production

sevila

Self-Chained Image-Language Model for Video Localization and Question Answering

vhzy Goto Github PK

Aaron Han's Projects

Recommend Projects

Recommend Topics

Recommend Org