Aaron Han's Projects

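github-chinese-top-charts

:cn: GitHub Chinese Top Charts: helps you discover highly rated, high-quality Chinese-language projects and absorb the best experience and work of Chinese developers more efficiently. The charts are updated weekly, so stay tuned! (Wishing everyone a happy Spring Festival in advance, and smooth travels during the Spring Festival travel rush!)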

glance-focus

This repo contains the source code for "Glance and Focus: Memory Prompting for Multi-Event Video Question Answering" (accepted at NeurIPS 2023)

invreg

Invariant Feature Regularization for Fair Face Recognition (ICCV'23)

jaanet

ECCV 2018 "Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment"

lavis

LAVIS - A One-stop Library for Language-Vision Intelligence

llava

[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.

llm-adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

llovi

Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"

me-graphau

[IJCAI 2022] Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition, PyTorch code

minigpt-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

movie_knowledge_graph_app

Movie knowledge graph app, mainly covering entity recognition, entity queries, relation queries, graph display, and intelligent question answering.

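musicrecommend

:star: Undergraduate graduation project: design and development of a content-based music recommendation system. The model training code is built with the PyTorch framework, and the front end and back end are built with Django.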

next-gqa

Can I Trust Your Answer? Visually Grounded VideoQA (Accepted to CVPR'24)

prophet

Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

psvl

Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).

pytorch-jaanet

PyTorch implementation of JAA-Net including both ECCV version and IJCV version

sam-textvqa

Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.

serve

Serve PyTorch models in production

sevila

Self-Chained Image-Language Model for Video Localization and Question Answering
