lecooo
Type: User
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
Code for our MM'23 paper "Learning Event-Specific Localization Preferences for Audio-Visual Event Localization"
A curated list of audio-visual learning methods and datasets.
Resources for Multiple Object Tracking (MOT)
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
A collection of super-resolution papers, data, and repositories.
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2 and DFDNet.
Use ChatGPT to summarize the arXiv papers.
Cross-Modal Background Suppression for Audio-Visual Event Localization
Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
[2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows, CVPR 2022
D2Net
Deep learning for image processing, including classification and object detection.
A paper list of object detection using deep learning.
NeurIPS'2023 official implementation code
Download files from Google Drive using Python 2 or Python 3
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
Object Tracking by Jointly Exploiting Frame and Event Domain (ICCV 2021)
Change the background color of ID photos.
This repository is an official PyTorch implementation of the paper "Lightweight Bimodal Network for Single-Image Super-Resolution via Symmetric CNN and Recursive Transformer". (IJCAI 2022)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MindSpore-YOLOv7