xmu-xiaoma666's Projects

clip4clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

doragd

A ✨special ✨ repository to show myself on my homepage.

dtnet

The official repository for “Image Captioning via Dynamic Path Customization”.

external-attention-pytorch

🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, intended to help readers understand the corresponding papers more deeply. ⭐⭐⭐

fightingcv-course

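A curated collection of beginner-level theory and hands-on tutorials for deep learning, computer vision, multimodal learning, machine learning, and artificial intelligence.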

fightingcv-paper-reading

⭐⭐⭐ FightingCV Paper Reading, which helps you understand the most advanced research work more easily 🍀 🍀 🍀

imagecaptionmetrics

This repository contains 2 tools:
- A py3 library for NLP and image-caption metrics.
- Code for a two-tailed t-test with paired samples, which reveals whether the difference between two results is significant.
It also completes the evaluation code for SPICE details (*i.e.*, Object, Relation, Attribute, Color, Count, and Size).

lstnet

Towards Local Visual Modeling for Image Captioning

mfm

An official implementation for "Knowing What to Learn: A Metric-Oriented Focal Mechanism for Image Captioning"

sdatr

Official code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM 2022)

swin-transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

visualizer

Helper tools for attention visualization in deep learning

x-clip

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

x-dreamer

A PyTorch implementation of “X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation”

x-llm

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

x-mesh

A PyTorch implementation of “X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance”

yoloair

🔥🔥🔥 YOLOAir: Includes YOLOv5, YOLOv7, Transformer, YOLOX, YOLOR and other networks... Supports improvements to the backbone, head, loss, IoU, NMS... The original version was built on YOLOv5.
