Coder Social home page Coder Social logo

ttp's Introduction

Hi there ๐Ÿ‘‹

  • ๐Ÿ”ญ My research interests are robust visual perception by understanding and explaining AI behavior through adversarial machine learning, temporal perception, representation learning (self-supervision, self-distillation, self-critique), and configuring the role of language models (LLMs) in building visual AI systems.
  • ๐ŸŒฑ You are welcome to explore my research work along with the provided code below. Seven of the papers are accepted as Oral/Spotlight at ICLR, NeurIPS, AAAI, CVPR, BMVC, and ACCV.
  • ๐Ÿ“ซ How to reach me: [email protected]
  • โšก Fun fact: I am really into fitness and thinking of joining the GYM for quite some time now ๐Ÿ˜„

๐ŸŒฑ Repositories

Topic Application Paper Repo Venue
Vision-Language Learning Composed Video Retrieval Composed Video Retrieval via Enriched Context and Discriminative Embeddings composed-video-retrieval CVPR'24
Self-supervision Multi-Spectral Satellite Imagery Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery satmae_pp CVPR'24
Vision-Language Learning Video grounding Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding Video-GroundingDINO CVPR'24
Vision-Language Learning Language Driven VLM for Remote Sensing Geochat: Grounded large vision-language model for remote sensing GeoChat CVPR'24
Vision-Language Learning Leaverging LLM to generate complex scenes (Zero-Shot) LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts llmblueprint ICLR'24
Self-supervision Self-structural Alignment of Foundational Models (Zero-Shot) Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment S3A AAAI'24-Oral
Vision-Language Learning Test-Time Alignment of Foundational Models (Zero-Shot) Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization PromptAlign NeurIPS'23
Vision-Language Learning Regulating Foundational Models Self-regulating Prompts: Foundational Model Adaptation without Forgetting PromptSRC ICCV'23
Network Engineering Video Recognition Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition Video-FocalNets ICCV'23
Vision-Language Learning Face Anti-spoofing FLIP: Cross-domain Face Anti-spoofing with Language Guidance FLIP ICCV'23
3D Medical Segmentation Adversarial Training Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation VAFA MICCAI'23
Vision-Language Learning Facial Privacy CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search Clip2Protect CVPR'23
Vision-Language Learning Video Recognition (Zero-shot) Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting Vita-CLIP CVPR'23
Prompt learning Image Recognition (Category Discovery) PromptCAL for Generalized Novel Category Discovery PromptCAL CVPR'23
Prompt learning Adversarial Attack Boosting Adversarial Transferability using Dynamic Cues DCViT-AT ICLR'23
Self-supervision Video Recognition Self-Supervised Video Transformer SVT CVPR'22-Oral
Contrastive learning Adversarial Defense Stylized Adversarial Training SAT IEEE-TPAMI'22
Self-supervision Adversarial Attack Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations ARP BMVC'22-Oral
Self-supervision Image Recognition How to Train Vision Transformer on Small-scale Datasets? VSSD BMVC'22
Self-distillation Image Recognition (Domain Generalization) Self-Distilled Vision Transformer for Domain Generalization SDViT ACCV'22-Oral
Attention Analysis Understanding Vision Transformer Intriguing Properties of Vision Transformers IPViT NeurIPS'21-Spotlight
Self-ensemble Adversarial Attack On Improving Adversarial Transferability of Vision Transformers ATViT ICLR'21-Spotlight
Distribution matching Adversarial Attack On Generating Transferable Targeted Perturbations TTP ICCV'21
Contrastive learning Image Recognition Orthogonal Projection Loss OPL ICCV'21
Self-supervision Adversarial Defense A Self-supervised Approach for Adversarial Robustness NRP CVPR'20-Oral
Relativistic optimization Adversarial Attack Cross-Domain Transferability of Adversarial Perturbations CDA NeurIPS'19
Gradient Smoothing Adversarial Defense Local Gradients Smoothing: Defense Against Localized Adversarial Attacks LGS WACV'19

ttp's People

Contributors

muzammal-naseer avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

ttp's Issues

Our recent work on transferable targeted perturbations

Hi Muzammal,

Firstly congratulations on the acceptance of TTP to ICCV 2021. Here I am pleased to bring into view our recent work on transferable targeted perturbations (https://arxiv.org/abs/2012.11207), where we provided a very simple baseline iterative attack that can achieve SOTA performance.
We also compared this simple baseline to TTP in the โ€œ10-Targets (all-source)โ€ setting and found that it can achieve slightly better results.

I will appreciate it if you could add this work to your list.

Best,
Zhengyu

About the choice of source domain data

Hi Muzammal,
As mentioned in your paper, your source domain data are 50k random images from ImageNet train set.
Do these images include the target category you want to attack?
Thanks a lot~

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.