Coder Social home page Coder Social logo

jason-qiu / mmwatermark-robustness Goto Github PK

View Code? Open in Web Editor NEW
5.0 1.0 0.0 280.43 MB

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Home Page: https://mmwatermark-robustness.github.io/

Python 23.31% Jupyter Notebook 75.79% Shell 0.09% Dockerfile 0.01% CSS 0.01% C 0.04% C++ 0.12% Cuda 0.58% HTML 0.01% JavaScript 0.04% Cython 0.01%

mmwatermark-robustness's Introduction

MMWatermark_Robustness

The official codebase for our paper "Evaluating Durability: Benchmark Insights into Multimodal Watermarking".

Jielin Qiu*, William Jongwon Han*, Xuandong Zhao, Shangbang Long, Christos Faloutsos, Lei Li.

More details can be found on the project webpage.

Citation

If you feel our code or models helps in your research, kindly cite our paper:

@inproceedings{Qiu2024EvaluatingDB,
  title={Evaluating Durability: Benchmark Insights into Multimodal Watermarking},
  author={Jielin Qiu and William Han and Xuandong Zhao and Shangbang Long and Christos Faloutsos and Lei Li},
  journal={arXiv preprint arXiv:2406.03728},
  year={2024}
}

Getting Started

We generally recommend the following pipeline:

  1. Generate text and images utilizing multimodal models.
  2. Watermark generated text and images.
  3. Perturb watermarked text and images.
  4. Detect perturbed, watermarked text and image.

We will now go a bit more in depthon how to do each step.

Environments

In our study, we follow the existing codebases for comprehensive benchmarking.

We recommend creating separate environments for each multimodal model and watermarking method. All perturbations (Text and Image) can be done through one environment.

One thing to note is that some of the links are not in fact repositories but Hugging Face tutorials on how to utilize the models. For such models, we experienced that downloading the latest transformers version works well. However, if there are any errors utilizing multiple multimodal models with a singular environment, please feel free to create another environment.

We provide the link to all of the necessary repositorys for this project. Please carefully follow their environment settings and generate, watermark, perturb in separate environments. We thank all of the repositories as well for open sourcing their code.

Type Link
Multimodal Model NExT-GPT
Multimodal Model RPG
Multimodal Model LCMs
Multimodal Model Kandinsky
Multimodal Model PIXART
Multimodal Model SDXL-Lightning
Multimodal Model DALLE3
Multimodal Model Stable Diffusion
Multimodal Model Fuyu-8B
Multimodal Model InternLM-XComposer
Multimodal Model InstructBLIP
Multimodal Model LLaVA 1.6
Multimodal Model MiniGPT-4
Multimodal Model mPLUG-Owl2
Multimodal Model Qwen-VL
Watermark KGW
Watermark KTH
Watermark Blackbox
Watermark Unigram
Watermark DwtDctSvd
Watermark RivaGAN
Watermark SSL
Watermark Stega Stamp
Image and Text Perturbations MM_Robustness

COCO Dataset

Please download the COCO validation split from the official website cocodataset. You can download images-val2017 and annotations-val2017.

If for some reason there is a problem with the link, a copy of the data can be found here.

Then move the data into the COCO folder. the coco.py file is the data loader used to iterate through the data.

Multimodal Models and Generation

All multimodal models used in this study is available in the mm_model directory. We do want to note that not all models had a Github repository, however, we still provide an example of how to utilize the model for text or image generation.

Additionally, we want to note that some of the models on Hugging Face are fairly large. We recommend to set the model download cache path to a specific folder on your local machine that has enough memory.

Watermarking

All watermarks are in the watermark directory. After setting up their respective enironments and having already generated the text or images, please proceed to watermark all generated texts or images.

Perturbations

All perturbations are in the perturbation directory. After setting up the perturbation evironment from the perturbations/MM_Robustness repository, please proceed to perturb all of the watermarked images or text. Additionally, inside the perturbation directory, the image_perturb.py and text_perturb.py files contain all of the needed image and text perturbations for this study.

Detection

Due to each watermarking method having their own way of detection, we provide an example pipeline of detecting watermarks. Please view them to see examples of how to detect them. We also provide the calculation of the other metrics as well (e.g., ROUGE, PSNR, etc.).

License

This project is licensed under CC BY-NC-SA License.

Contact

If you have any questions, please contact [email protected], [email protected].

mmwatermark-robustness's People

Contributors

jason-qiu avatar

Stargazers

Wenhao Ding avatar Jiacheng Zhu avatar  avatar  avatar Zuxin avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.