Coder Social home page Coder Social logo

kodf's Introduction

KoDF

Abstract

A variety of effective face-swap and face-reenactment methods have been publicized in recent years, democratizing the face synthesis technology to a great extent. Videos generated as such have come to be called deepfakes with a negative connotation, for various social problems they have caused. Facing the emerging threat of deepfakes, we have built the Korean DeepFake Detection Dataset (KoDF), a large-scale collection of synthesized and real videos focused on Korean subjects. In this paper, we provide a detailed description of methods used to construct the dataset, experimentally show the discrepancy between the distributions of KoDF and existing deepfake detection datasets, and underline the importance of using multiple datasets for real-world generalization. KoDF is publicly available at https://moneybrain-research.github.io/kodf in its entirety (i.e. real clips, synthesized clips, clips with adversarial attack, and metadata).

Download

Please fill out this form to download KoDF. If your request has been approved, we will send you the download link. Koreans should download KoDF from https://aihub.or.kr/aidata/8005. If you have any questions, please contact us at [email protected].

Acknowledgement

We gratefully acknowledge that KoDF was built as part of the AI Training Data Construction Project 2020 hosted by the Ministry of Science and ICT (MSIT) and supported by the National Information Society Agency (NIA) of South Korea. This research was partly supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by MSIT (2021-0-00888).

Citation

@InProceedings{Kwon_2021_ICCV,
    author    = {Kwon, Patrick and You, Jaeseong and Nam, Gyuhyeon and Park, Sungwoo and Chae, Gyeongsu},
    title     = {KoDF: A Large-Scale Korean DeepFake Detection Dataset},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {10744-10753}
}

kodf's People

Contributors

luaperl avatar

Watchers

 avatar

kodf's Issues

Dataset size problem

Thank you for your outstanding contribution and excellent work. Could you provide a smaller dataset sampled from the original dataset? The entire dataset, which is 2.6TB, exceeds the capacity of a standard server. Alternatively, could it be split and compressed so that it can be downloaded in parts?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.