robo-alex / dreamdance

DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer

License: MIT License

Jupyter Notebook 92.61% Python 7.32% Shell 0.07%

Results of Pipeline 1

dance_1

dance_2

orange_justice_1

orange_justice_2

The motion transfer is quite successful, even when the character in the reference video performs large motions such as dancing and rotating.

Note that, because of limited computing resources, we generated the imitation videos only at low resolution. Even so, the motion imitation performs well.

Results of Pipeline 2

Input images of prompt: miguel playing guitar on the street, pixar, cartoon, high quality, full body, single person

input_guitar

Output video

output_guitar

Input images of prompt: miguel running in a forest, pixar, cartoon, green eyes, red hat, high quality, standing, full body, single person

input_running

Output video

output_running

Input images of prompt: miguel in a forest, pixar, cartoon, green eyes, red hat, high quality, standing, full body, single person

input_2

Output video

output_2

Input images of prompt: miguel, pixar, cartoon, playing guitar, high quality, full body, single person

input_guitar_2

Output video

output_guitar_2

We noticed that even when the changes between input images are larger, the interpolation still handles the video synthesis well. Although there are some artifacts in the middle frames, the main limitations come from the input image generation side. If future text-to-image synthesis models can generate better images with high consistency across all the factors above, frame interpolation will be a powerful method for text-to-video generation.
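To make the idea of filling in frames between generated images concrete, here is a minimal sketch in Python. It is not the interpolation model used in this project: it swaps in a naive linear cross-fade, which only blends pixel values and does not estimate motion the way a learned frame interpolation model would. The function names `crossfade` and `keyframes_to_video` and the parameter `n_mid` are hypothetical, chosen for illustration only.

```python
import numpy as np


def crossfade(frame_a, frame_b, n_mid):
    """Return n_mid in-between frames that linearly blend frame_a into frame_b.

    Naive stand-in for a learned frame interpolator (assumption): it blends
    pixel values only and does not model motion between the two frames.
    """
    a = frame_a.astype(np.float32)
    b = frame_b.astype(np.float32)
    mids = []
    for i in range(1, n_mid + 1):
        t = i / (n_mid + 1)  # blend weight, strictly between 0 and 1
        blended = (1.0 - t) * a + t * b
        mids.append(np.clip(np.rint(blended), 0, 255).astype(np.uint8))
    return mids


def keyframes_to_video(keyframes, n_mid):
    """Build a frame sequence by inserting n_mid blended frames between
    each pair of consecutive keyframes (uint8 arrays of identical shape)."""
    if not keyframes:
        return []
    frames = [keyframes[0]]
    for prev, nxt in zip(keyframes, keyframes[1:]):
        frames.extend(crossfade(prev, nxt, n_mid))
        frames.append(nxt)
    return frames
```

With `k` keyframes and `n_mid` in-between frames per pair, the result has `k + (k - 1) * n_mid` frames. A cross-fade like this produces ghosting when the subject moves between keyframes, which is why consistent input images matter so much for the quality of the middle frames.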

Contributors

robo-alex, stonov
