Coder Social home page Coder Social logo

mooneese's Projects

audio2photoreal icon audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

audiocraft icon audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

champ icon champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

depth-anything icon depth-anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

dynamicrafter icon dynamicrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

emote-hack icon emote-hack

using chatgpt (now Claude 3) to reverse engineer code from Emote white paper. WIP

instantid icon instantid

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

ootdiffusion icon ootdiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

open-sora icon open-sora

Open-Sora: Democratizing Efficient Video Production for All

paddleseg icon paddleseg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

sadtalker-video-lip-sync icon sadtalker-video-lip-sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

score_sde icon score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

stableviton icon stableviton

[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On

streamdiffusion icon streamdiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

tts icon tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

vbench icon vbench

[CVPR2024] VBench: Comprehensive Benchmark Suite for Video Generative Models

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.