mooneese,github

animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

awesome-diffusion-categorized

Collection of diffusion model papers categorized by their subareas

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

depth-anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

dynamicrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

emote-hack

Using ChatGPT (now Claude 3) to reverse engineer code from the Emote white paper. WIP

grok-1

Grok open release

instantid

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

moore-animateanyone

ootdiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

open-sora

Open-Sora: Democratizing Efficient Video Production for All

paddleseg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

prompt-engineering-guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

real-time-voice-cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

roop-unleashed

Evolved Fork of roop with Web Server and lots of additions

sadtalker-video-lip-sync

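This project implements Wav2lip video lip synthesis on top of SadTalker. Lip shapes are generated by driving a video file with speech, and a configurable enhancement applied to the facial region sharpens the synthesized lip (face) area to improve the clarity of the generated lips. The DAIN deep learning frame-interpolation algorithm then adds frames to the generated video, filling in the motion transitions between synthesized lip frames so that the result looks smoother, more realistic, and more natural.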

score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

stableviton

[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On

streamdiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

vbench

[CVPR2024] VBench: Comprehensive Benchmark Suite for Video Generative Models
