
natlamir's Projects

a11

Stable Diffusion web UI

audio-webui

A webui for different audio related Neural Networks

audiosep

Implementation of "Separate Anything You Describe"

bark

🔊 Text-Prompted Generative Audio Model

dinet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

dinet-ui

Windows Forms user interface for making lip sync videos with DINet and OpenFace

dream

Generative Gaussian Splatting for Efficient 3D Content Creation

emotivoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

llava-windows

[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.

magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

onlyspeaktts

Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.

oogabooga

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

openface

OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

piper

A fast, local neural text to speech system

pixart-alpha

Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

projectfiles

Where I will be storing misc files with details / links used during the installation process, etc

sadtalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

tpsm

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

vid2densepose

Convert your videos to densepose and use it on MagicAnimate

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
