Coder Social home page Coder Social logo

alonzoleeeooo / lcdg Goto Github PK

View Code? Open in Web Editor NEW
30.0 30.0 2.0 116.49 MB

The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".

Home Page: https://arxiv.org/abs/2305.11520

Python 99.95% Shell 0.05%
diffusion-models image-generation text-to-image-generation

lcdg's Introduction

Hi there 👋

github stats

🧑🏻‍🎓 I am currently a PhD student at the University of Science and Technology of China (USTC), supervised by Prof. Dong Liu.

🔍 My research direction involves a wide series of applications based on generative models such as GANs, Transformers, and diffusion models. For now, I am mainly interested in these tasks: image inpainting, text-to-image generation, and video generation.

📂 To record up-to-date resources of the aforementioned research directions, I have maintained some GitHub repos, which you can find here. Newly published papers and resources will be updated in these repos.

🤝 I am looking for long-term collaboration in ground-breaking projects in computer vision, please feel free to contact me if you are interested.

📜 You can find more information about me in the following websites.

🔥 Recent News:

Visitor Count

lcdg's People

Contributors

alonzoleeeooo avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

ptnv-s yifan-xie

lcdg's Issues

Exception during LatentDiffusion.instantiate_first_stage

Hi @AlonzoLeeeooo,

I tried to run the training code, and encountered an error when ldm.models.autoencoder.AutoencoderKL was loading checkpoint params from pretrained first_stage_models,error detail shows below:
image

But when I tried debug the code, the current model showed different from the error detail above. I paste the corresponding part of the encoder model structure below:
image

I tried to find other works like T2I, the didn't use checkpoint for ldm.models.autoencoder.AutoencoderKL. So I wonder if you encountered this error before, and could you give some advice concerned.
Any tips or suggetions would be great appreciated. Looking forward for your reply.

Best regards.

release window of code implementation

Hey there, just wondering if there's any word on the release window for the code implementation. Any updates on the availability of the codebase would be greatly appreciated.

Loss mismatch related issues

In the paper, it is written that the loss is DM + cond, but the code only shows the cond related loss. Looking forward to hearing from you!
Snipaste_2024-07-09_11-35-51
Snipaste_2024-07-09_11-36-28

Extract which features in UNet? Which conference will this work be published?

Thank you for your amazing work about controlling Diffusion generation process! But I didn't find any ablation experiments about your choice of the features in UNet, which means a lot to the controlling and quality of results as for me. Are the features in Encoder too noisy to use to train the Condition Adaptor? Can you provide some more experiments? Thanks a lot!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.