Coder Social home page Coder Social logo

hive's People

Contributors

shugerdou avatar yihaocs avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

hive's Issues

Problem in finding train data

Hello, I'm trying to run the code from your repository, and I encountered an error stating
'No such file or directory: 'data/training/instructpix2pix/seeds.json'.' It seems that the required file 'seeds.json' is missing from the specified directory.

Additionally, I tried executing the 'bash scripts/download_hive_data.sh' command to download the train dataset, but it doesn't seem to be working as expected.
Could you please provide some guidance on how to resolve these issues? Thank you!

Reproducing first training step

Hey, great job on this project! Really exciting work.

I want to reproduce the first part of your training process (fine-tuning IP2P) on SD2.1 768x768 but having some trouble.

I've tried to piece together the original IP2P training config and your generate config for SD2.1 but the layer params don't match up between my config and my checkpoint. Do you have any idea how to make the checkpoint and model match?

My model

https://huggingface.co/stabilityai/stable-diffusion-2-1/blob/main/v2-1_768-ema-pruned.ckpt

My config

# train.yaml
model:
  base_learning_rate: 1.0e-04
  target: ldm.models.diffusion.ddpm_edit_v21.LatentDiffusion
  params:
    ckpt_path: checkpoints/v2-1_768-ema-pruned.ckpt
    # ...

The errors

I get a lot of errors like these when starting training:

size mismatch for model.diffusion_model.input_blocks.2.1.x.y.z: copying a param with shape torch.Size([320, 1024]) from checkpoint, the shape in current 
model is torch.Size([320, 768]).

The mismatches are between a shape element being 1024 in checkpoint vs. 768 in the model definition, or something like torch.Size([640, 640]) vs. torch.Size([640, 640, 1, 1])

Exact errors

Screenshot 2023-05-07 at 12 03 12

HIVE does not work as good as InstructPix2Pix

Hi Authors, recently I am doing a comparison between HIVE and InstructPix2Pix. To my surprise HIVE is not working that good as compared to InstructPix2Pix. It is even quite off compared to it. I am really wondering if this is really the case or am I missing something. I have tested both condition and weighted based models with SD 2.1 as backbone. Both are not good and weighted based is quite even worse than condition based. And, in the paper figure 20 it shows that they are not supposed to be too different. I am testing with simple instructions even. Can you please help me here what is going on? Thanks a lot!

Releasing collected human annotations

Hi, thanks for your great work and releasing your code and data publicly! I was wondering if you are planning to release the human annotations you collected for training your reward model? Thanks!

other way to download dataset

Dear HIVE team,

I really enjoy your work. Thanks for sharing this great work.

Is there another way to download your dataset? It is not easy for us to use google cloud.

Thank you for your help.

Best Wishes,
Zongze

Can you share a working xformers conda setup?

Hey, for me the conda environment in this repo doesn't resolve upon adding xformers via conda.
Adding xformers via pip seems to break something about PyTorch/CUDA compat which leads to some tensors being on the wrong device at runtime (does not happen without xformers installation).

I used the installation procedures from the xformers Readme.

I assume you used xformers for training so let me know if you can share a working setup.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.