Great job! <a class="user-mention notranslate" data-hovercard-type="user" data-hoverca

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

How to use generator to generate random image? about pixel2style2pixel HOT 5 CLOSED

eladrich commented on May 27, 2024

How to use generator to generate random image?

from pixel2style2pixel.

Comments (5)

yuval-alaluf commented on May 27, 2024

Hi @zerollzeng ,
First, please note that the implementation for the generator was taken from rosinality.

With that said, in our repo we provide code that you can adopt for generating random samples using our pSp network and StyleGAN generator. Specifically, please refer to our inference.py script for an implementation for generating a random latent vector that we use for style mixing here:

pixel2style2pixel/scripts/inference.py

Lines 117 to 121 in ac7da53

    
           # get latent vector to inject into our input image 
        
           vec_to_inject = np.random.randn(1, 512).astype('float32') 
        
           _, latent_to_inject = net(torch.from_numpy(vec_to_inject).to("cuda"), 
        
                                     input_code=True, 
        
                                     return_latents=True)

Although here we only store latent_to_inject, you can make a few changes that will store the generated corresponding random face image for the random latent vector.

For example, the following code should be a good starting point for your needs:

n_images_to_generate = 10
generated_images = []
for _ in range(n_images_to_generate):
        random_vec = np.random.randn(1, 512).astype('float32')
        random_image, _ = net(torch.from_numpy(random_vec).to("cuda"), input_code=True, return_latents=True)
        generated_images.append(random_image)

Here, we generate random w vectors which are fed into pSp to randomly generate face images of size 256x256.

Please let me know if this is clear and if you have further questions.

from pixel2style2pixel.

zerollzeng commented on May 27, 2024

Oh thanks for your detailed explanation, I understand it now ::smile::
I'm reading your paper and code, doing some experiment, your jobs is fantastic! I think it would be great if we keep this issue open so that we can continue the discussion instead of open a new issue 👍

from pixel2style2pixel.

zerollzeng commented on May 27, 2024

I'm going to train your encoder network from scratch, here are some question I'm interested:

What's the difference between GradualStyleEncoder, BackboneEncoderUsingLastLayerIntoW, and BackboneEncoderUsingLastLayerIntoWPlus?
Does the num_layer's (50, 100, 152) affect the reconstruct result a lot?

from pixel2style2pixel.

yuval-alaluf commented on May 27, 2024

Thank you for the kind words!
Regarding your questions:

These architectures refer to official pSp encoder, the Naive W+ encoder, and W Encoder mentioned in the paper, respectively. The pSp architecture is explained in Section 3 in the paper and is illustrated in Figure 2. For an explanation of the two later architectures, please refer to the results shown in Section 4.1 (StyleGAN Inversion) in the paper where we discuss the ablation study performed.
In our experiments, we did not change the num_layers from the default value of 50. That is, in all experiments, we use a pretrained IR-SE50 model (which we linked in the README).

Please let me know if you have further questions.

from pixel2style2pixel.

yuval-alaluf commented on May 27, 2024

Closing due to inactivity.

from pixel2style2pixel.

How to use generator to generate random image? about pixel2style2pixel HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

	# get latent vector to inject into our input image
	vec_to_inject = np.random.randn(1, 512).astype('float32')
	_, latent_to_inject = net(torch.from_numpy(vec_to_inject).to("cuda"),
	input_code=True,
	return_latents=True)