How to generate instance mask, only one channel? about oneformer HOT 6 CLOSED

shi-labs commented on May 20, 2024

How to generate instance mask, only one channel?

from oneformer.

Comments (6)

praeclarumjj3 commented on May 20, 2024

Hi @rockywind, thanks for your interest in our work.

If I understand your question correctly, you want to generate a single-channel segmentation mask for the instance segmentation predictions. Please provide more description about your issue if this is not the case.

You can loop through the pred_masks stored in the instance predictions, assign an ID to each mask, and aggregate those into a single channel mask.

OneFormer/oneformer/oneformer_model.py

Line 475 in 7611899

result.pred_masks = (mask_pred > 0).float()

from oneformer.

rockywind commented on May 20, 2024

Hi,
thank you for your help.
Each pixel value represents an instance category, the value is 1,2,3, and so on. The 0 is the representation's background.
But, I found that the value of result.pred_masks is between 0 and 1, the shape of result.pred_masks is [7, 1114, 2191], the image's size is [1114, 2191] .

from oneformer.

praeclarumjj3 commented on May 20, 2024

I believe you are talking about the semantic segmentation result, where each pixel corresponds to the corresponding object's category.

You need to do an argmax operation on the semantic predictions to obtain those.

OneFormer/demo/predictor.py

Line 68 in 7611899

predictions["sem_seg"].argmax(dim=0).to(self.cpu_device), alpha=0.7

from oneformer.

rockywind commented on May 20, 2024

Hi,
Sorry for not being clear before. The following is sample data。
There are 3 cars in the picture, the first car's pixel value is 1, the second car's pixel value is 2, and the third car's pixel is 3.

from oneformer.

praeclarumjj3 commented on May 20, 2024

Each pixel value represents an instance category, the value is 1,2,3, and so on. The 0 is the representation's background.
But, I found that the value of result.pred_masks is between 0 and 1, the shape of result.pred_masks is [7, 1114, 2191], the image's size is [1114, 2191] .

Right, that's what I thought you wanted to do. You can loop through the result.pred_masks, assign an ID (starting from 1) to each mask, and aggregate them on an all-zeros mask. Please find the pseudo-code below:

# create an all-zeros mask
single_channel_mask = torch.zeros_like(image) # or torch.zeros((1114, 2191))
count = 0

# loop through all instance masks
for mask in result.pred_masks:
    count += 1
    mask *= count
    single_channel_mask = torch.max(single_channel_mask, mask)

Let me know if you have any more issues.

from oneformer.

rockywind commented on May 20, 2024

Thank you very much.
I have a try!

from oneformer.

How to generate instance mask, only one channel? about oneformer HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent