Comments (7)
Hi @trungpham2606 and welcome back!
Our line of thought was roughly as follows: The heads need information about the objects (encoded in the scene representation) and information whether any given object belongs to the reference category (encoded in the difference of scene and reference embeddings). Simply concatenating both allows the heads to decide, which information they use for which part of the prediction (region proposals, classification, bbox regression and segmentation). The additional 1x1 convolution was simply chosen to reduce the computational footprint as we assumed, that some of the information in both encodings is redundant.
Does that make it clearer?
from siamese-mask-rcnn.
Do you have an example image? In general false positives are one of the biggest challenges our model has, so an additional detection of the box does not surprise me.
from siamese-mask-rcnn.
If all your images are very similar you could solve the problem with a simple hack: Throw away all large boxes. Alternatively you could create a new dataset from the good predictions and finetune Mask R-CNN or Siamese Mask R-CNN on that.
from siamese-mask-rcnn.
@michaelisc Iam trying to use your model to solve my problem that I will have a bunch of coins in a box and try to segment as many coins as possible. But besides the good segment results, the model also segment the whole box. Do you have any explanation or idea for coping with that situation ?
from siamese-mask-rcnn.
@michaelisc here's my result:
from siamese-mask-rcnn.
Oh and of course you can wait until we fix the fundamental problem but that may take a while ;D
from siamese-mask-rcnn.
@michaelisc what a trick =)))) I am looking forward to your upcoming fix ^^
from siamese-mask-rcnn.
Related Issues (20)
- How to use the pretrained weights? HOT 5
- plz help HOT 5
- shape error HOT 1
- Problem with mask predictions HOT 17
- How to design the loss function? HOT 4
- Multiple runs on training not evaluation HOT 2
- typo HOT 1
- Small typo HOT 1
- Dimension dismatch error when evaluate the retrained model for COCO, from epoch 2 HOT 2
- Problem with custom dataset HOT 8
- Under segmentation for close items HOT 4
- Performance is not as good as Mask RCNN when training on a small custom dataset HOT 1
- Modifications for training on custom dataset
- Test.ipynb
- One-Shot Detection - input_target shape Error HOT 3
- TypeError: unhashable type: 'ListWrapper' when try to train model HOT 1
- issue re-running
- Support for Tensorflow 2.4+ HOT 1
- Loading weights
- Why are model loading and inference times so slow?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from siamese-mask-rcnn.