Comments (10)
Ivan, thank you for the mention!
The above is intentional, assuming the image came from the training partition. We only have one box per image per entity, for the training set. However, for the validation set, we tried to have all instances boxed.
from dataset.
I am curious about this method also. @lamerman @samihaija any updates about this?
from dataset.
@samihaija Sami, can you please comment on this?
from dataset.
@samihaija @rkrasin thank you.
@samihaija I'm not a big expert in object detection algorithms, but my first guess would be that a neural network that is learning on this data will be penalized while training. It predicts multiple chairs, but the training data has only one and my guess would be that when it predicts YES and the training data says it NO, when in reality it's YES, the network will be penalized for such predictions.
It's much more a question than statement, as I am not sure.
What do you think, could it be a problem?
P.s. I'm trying to teach YOLO using openimages.
from dataset.
I looked at the loss function of YOLO
And it seems like absence of bounding box for image when in reality it should be will affect the loss function on line 4. And it's interesting what was the reasoning behind making only one bounding box for openimages.
from dataset.
Does someone has an explanation concerning the fact that training data contains only one bounding box instead of all boxes ?
As a result, this dataset cannot be used to train object detection algorithm ?
from dataset.
because of this, Yolo does not train well with this dataset.
from dataset.
@dashesy hi, do you try any other detection algorithm ( like faster rcnn, ssd ) with openimages dataset? I try to train faster rcnn with mxnet with openimages, but I have many problems when preprocessing the dataset.
from dataset.
@dashesy hi, do you try any other detection algorithm ( like faster rcnn, ssd ) with openimages dataset? I try to train faster rcnn with mxnet with openimages, but I have many problems when preprocessing the dataset.
from dataset.
Further, a lot of images have missing labels. Is this problem fixable?
from dataset.
Related Issues (20)
- OpenImages V6 data set HOT 1
- there are no cat and dog coarse-grain category. HOT 1
- Image 01a624308e2f8c5d in oidv6-train-annotations-bbox.csv is mislabled
- Mislabeled Images HOT 1
- segmentations.csv mask 3 coordinates HOT 1
- Decoding Openimages v6 mask coordinates HOT 2
- BadZipFile Error HOT 3
- Soil-dataset
- L
- Golf rounds
- OIDv4 Tool Kit Windows 10 Python 3.7 HOT 2
- Extended dataset download per category? HOT 1
- (V5) Mismatched image and mask resolutions. HOT 2
- Explore UI does not load images HOT 2
- How to report invalid/questionable images? HOT 5
- Open Image Dataset V5 to COCO JSON format
- Why not build a video instance segmentation dataset?
- Where can I download the OpenImage V2 dataset? HOT 1
- Hierarchy question
- Request to add pretrained large-scale object detector to "Community Contributions" HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dataset.