## Graph

```mermaid
flowchart TD
node1["checkpoints/detector.dvc"]
node2["checkpoints/keypoint.dvc"]
node3["checkpoints/segmentation.dvc"]
node4["datasets/watch-faces.json.dvc"]
node5["download-images"]
node6["eval-detector"]
node7["eval-end-2-end"]
node8["eval-keypoint"]
node9["train-detector"]
node10["train-keypoint"]
node11["train-segmentation"]
node12["update-metrics"]
node1-->node9
node2-->node10
node3-->node11
node4-->node5
node5-->node7
node5-->node9
node5-->node10
node5-->node11
node6-->node12
node7-->node12
node8-->node12
node9-->node6
node9-->node7
node9-->node8
node9-->node12
node10-->node7
node10-->node8
node10-->node12
node11-->node7
node11-->node12
```
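The graph above is the DVC stage dependency graph. Assuming a standard DVC setup (suggested by the `.dvc` files and stage names, though not spelled out here), it can be regenerated and the pipeline reproduced with:

```bash
# Render the stage graph; --mermaid (DVC 2.x+) emits flowchart
# syntax like the block above.
dvc dag --mermaid

# Re-run any stages whose dependencies changed.
dvc repro

# Print the metrics tracked by the pipeline (see tables below).
dvc metrics show
```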
## Metrics
Path | train.1-min_acc | train.10-min_acc | train.60-min_acc | val.1-min_acc | val.10-min_acc | val.60-min_acc |
---|---|---|---|---|---|---|
metrics/end_2_end_summary.json | 0.066 | 0.082 | 0.168 | 0.013 | 0.026 | 0.039 |

Path | AP @IoU=0.50 | AP @IoU=0.50:0.95 | AP @IoU=0.75 | AP @IoU=0.95 | AR @maxDets=1 | AR @maxDets=10 | AR @maxDets=100 | Num Images | eval.loss | step | train.loss |
---|---|---|---|---|---|---|---|---|---|---|---|
metrics/detector.json | - | - | - | - | - | - | - | - | 0.42 | 59 | 0.041 |
metrics/detector/coco_train.json | 0.459 | 0.327 | 0.403 | -1.0 | 0.42 | 0.455 | 0.455 | 127 | - | - | - |
metrics/detector/coco_val.json | 1.0 | 0.73 | 0.832 | -1.0 | 0.75 | 0.75 | 0.75 | 6 | - | - | - |

Path | AP @IoU=0.50 | AP @IoU=0.50:0.95 | AP @IoU=0.75 | AR @IoU=0.50 | AR @IoU=0.50:0.95 | AR @IoU=0.75 | Num Images | eval.iou_score | eval.loss | step | train.iou_score | train.loss |
---|---|---|---|---|---|---|---|---|---|---|---|---|
metrics/keypoint.json | - | - | - | - | - | - | - | 0.62 | 0.38 | 59 | 0.899 | 0.106 |
metrics/keypoint/coco_train.json | 0.522 | 0.365 | 0.315 | 0.65 | 0.503 | 0.458 | 127 | - | - | - | - | - |
metrics/keypoint/coco_val.json | 1.0 | 0.754 | 0.632 | 1.0 | 0.783 | 0.667 | 6 | - | - | - | - | - |

Path | eval.iou_score | eval.loss | step | train.iou_score | train.loss |
---|---|---|---|---|---|
metrics/segmentation.json | 0.448 | 0.381 | 59 | 0.748 | 0.114 |
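A quick way to inspect one of these tracked metric files (the nested key layout is an assumption inferred from the dotted column names such as `eval.loss`; the files may use flat keys instead):

```python
import json
from pathlib import Path

# Path taken from the table above; DVC-style dotted metric names
# (eval.loss, train.loss) typically map to nested JSON keys.
metrics = json.loads(Path("metrics/detector.json").read_text())
print(metrics["eval"]["loss"])   # 0.42 in the table above
print(metrics["train"]["loss"])  # 0.041
```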
## End 2 end metrics definitions

The final metric for the entire system is "x-min accuracy": the fraction of system predictions that are accurate to within x minutes. For example:

$$\text{1-min-acc} = \frac{\left|\left\{\,\left|time - time_{pred}\right| < 1\ \text{min}\right\}\right|}{N_{samples}}$$
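A minimal sketch of this metric in NumPy (the function name, the minutes-based time representation, and the 12-hour wrap-around handling are assumptions, not the project's code):

```python
import numpy as np

def x_min_acc(time_true, time_pred, x=1.0):
    """Fraction of predictions within x minutes of the true time.

    Times are expressed in minutes on a 12-hour dial
    (e.g. 1:15 -> 75.0); this representation is an assumption.
    """
    time_true = np.asarray(time_true, dtype=float)
    time_pred = np.asarray(time_pred, dtype=float)
    err = np.abs(time_true - time_pred) % 720.0  # 720 min = 12 h
    err = np.minimum(err, 720.0 - err)           # shorter way around the dial
    return float(np.mean(err < x))

# 2 of 3 predictions are within 1 minute -> 0.667
print(x_min_acc([30, 90, 150], [30.5, 92.0, 150.2], x=1.0))
```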
## Demo - version 2
Models used:
- bbox detector for finding the clock face in the image
- classifier for clock orientation estimation
- keypoint detection for the center and top points
- semantic segmentation for finding the clock hands
- KDE for splitting the binary segmentation mask into individual clock hands
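A rough sketch of how the KDE-based splitting could work (the `split_hands_by_angle` function, the `scipy` usage, the peak prominence, and the 10-degree window are all illustrative assumptions; the project's implementation may differ):

```python
import numpy as np
from scipy.stats import gaussian_kde
from scipy.signal import find_peaks

def split_hands_by_angle(mask, center):
    """Split a binary hands mask into per-hand pixel clusters by angle.

    mask: 2D boolean array from the segmentation model.
    center: (x, y) of the predicted clock-center keypoint.
    """
    ys, xs = np.nonzero(mask)
    angles = np.arctan2(ys - center[1], xs - center[0])
    # Density of pixel angles; peaks should correspond to hands.
    # (Plain KDE ignores wrap-around at +/- pi; a simplification.)
    kde = gaussian_kde(angles)
    grid = np.linspace(-np.pi, np.pi, 360)
    peaks, _ = find_peaks(kde(grid), prominence=0.05)
    hands = []
    for peak_angle in grid[peaks]:
        # Pixels within 10 degrees of the peak belong to this hand.
        diff = np.abs(np.angle(np.exp(1j * (angles - peak_angle))))
        sel = diff < np.deg2rad(10)
        hands.append((ys[sel], xs[sel]))
    return hands
```

A line fitted to each cluster (e.g. by least squares over the pixel coordinates) then gives the candidate hand lines referred to below.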
Pipeline visualization (figure captions): watch crop with center and top keypoints; detected mask of watch hands; KDE of pixel angles; fitted lines to segmented pixels; final selected and rejected lines.
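With the hand lines selected, reading the time comes down to converting each hand's clockwise angle from the center-to-top ("12 o'clock") direction into hours and minutes; a minimal sketch (the angle convention is an assumption):

```python
def angles_to_time(hour_angle_deg, minute_angle_deg):
    """Clockwise hand angles measured from 12 o'clock -> (hour, minute)."""
    hour = int((hour_angle_deg % 360.0) / 360.0 * 12.0) % 12
    minute = int(round((minute_angle_deg % 360.0) / 360.0 * 60.0)) % 60
    return hour, minute

print(angles_to_time(37.5, 90.0))  # -> (1, 15), i.e. 1:15
```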
## Installation

To install the `watch_recognition` module, run pip in the main repository directory:

```bash
pip install watch_recognition/
```
Tested on Python 3.7 and 3.8
## Running models

Check out the example notebook: `notebooks/demo-on-examples.ipynb`
## Models description
TODO
## Demo - version 1
Models used:
- bbox detector for finding the clock face in the image
- classifier for clock orientation
- keypoint detection for the center, top, and ends of the clock hands
### Downloading images from the Open Images Dataset

```bash
# Fetch the official downloader script into scripts/ so the
# commands below find it where they expect it.
wget https://raw.githubusercontent.com/openimages/dataset/master/downloader.py -P scripts/
python scripts/downloader.py ./download_data/train_ids_small.txt --download_folder=./download_data/train/
python scripts/downloader.py ./download_data/test_ids_small.txt --download_folder=./download_data/test/
python scripts/downloader.py ./download_data/validation_ids_small.txt --download_folder=./download_data/validation/
```
### Convert tagged data into keypoint dataset

See notebook: `./notebooks/generate_kp_dataset.ipynb`
### Train keypoint detection model
See notebook: `./notebooks/cell-coder.ipynb`
### Label Studio setup

```xml
<View>
  <Image name="image" value="$image" zoom="true" zoomControl="true"/>
  <KeyPointLabels name="kp" toName="image">
    <Label value="Center" background="#FFA39E"/>
    <Label value="Top" background="#D4380D"/>
    <Label value="Crown" background="#FFC069"/>
  </KeyPointLabels>
  <PolygonLabels name="polygon" toName="image" strokeWidth="3" pointSize="small" opacity="0.9">
    <Label value="Hands" background="#45fc03"/>
  </PolygonLabels>
  <RectangleLabels name="bbox" toName="image">
    <Label value="WatchFace" background="#FFA39E"/>
  </RectangleLabels>
  <TextArea name="transcription" toName="image" editable="true" perRegion="true" required="false" maxSubmissions="1" rows="5" placeholder="Recognized Time" displayMode="region-list"/>
</View>
```
## References

- Open Images Dataset: https://opensource.google/projects/open-images-dataset