Object Detection Challenge

Prerequisite

create a GPU environment

tensorflow 2.1.0 is the last version to support python 2

conda env create -f py3-tf2-gpu.yml
conda activiate py3-tf2-gpu

my nvidia card is RTX3070, requires nvidia driver version > 450, so cudatoolkit version should >= 11.0 using cudatoolkit 10 may cause compatibility issues for gpu support when using tensorflow.

the above conda environment will install python 3.7 and cudatoolkit 11.0.
install tensorflow 2.4.0 separately, don't install it with conda as it will install cuda 10.2 and cudnn 7 along side, so it may conflict with the new version installed.

pip install tensorflow-gpu

download cudnn 8, and copy all the files from bin folder of the downloaded, cudnn 8 folder, then paste it in the bin folder of the conda environment folder.

tar -xzvf cudnn-11.0-linux-x64-v8.0.5.39.tgz

# copy and past to conda envs lib and include
sudo cp cuda/include/cudnn*.h   /anaconda3/envs/py3-tf2-gpu/include
sudo cp cuda/lib64/libcudnn*    /anaconda3/envs/py3-tf2-gpu/lib
sudo chmod a+r /usr/local/cuda/include/cudnn*.h    /anaconda3/envs/py3-tf2-gpu/lib/libcudnn*

Open Image Data v6 Download

clone the code

git clone https://github.com/pythonlessons/OIDv4_ToolKit.git

download class "Apple"

python main.py downloader --classes Apple --type_csv validation

An OID folder will be created with two sub folder "csv_folder" and "Dataset" where all the downloaded images are located. Ignore the promot "Error", Enter "y" for download the missing file 3. support OIDv6 download

open https://storage.googleapis.com/openimages/web/download.html, click "V6" button in the first line and then scroll down to "Download the annotations and metadata" and click "Train" button to download train csv file in "Boxes" row.
After downloading, put the "oidv6-train-annotations-bbox.csv" into the folder of "OID/scv_folder", and change the name to "train-annotation-bbox.csv" (you also can edit the code in main.py if you don't want to rename the csv file)
repeat previous steps for downloading the "test" and "validation" annotation bbox
now the csv_folder contains 4 files including the "class-description-boxable.csv"
use the same command as usual.

download images

create classes.txt (specify your interested classes)
download all the images of interested classes with a limited number for train, validation and test

python main.py downloader --classes classes.txt --type_csv train --limit 1500

Convert OID image to tfrecord

using the generator in "oid_tfrecord" or clone the original code, and make replace 'app' with 'compat.v1' in the code

git clone https://github.com/zamblauskas/oidv4-toolkit-tfrecord-generator.git

convert oid image dataset to tfrecord for train, validation and test

python generate-tfrecord.py \
--classes_file ../OIDv4_ToolKit/classes.txt \
--class_descriptions_file ../OIDv4_ToolKit/OID/csv_folder/class-descriptions-boxable.csv \
--annotations_file ../OIDv4_ToolKit/OID/csv_folder/train-annotations-bbox.csv \
--images_dir ../OIDv4_ToolKit/OID/Dataset/train \
--output_file ../dataset/oid/train.tfrecord \

Yolov3-tensorflow2

clone the code

git clone https://github.com/zzh8829/yolov3-tf2.git

Convert yolo3 pre-trained Darknet weights

wget https://pjreddie.com/media/files/yolov3.weights -O data/yolov3.weights
python convert.py --weights ./data/yolov3.weights --output ./checkpoints/yolov3.tf

Verify model

python detect.py --image ./data/meme.jpg

Training with your own interested classes

with Transfer Learning Origanl pretrained yolov3 has 80 classes, here we learn 9 classes.

python train.py \
	--dataset ../dataset/oid/train.tfrecord \
	--val_dataset ../dataset/oid/val.tfrecord \
	--classes ../OIDv4_ToolKit/classes.txt \
	--num_classes 9 \
	--mode fit --transfer darknet \
	--batch_size 16 \
	--epochs 10 \
	--weights ./checkpoints/yolov3.tf \
	--weights_num_classes 80 \

Training from random weights Training from scrath is very difficult to converge. The original paper trained darknet on imagenet before training the whole network as well.

python train.py \
	--dataset ./dataset/oid/train.tfrecord \
	--val_dataset ./dataset/oid/val.tfrecord \
	--classes ./OIDv4_ToolKit/classes.txt \
	--num_classes 9 \
	--mode fit --transfer none \
	--batch_size 16 \
	--epochs 10 \

Inference

detect from images

python detect.py \
	--classes ../OIDv4_ToolKit/classes.txt \
	--num_classes 9 \
	--weights ./checkpoints/yolov3_train_10.tf \
	--image ./data/street.jpg \

lower the threshold to recheck the result if it is not success in the first time.

# in the model.py
flags.DEFINE_float('yolo_iou_threshold', 0.5, 'iou threshold')
flags.DEFINE_float('yolo_score_threshold', 0.5, 'score threshold')

detect from validation set

python detect.py \
	--classes ../OIDv4_ToolKit/classes.txt \
	--num_classes 9 \
	--weights ./checkpoints/yolov3_train_10.tf \
	--tfrecord ./data/oidv6_val.tfrecord \

jiaya12 / object_detection Goto Github PK

object_detection's Introduction

Object Detection Challenge

Prerequisite

create a GPU environment

Open Image Data v6 Download

Convert OID image to tfrecord

Yolov3-tensorflow2

Training with your own interested classes

Inference

References

object_detection's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent