mtisz / car_detection_yolo_faster_rcnn_uvsc2019

This project forked from aniskoubaa/car_detection_yolo_faster_rcnn_uvsc2019

This repo contains all the source code and dataset used in the paper Car Detection using Unmanned Aerial Vehicles: Comparison between Faster R-CNN and YOLOv3

Car Detection using Unmanned Aerial Vehicles: Comparison between Faster R-CNN and YOLOv3

  • This repo contains the dataset and link to the source used in the paper "Car Detection using Unmanned Aerial Vehicles: Comparison between Faster R-CNN and YOLOv3"

Links

  • paper - Link to the paper
  • video - Video of the paper

Abstract

  • Unmanned Aerial Vehicles are increasingly being used in surveillance and traffic monitoring thanks to their high mobility and ability to cover areas at different altitudes and locations. One of the major challenges is to use aerial images to accurately detect cars and count them in real-time for traffic monitoring purposes. Several deep learning techniques based on convolutional neural networks (CNNs) were recently proposed for real-time classification and recognition in computer vision. However, their performance depends on the scenarios where they are used. In this paper, we investigate the performance of two state-of-the-art CNN algorithms, namely Faster R-CNN and YOLOv3, in the context of car detection from aerial images. We trained and tested these two models on a large car dataset taken from UAVs. We demonstrated in this paper that YOLOv3 outperforms Faster R-CNN in sensitivity and processing time, although they are comparable in the precision metric.
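The abstract compares the two detectors on precision and sensitivity. As a reminder of how these metrics are usually defined, here is a minimal sketch. The function names and the counts are hypothetical and chosen only for illustration; they are not results from the paper.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of predicted cars that are real cars."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of real cars that were detected (also called recall)."""
    return tp / (tp + fn) if (tp + fn) else 0.0


# Hypothetical counts for illustration only, NOT results from the paper.
tp, fp, fn = 90, 10, 30
print(f"precision   = {precision(tp, fp):.2f}")    # 90 / (90 + 10)
print(f"sensitivity = {sensitivity(tp, fn):.2f}")  # 90 / (90 + 30)
```

A detector can therefore have high precision (few false alarms) and still have low sensitivity (many missed cars), which is why the two metrics are reported separately.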

Citation

@misc{benjdira2018car,
    title={Car Detection using Unmanned Aerial Vehicles: Comparison between Faster R-CNN and YOLOv3},
    author={Bilel Benjdira and Taha Khursheed and Anis Koubaa and Adel Ammar and Kais Ouni},
    year={2018},
    eprint={1812.10968},
    archivePrefix={arXiv},
    primaryClass={cs.RO}
}

Dataset used in the paper

  • To perform the experimental part of our study, we built a UAV imagery dataset divided into a training set and a test set.
  • We collected car images from different environments and at different scales to ensure the validity of our experiment and to test how well the algorithms generalize. For example, some images were taken from an altitude of 55 m and others from above 80 m.
  • The training set contains 218 images and 3,365 instances of labeled cars. The test set contains 52 images and 737 instances of cars.
  • This dataset was collected from:
    • Images taken by a UAV flown above the Prince Sultan University campus. This is what we provide in this repository, in three folders:
      • The images taken by the UAV flown above the Prince Sultan University campus.
      • The labels for these images in XML format.
      • Scripts to convert the labels from the XML format to the VOC format and to the YOLO format.
    • An open-source dataset available on GitHub: https://github.com/jekhor/aerial-cars-dataset. We used its images and their labels.
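The label conversion mentioned above can be sketched as follows. This is not the script shipped in the repository: it is a minimal illustration that assumes the XML labels follow a Pascal VOC-style layout (`size/width`, `size/height`, and `object/bndbox` with `xmin`, `ymin`, `xmax`, `ymax`) and that the only class is named `car`. The function names and the sample annotation are hypothetical.

```python
import xml.etree.ElementTree as ET


def voc_box_to_yolo(xmin, ymin, xmax, ymax, img_w, img_h):
    """Convert pixel corner coordinates to YOLO's normalized (cx, cy, w, h)."""
    cx = (xmin + xmax) / 2.0 / img_w
    cy = (ymin + ymax) / 2.0 / img_h
    w = (xmax - xmin) / img_w
    h = (ymax - ymin) / img_h
    return cx, cy, w, h


def voc_xml_to_yolo_lines(xml_text, class_names=("car",)):
    """Return one 'class_id cx cy w h' line per labeled object (assumed VOC-style XML)."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    img_w = float(size.findtext("width"))
    img_h = float(size.findtext("height"))
    lines = []
    for obj in root.iter("object"):
        name = obj.findtext("name")
        if name not in class_names:
            continue  # skip classes we are not training on
        box = obj.find("bndbox")
        coords = [float(box.findtext(tag)) for tag in ("xmin", "ymin", "xmax", "ymax")]
        cx, cy, w, h = voc_box_to_yolo(*coords, img_w, img_h)
        lines.append(f"{class_names.index(name)} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}")
    return lines


# Hypothetical annotation, for illustration only.
SAMPLE = """<annotation>
  <size><width>1000</width><height>500</height><depth>3</depth></size>
  <object>
    <name>car</name>
    <bndbox><xmin>100</xmin><ymin>50</ymin><xmax>300</xmax><ymax>150</ymax></bndbox>
  </object>
</annotation>"""

print(voc_xml_to_yolo_lines(SAMPLE))
# ['0 0.200000 0.200000 0.200000 0.200000']
```

YOLO expects one such text file per image, with coordinates normalized by the image width and height, so the same labels can be reused if the images are resized.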

Training Faster R-CNN

python train.py --logtostderr --train_dir=training/ --pipeline_config_path=training/faster_rcnn_inception_v2_coco.config
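The pipeline config passed to `train.py` normally references a label map file that assigns an integer id to each class. The sketch below generates such a file for a single class. The class name `car` and the helper name `label_map_text` are assumptions made for illustration; check them against the labels and config actually used.

```python
def label_map_text(class_names):
    """Build label map text; ids start at 1 because 0 is reserved for background."""
    items = []
    for idx, name in enumerate(class_names, start=1):
        items.append(f"item {{\n  id: {idx}\n  name: '{name}'\n}}\n")
    return "\n".join(items)


# Assumed single-class setup; write the printed text to the label map file
# that your pipeline config points to.
print(label_map_text(["car"]))
```

With a single class, the number of classes in the pipeline config would also need to be 1 so that it matches the label map.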

Training YOLO v3

  • To train the YOLO v3 model, we used the default YOLO v3 implementation provided here: YOLO v3.
  • After building the training binary for YOLO, run the following command:
./darknet detector train cfg/voc.data cfg/yolov3-voc.cfg darknet53.conv.74
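The first argument of the command above is a darknet data file that tells the trainer how many classes there are and where the image lists live. As a rough sketch, such a file could be generated for a single-class car dataset as shown below. All paths and the helper name `darknet_data_text` are hypothetical and must be adapted to your own layout.

```python
def darknet_data_text(num_classes, train_list, valid_list, names_file, backup_dir):
    """Build the key = value text of a darknet data file."""
    return (
        f"classes = {num_classes}\n"
        f"train = {train_list}\n"
        f"valid = {valid_list}\n"
        f"names = {names_file}\n"
        f"backup = {backup_dir}\n"
    )


# Hypothetical paths, for illustration only.
text = darknet_data_text(
    num_classes=1,
    train_list="data/train.txt",   # one image path per line
    valid_list="data/test.txt",    # one image path per line
    names_file="data/car.names",   # one class name per line
    backup_dir="backup/",          # where darknet saves weight checkpoints
)
print(text)
```

When the number of classes changes from the default, the network `.cfg` file usually has to be adjusted as well so that its detection layers match the new class count.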

Car detection using Faster R-CNN

[Image: example car detections using Faster R-CNN]

Car detection using YOLO v3

[Image: example car detections using YOLO v3]

Contributors

  • aniskoubaa
  • bilel-bj