Coder Social home page Coder Social logo

veremi-dataset.github.io's Introduction

VeReMi dataset

These pages describe the Vehicular Reference Misbehavior (VeReMi) dataset, a dataset for the evaluation of misbehavior detection mechanisms for VANETs. This dataset consists of message logs of on-board units, including a labelled ground truth, generated from a simulation environment. The dataset includes malicious messages intended to trigger incorrect application behavior, which is what misbehavior detection mechanisms aim to prevent. The initial dataset contains a number of simple attacks: the idea of this dataset release is not just to provide a baseline for the comparison of detection mechanisms, but also to serve as a starting point for more complex attacks. VeReMi is part of a recently accepted paper, submitted to the SecureComm conference.

Overview

VeReMi is a simulated dataset, generated using LuST (Version 2) and VEINS (with modifications, based on Version 4.6). It consists of message logs per vehicle, containing both GPS data (labeled as type=2) about the local vehicle and BSM messages (labeled as type=3) received from other vehicles though DSRC. It has two primary purposes: it serves as a baseline to assess how misbehavior detection mechanisms operate on a city scale, and it saves you a lot of computational power typically required to run VEINS sufficiently often. VeReMi consists of three different density levels, five different attacks, and three different attacker densities. The code and configuration files that are the input of VEINS are available in a separate repository on the securecomm2018 branch. We also provide a popper-like repository that contains all the scripts we used to generate the dataset (see below).

File structure

Each simulation log contains a ground truth file for every message and a set of message logs for every vehicle that received messages. The file name of a message log identifies the receiver by vehicle number and OMNeT++ module number, e.g., JSONlog-0-7-A0.json refers to the 0th vehicle with OMNeT++ module ID 7. The latter is the number also used to identify the sender as such in any reception log and the ground truth file. A0 refers to the fact that this vehicle is not an attacker (refer to the paper for a discussion of the attacks, or check out the source code below).

Accessing & Using VeReMi

To use VeReMi, or parts of the dataset, clone the corresponding repository, which is also on github. The repository uses git-lfs to store the actual message logs, because their size exceeds the amounts suitable for git. Due to Github storage limitations, we're currently hosting the dataset as a release here. We're thinking about a self-hosted repository with git-lfs to enable contributions again, but for now please download the dataset from the public release and write Rens if you wish to contribute.

Reproducing the Dataset

The repository with scripts that we used to generate the data (as well as all the other processing tasks done for the first paper) can be found on github. We are currently working on making those scripts more portable -- right now, a lot of deployment happens manually, and this is only documented in notes in the scripts. Please contact us if you have issues with these scripts.

Extending the Dataset

There are many ways to extend the dataset: for example, running on a different subset of LuST, adding new attacks, or changing simulation parameters. To add your results to this repository, please create a pull request on the data repository through the following steps:

  • publish your code (preferably as a fork of VeReMi's fork of veins)
  • fork VeReMi
  • clone VeReMi locally
  • create a folder in the repository and update the VeReMi index.md file
  • add your results into the new folder and use git LFS to add the message outputs
  • push your results to your fork on github
  • create a pull request in the github interface

Please note that at this stage we will only accept pull requests with public source code. If you'd like to contribute real-world data, please contact us directly.

Acknowledgement

The dataset was primarily put together by Rens van der Heijden at the Institute of Distributed Systems, part of Ulm University. Please contact Rens if you have any questions, comments or criticism. This work was supported in part by the Baden-Württemberg Stiftung gGmbH Stuttgart as part of the project IKT-05 AutoDetect of its IT security research programme. Simulations for this work were performed on the computational resource bwUniCluster funded by the Ministry of Science, Research and the Arts Baden-Württemberg and the Universities of the State of Baden-Württemberg, Germany, within the framework program bwHPC.

veremi-dataset.github.io's People

Contributors

namnatulco avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.