Coder Social home page Coder Social logo

tensorflow-tbcnn's Introduction

Tensorflow Implementation of Tree-based Convolutional Neural Network

Implementation of TBCNN using Tensorflow. Our report here.

Instead of training on C programs, This version is used for SQL injection detection. Dataset is provided in the data folder.

Installation

It's highly recommended to install into a virtual environment. Run the following command after activating venv:

pip install -r requirements/dev.txt

Usage

tbcnn.embedding is the entry point for pretraining the embedding matrix.

tbcnn.tbcnn is the entry point for the tbcnn model. It requires a pre-trained embedding matrix.

tensorflow-tbcnn's People

Contributors

aetf avatar

Stargazers

boX avatar Wildfield avatar  avatar Vamshi Vemula avatar sql_dp avatar Elaine Laguerta avatar Rahul Vast avatar Roman Shpir avatar  avatar alive avatar ZhiChen avatar T. Hayashi avatar  avatar Hiroto.Takatoshi avatar gt4404gb avatar Surendiran S avatar Alexey Rodionov avatar  avatar CoderGang avatar Emilio Cabrera avatar yhf avatar fdS avatar  avatar zhaozhehao avatar Kaito avatar justthefracts avatar yat1ma30 avatar Hao Wei avatar Pasquale Salza avatar  avatar Jeff Liu avatar Peng Yu avatar Falaq M avatar Yuxin Tang avatar Yuliang (Zack) Zou avatar Rayimpr avatar

Watchers

Alexey Rodionov avatar James Cloos avatar  avatar  avatar

tensorflow-tbcnn's Issues

SQl parser

Hey, I just need the sql parser script that you used to parser the sql payload and collect the dataset.
Just that part. which file is it?

why training hangs

@Aetf
I created the relevant environment and run embedding.py on my own computer according to your documentation. The program hung after it run and printed 1-25 pieces of information (the position of the stall was different each time the program was run), but it did not exit.

2018-04-01 06:01:12.024821: myglobal 1 epoch 1 step 1 loss = 21.25 (0.9 samples/sec; 1.175 sec/batch)
2018-04-01 06:01:12.354372: myglobal 2 epoch 1 step 2 loss = 17.27 (3.2 samples/sec; 0.312 sec/batch)
2018-04-01 06:01:12.787619: myglobal 3 epoch 1 step 3 loss = 10.45 (2.9 samples/sec; 0.346 sec/batch)
2018-04-01 06:01:13.477380: myglobal 4 epoch 1 step 4 loss = 17.19 (1.5 samples/sec; 0.678 sec/batch)
2018-04-01 06:01:14.020272: myglobal 5 epoch 1 step 5 loss = 17.10 (1.9 samples/sec; 0.518 sec/batch)
2018-04-01 06:01:14.258575: myglobal 6 epoch 1 step 6 loss = 10.39 (4.4 samples/sec; 0.228 sec/batch)
2018-04-01 06:01:14.698754: myglobal 7 epoch 1 step 7 loss = 26.52 (2.5 samples/sec; 0.407 sec/batch)
2018-04-01 06:01:14.965694: myglobal 8 epoch 1 step 8 loss = 15.85 (4.1 samples/sec; 0.246 sec/batch)
2018-04-01 06:01:15.259785: myglobal 9 epoch 1 step 9 loss = 17.02 (3.6 samples/sec; 0.274 sec/batch)
<------it hangs and do nothing forever and different position in next rerunning

Ctrl+c does not work, and ctrl+z can exit.
I used the "top" command to see that the host's CPU and memory were idle and not busy running any more.

my system is Ubuntu16.04 LTS, tensorflow=1.0.0, tensorflow_fold_fold=0.0.1 python=3.5, CPU only

Linux ubuntu 4.13.0-37-generic #42~16.04.1-Ubuntu SMP Wed Mar 7 16:03:28 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

How do i solve this problem?
Thanks very much!

Error when installing tensorflow-fold

Hello @Aetf,

I ran into problems when installing the tensorflow-fold package.
It seems that this package is not available in pip. Do you have any recommendations on this?

Which python version are you using?
Thank you very much in advance!

Include a lot of semantical information in addition to the (tree) structure

In the paper of yours you're mentioning a further work including extension with capturing semantical information ("what the input strings or integeres or ... contain or even mean", "what are the relationships between the contents of different strings", etc.).

Did you make any progress on this? I came across a tree-like problem which is though very sensitive to the contents of the leafs and I'm first looking for existing approaches to the problem before I start with something on my own.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.