The tmp-cctestbed_5z35 from trellixvulnteam

trellixvulnteam / tmp-cctestbed_5z35 Goto Github PK

View Code? Open in Web Editor NEW

This is a copy of Ray's repo

Shell 1.42% JavaScript 0.02% C++ 3.30% Python 76.32% C 18.00% CSS 0.02% Makefile 0.04% HTML 0.63% Vim Script 0.26%

tmp-cctestbed_5z35's Introduction

Classification

To classify the congestion control algorithm of websites, run python3.6 classify_websites.py --website [website1] [file1] --website [website2] [file2]...

How it works

classify_websites.py performs the following steps for each website:

Runs ccalg_predict.py, which completes at most 12 experiments, each with a different set of network conditions. Some network conditions may be skipped.
Gets the predicted label for each experiment with classify_websites.snakefile.
Experiments which are marked invalid by classify_websites.snakefile are rerun up to 3 times. If an experiment is still marked invalid after the third run, the predicted label for the experiment is considered unknown.
Counts the predicted labels of the final experiments. If a label has a strict majority, the congestion control algorithm of the website is classified as the majority label. Otherwise the algorithm is classified as unknown.
The predicted label for the website along with the names of the final experiments used in the prediction are printed and saved to /tmp/data-processed/[website]-[year][month][day][time].results, e.g. cca.org-20190915T103243.results.

The classification for a website will fail if an error is encountered when running either ccalg_predict.py or classify_websites.snakefile.