In main.py, the parameters are given at the top and the program should run in the same order.
Local directories are saved the in the dictionary. Some cached files are saved for testing and checking.
Part 1 can be run in one time. But may have to run again due to closed remote network connection.
Part 2, note that the parameters of threshold in getMedianWithinThreshhold() is retrived from plots generated by getThresholdRTTForWeek() and pd.getThresholdRTTFreqOfWeek().
- get valid dates
- update dict_dates
- collect dataset
- Unzip
- ReadWarts
- MapKey
- get rid of self-loop
- get node freqency
- get node set
- get subset by given node set
- get plots for t
- get dataset of median within threshhold
- get CSV file to feed in model
- check connectivity
Uncomment and run unit_test.py -> Module -> Step
Note:
- You can uncomment the required module in line 77 and 78.
- The numbered steps for each module is listed in testForDownloadDataset and testForProcessDataset with description of each method. You can uncomment the required step which is tdd/tpd.*()
UnitTest folder -> testfor_download_dataset.py and testfor_process_dataset.py -> testForSomething()