Nora Mohamed & Lucy Wilcox's Python data visualization project for Software Design SP 2015
This project creates an interactive visualization of our dataset of over 10 million username and password combinations, focusing on the passwords. The program builds a GUI with Tkinter, where the user can view graphs of commonly occurring patterns in the passwords or graphs comparing their own input to the dataset. The graphs are rendered in the user’s web browser with Bokeh, which also makes them interactive.
IMPORTANT: datafunctions.py assumes you are using a Linux distribution where your computer's English dictionary is located at /usr/share/dict/american-english. If yours is somewhere else, change the path in datafunctions.py. Also make sure to download the dataset below, as it's too big to push to GitHub.
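If you are unsure whether the dictionary path is valid on your machine, a check along these lines may help. This is only a sketch in Python 3, not the code in datafunctions.py: the names DEFAULT_DICT_PATH and load_dictionary are hypothetical, and it assumes the dictionary is a plain text file with one word per line.

```python
import os

# Path taken from this README; other systems may keep the word list elsewhere.
DEFAULT_DICT_PATH = "/usr/share/dict/american-english"


def load_dictionary(path=DEFAULT_DICT_PATH):
    """Return the set of lowercased words in a one-word-per-line file.

    Hypothetical helper for illustration; not part of datafunctions.py.
    """
    if not os.path.isfile(path):
        raise FileNotFoundError(
            "No dictionary found at %r; point this at your system's word list"
            % path
        )
    with open(path, encoding="utf-8", errors="replace") as handle:
        # Skip blank lines and normalize case so lookups are case-insensitive.
        return {line.strip().lower() for line in handle if line.strip()}
```

Calling load_dictionary() with no arguments uses the default path above; pass a different path if your distribution stores the word list somewhere else.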
Download our dataset:
Please download it into the directory of the repository you are running this code from:
10-million-combos.txt
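Once the file is in place, a quick sanity check such as the one below can confirm that it is readable. This is a sketch in Python 3 and not the project's own loading code: the function name most_common_passwords is hypothetical, and it assumes each line holds a username and a password separated by a tab. Adjust the separator if your copy of the file is laid out differently.

```python
from collections import Counter


def most_common_passwords(path, top_n=10):
    """Count passwords in a username<TAB>password file.

    Hypothetical helper for illustration; the tab separator is an assumption.
    Returns a list of (password, count) pairs, most frequent first.
    """
    counts = Counter()
    with open(path, encoding="utf-8", errors="replace") as handle:
        for line in handle:
            # Split on the first tab only, so the rest of the line is the password.
            username, sep, password = line.rstrip("\r\n").partition("\t")
            if not sep:
                continue  # skip lines that do not contain a tab
            counts[password] += 1
    return counts.most_common(top_n)
```

For example, most_common_passwords("10-million-combos.txt", top_n=20) would return the 20 most frequent passwords along with how often each appears.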
Libraries to import:
-Bokeh
-Tkinter
-Levenshtein