How to visualize 400 million games of chess (PGN text files) using Python and a few (moderately powered) laptops.
Goals:
- Download PGN files from: https://database.lichess.org/
- Split those PGN files into individual PGNs, each representing one game of chess
- Use the python-chess library to extract interesting data about where the pieces stand on the final board
- Save this data to a .csv
- Use some kind of multiprocessing to speed everything up (total file size: ~400 GB)
- Build a pivot table with pandas and render it as a heatmap with seaborn
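
As a rough illustration of the splitting and extraction goals, here is a minimal sketch, not the final pipeline. It assumes every game in a multi-game PGN file begins with an `[Event "..."]` tag line, which is how the split boundary is detected. The helper names `split_games` and `final_piece_squares` and the two-game `SAMPLE` text are made up for this example. Only the second helper needs python-chess, so it imports the library lazily.

```python
import io


def split_games(stream):
    """Yield one game's PGN text at a time from a multi-game PGN stream.

    Assumption: each game starts with an [Event "..."] tag line.
    """
    buffer = []
    for line in stream:
        # A new [Event ...] line closes the game collected so far.
        if line.startswith("[Event ") and buffer:
            game = "".join(buffer).strip()
            if game:
                yield game + "\n"
            buffer = []
        buffer.append(line)
    # Flush the last game in the stream.
    game = "".join(buffer).strip()
    if game:
        yield game + "\n"


def final_piece_squares(game_text):
    """Return (piece symbol, square name) pairs for the final position.

    Requires python-chess; imported here so split_games works without it.
    """
    import chess
    import chess.pgn

    game = chess.pgn.read_game(io.StringIO(game_text))
    if game is None:
        return []
    board = game.end().board()  # position after the last mainline move
    return [
        (piece.symbol(), chess.square_name(square))
        for square, piece in board.piece_map().items()
    ]


# Hypothetical two-game input, standing in for a real downloaded file.
SAMPLE = '''[Event "Rated Blitz game"]
[Result "1-0"]

1. e4 e5 2. Qh5 Ke7 3. Qxe5# 1-0

[Event "Rated Bullet game"]
[Result "0-1"]

1. f3 e5 2. g4 Qh4# 0-1
'''

games = list(split_games(io.StringIO(SAMPLE)))
print(len(games), "games found")
```

Because `split_games` is a generator that reads line by line, it never holds a whole file in memory, which matters at this data size. Each yielded string can be passed to `final_piece_squares`, and the resulting pairs written out as CSV rows.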