List of useful Python packages for data science:
- pandas-profiling
- memory_profiler
- py-spy
- pyflame ❗ (Does not support Windows)
- pyramid-arima
- fbprophet - time series forecasting (additive model) which performs best with high frequency data
- pyflux - time series library
- dask
- pyarrow - Python bindings for Apache Arrow
- fastparquet
- vaex
- modin - pandas on Ray execution framework
- dampr
- datatable - pandas equivalent for performing big data (up to 100GB) operations on a single-node machine, at the maximum speed possible
- PyViz stack (part of PyData stack)
- Panel - dashboarding from PyData Stack
- Plotly
- Dash - dashboard library from plotly
- matplotlib
- seaborn
- altair
- pdvega
- chartify
- dataspyre - dashboard framework with flask backend
- folium
- geoplot
- plotnine - clone of R's ggplot2
- joypy
- bqplot
- jmpy
- pyqtgraph
- probscale - probability scaled axis
- adjustText - add non-overlapping annotated text
- ipysheet
- ipypivot
- papermill + scrapbook - parameterized notebooks
- nteract-scapbook - for passing data between notebooks
- jupytext - edit notebooks as text files
- papermill + scrapbook - parameterized notebooks
- dequindre
- petl
- bonobo
- pypeln
- botflow
- mara data integration
- dbt
- Luigi from Spotify - works with Windows
- Apache Airflow - Windows not supported
- scikit-learn
- sklearn-pandas
- imbalanced-learn
- hyperopt-sklearn - not pip installable yet
- tpot
- xgboost
- lightgbm
- DMTK - Microsoft Distributed Machine Learning Toolkit
- fastText
- Netron - visualize NNs
- BlingFire -Microsoft Bing team, a tokenizer designed for fast-speed and quality tokenization of Natural Language text
- beautifulsoup4
- mechanicalsoup
- selenium - headless-browser framework for extracting javascript data
- scrapy - OOP framework
- newspaper3k - easily extract text from articles
- requests-html
- tldr-python-client - replacement for man pages
- bropages](http://bropages.org/) -
sudo apt-get install ruby-dev, sudo gem install bropages
- howdoi
- inspect
- prettypandas
- fuzzywuzzy
- fuzzymatcher
- pytest
- requests
- psutil
- click - for making CLI
- fire - for making CLI
- questionary
- glances - CPU/memory monitoring
- pendulum - a better datetime library, better than arrow
- visidata - free, open-source tool that lets you quickly open, explore, summarize, and analyze datasets in your computer’s terminal
- schedule - job scheduling for humans
- pyautogui
- ptpython - better REPL
- xlwings - Excel VBA, but with Python instead
- openpyxl
- scriptedforms
- black - source code formatter