Libraries for data science in Python

Fundamental Libraries for Scientific Computing

  • IPython Notebook
  • NumPy
  • pandas
  • SciPy

Math and Statistics

  • SymPy
  • Statsmodels

Machine Learning

  • Scikit-learn
  • Shogun
  • PyBrain
  • PyLearn2
  • PyMC

Plotting and Visualization

  • Bokeh
  • d3py
  • ggplot
  • matplotlib
  • plotly
  • prettyplotlib
  • seaborn

Data formatting and storage

  • csvkit
  • PyTables
  • sqlite3