Scikit-learn is a Python library for machine learning. More specifically, it is, in the authors' words, a set of simple and efficient tools for data mining and data analysis. The library is built on top of several popular Python packages, namely NumPy, SciPy, and matplotlib.
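To give a feel for what these tools look like in practice, here is a minimal sketch, assuming scikit-learn and NumPy are installed. The choice of the bundled iris dataset, the k-nearest-neighbors classifier, and the split parameters is purely illustrative and not something the text above prescribes.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Load a small example dataset that ships with scikit-learn.
# The features and labels come back as plain NumPy arrays.
X, y = load_iris(return_X_y=True)

# Hold out a quarter of the samples for evaluation (illustrative split).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Fit a simple classifier; the model and n_neighbors are arbitrary choices.
model = KNeighborsClassifier(n_neighbors=3)
model.fit(X_train, y_train)

# Predict labels for the held-out samples and compute the mean accuracy.
predictions = model.predict(X_test)
accuracy = model.score(X_test, y_test)

print(type(X).__name__, X.shape)
print(f"accuracy on held-out data: {accuracy:.2f}")
```

The same `fit`, `predict`, and `score` pattern applies to most scikit-learn estimators, so swapping in a different model usually means changing only the import and the constructor line.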