Statistical/ML modeling with scikit-learn ======================================================== The `sckit-learn module `_ is a full featured Python module for all kinds of data analysis and predictive modeling algorithms. In the pcda class we did `one session at the end of the semester `_ that just introduced this library and did some basic statistical/ML modeling. We'll pick up where that session left off and dive a little deeper both into some advanced modeling concepts as well as some of of sklearn's features such as `preprocessors `_ and `pipelines `_. Through this module you will: * review using sklearn to train and use statistical and machine learning models, * learn about and build ensemble models in sklearn, * use cookiecutter templates to create project folder/file structures and learn concepts for data science project file management, * learn about regularization methods within the context of building classifier models for the Pump it Up competition, * build logistic and tree based classifiers in sklearn using preprocessors and pipelines. .. toctree:: :maxdepth: 1 :caption: Sub-modules: Review of sklearn and ensemble models Using cookiecutters to structure projects Data prep for Pump it Up project Regularization and more classifier models * :ref:`search`