Here are a few of the things you'll learn in these 50 short video lessons:
How to build, evaluate, and tune a Pipeline
Two easy ways to visualize a decision tree
How to benefit from missing values using a "missing indicator"
How to plot an ROC curve in one line of code
How to speed up a grid search
How to add feature selection to a Pipeline
Why you should use scikit-learn (not pandas) for preprocessing
How to create an interactive diagram of a Pipeline
How to save your best Pipeline for future predictions
Why dropping a level when one-hot encoding is usually a bad idea
How to create custom transformers for feature engineering
Why you should use stratified sampling with train/test split
How to build and tune an ensemble of models
Why you should try ordinal encoding with tree-based models
And much, much more!
This is the perfect course for you if:
You've already completed my introductory ML course
You want to work more efficiently in scikit-learn
You want to learn best practices for Machine Learning code
You want to keep up-to-date with scikit-learn's latest features
I uploaded this series to YouTube in 2020 and 2021, and it has since gotten more than 450,000 views.
Here's why you'll have a better learning experience by taking the course here:
You can watch the videos without ads
You can save your progress and return later to the same spot
You can download the course notebooks
You can run the notebooks online using Binder or Colab
You can access relevant links about each tip
You can post your own questions, and I'll do my best to respond
After completing the course, you'll receive a certificate of completion
You should take my follow-up course, Master Machine Learning with scikit-learn.
My name is Kevin, and I've taught Data Science in Python to over a million students.
My courses explain data science topics in a clear, thorough, and step-by-step manner.
I'd love to teach you, regardless of your educational background or professional experience.
Thanks for joining me! 🙌