10. Building a Machine Learning workflow

Why should you use a Pipeline?
How do you encode categorical features with OneHotEncoder?
How do you apply OneHotEncoder to selected columns with ColumnTransformer?
How do you build and cross-validate a Pipeline?
How do you make predictions on new data using a Pipeline?
Why should you use scikit-learn (rather than pandas) for preprocessing?

2 Lessons