  • Why should you use a Pipeline?
  • How do you encode categorical features with OneHotEncoder?
  • How do you apply OneHotEncoder to selected columns with ColumnTransformer?
  • How do you build and cross-validate a Pipeline?
  • How do you make predictions on new data using a Pipeline?
  • Why should you use scikit-learn (rather than pandas) for preprocessing?
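As a rough preview of how these pieces might fit together, here is a minimal sketch. The toy DataFrame, its column names (`Sex`, `Embarked`, `Parch`, `Survived`), and the choice of `LogisticRegression` are illustrative assumptions, not taken from the material above.

```python
import pandas as pd
from sklearn.compose import make_column_transformer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder

# Hypothetical toy data: two categorical columns and one numeric column.
df = pd.DataFrame({
    "Sex": ["male", "female", "female", "male", "male",
            "female", "male", "female", "female", "male"],
    "Embarked": ["S", "C", "S", "Q", "S", "C", "Q", "S", "C", "S"],
    "Parch": [0, 1, 0, 2, 0, 1, 0, 0, 2, 1],
    "Survived": [0, 1, 1, 0, 0, 1, 0, 1, 1, 0],
})
X = df[["Sex", "Embarked", "Parch"]]
y = df["Survived"]

# One-hot encode only the categorical columns; pass the numeric column through.
# handle_unknown="ignore" encodes categories unseen during fit as all zeros.
column_transformer = make_column_transformer(
    (OneHotEncoder(handle_unknown="ignore"), ["Sex", "Embarked"]),
    remainder="passthrough",
)

# Chain preprocessing and the model into a single estimator.
pipe = make_pipeline(column_transformer, LogisticRegression())

# Cross-validate the whole pipeline, so encoding is re-fit on each training fold.
scores = cross_val_score(pipe, X, y, cv=5, scoring="accuracy")

# Fit on all the data, then predict on new (hypothetical) rows.
pipe.fit(X, y)
X_new = pd.DataFrame({"Sex": ["female", "male"],
                      "Embarked": ["C", "S"],
                      "Parch": [0, 1]})
predictions = pipe.predict(X_new)
```

Because the encoder is fit inside the pipeline, the categories it learned from the training data are reused when `predict` is called, so the new rows are encoded into the same columns without any manual preprocessing.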