50 scikit-learn tips
Buy now
Learn more
Introduction
Welcome to the course!
Download the course notebooks
Data Preprocessing
1. Use ColumnTransformer to apply dif...
2. Seven ways to select columns using...
3. What is the difference between "fi...
4. Use "fit_transform" on training da...
38. Get the feature names output by a...
42. Passthrough some columns and drop...
Using pandas
5. Four reasons to use scikit-learn (...
35. Don't use .values when passing a ...
39. Load a toy dataset into a DataFrame
Categorical Features
6. Encode categorical features using ...
7. Handle unknown categories with One...
15. Three reasons not to use drop='fi...
41. Drop the first category from bina...
43. Use OrdinalEncoder instead of One...
Missing Values
9. Add a missing indicator to encode ...
11. Impute missing values using KNNIm...
14. HistGradientBoostingClassifier na...
27. Two ways to impute missing values...
Pipelines
8. Use Pipeline to chain together mul...
12. What is the difference between Pi...
13. Examine the intermediate steps in...
22. Use the correct methods for each ...
28. Save a model or Pipeline using jo...
30. Four ways to examine the steps of...
34. Add feature selection to a Pipeline
37. Create an interactive diagram of ...
48. Access part of a Pipeline using s...
50. Adapt this pattern to solve many ...
Parameter Tuning
16. Use cross_val_score and GridSearc...
17. Try RandomizedSearchCV if GridSea...
18. Display GridSearchCV or Randomize...
19. Important tuning parameters for L...
25. Prune a decision tree to avoid ov...
40. Estimators only print parameters ...
44. Speed up GridSearchCV using paral...
49. Tune multiple models simultaneous...
Model Evaluation
20. Plot a confusion matrix
21. Compare multiple ROC curves in a ...
26. Use stratified sampling with trai...
31. Shuffle your dataset when using c...
32. Use AUC to evaluate multiclass pr...
Model Inspection
23. Display the intercept and coeffic...
24. Visualize a decision tree two dif...
Model Ensembling
46. Ensemble multiple models using Vo...
47. Tune the parameters of a VotingCl...
Feature Engineering
29. Vectorize two text columns in a C...
33. Use FunctionTransformer to conver...
45. Create feature interactions using...
Coding Practices
10. Set a "random_state" to make your...
36. Most parameters should be passed ...
Conclusion
Request your certificate of completion
Take another course from Data School!
Earn money by promoting Data School's...
Products
Course
Section
Lesson
1. Use ColumnTransformer to apply different preprocessing to different columns
1. Use ColumnTransformer to apply different preprocessing to different columns
50 scikit-learn tips
Buy now
Learn more
Introduction
Welcome to the course!
Download the course notebooks
Data Preprocessing
1. Use ColumnTransformer to apply dif...
2. Seven ways to select columns using...
3. What is the difference between "fi...
4. Use "fit_transform" on training da...
38. Get the feature names output by a...
42. Passthrough some columns and drop...
Using pandas
5. Four reasons to use scikit-learn (...
35. Don't use .values when passing a ...
39. Load a toy dataset into a DataFrame
Categorical Features
6. Encode categorical features using ...
7. Handle unknown categories with One...
15. Three reasons not to use drop='fi...
41. Drop the first category from bina...
43. Use OrdinalEncoder instead of One...
Missing Values
9. Add a missing indicator to encode ...
11. Impute missing values using KNNIm...
14. HistGradientBoostingClassifier na...
27. Two ways to impute missing values...
Pipelines
8. Use Pipeline to chain together mul...
12. What is the difference between Pi...
13. Examine the intermediate steps in...
22. Use the correct methods for each ...
28. Save a model or Pipeline using jo...
30. Four ways to examine the steps of...
34. Add feature selection to a Pipeline
37. Create an interactive diagram of ...
48. Access part of a Pipeline using s...
50. Adapt this pattern to solve many ...
Parameter Tuning
16. Use cross_val_score and GridSearc...
17. Try RandomizedSearchCV if GridSea...
18. Display GridSearchCV or Randomize...
19. Important tuning parameters for L...
25. Prune a decision tree to avoid ov...
40. Estimators only print parameters ...
44. Speed up GridSearchCV using paral...
49. Tune multiple models simultaneous...
Model Evaluation
20. Plot a confusion matrix
21. Compare multiple ROC curves in a ...
26. Use stratified sampling with trai...
31. Shuffle your dataset when using c...
32. Use AUC to evaluate multiclass pr...
Model Inspection
23. Display the intercept and coeffic...
24. Visualize a decision tree two dif...
Model Ensembling
46. Ensemble multiple models using Vo...
47. Tune the parameters of a VotingCl...
Feature Engineering
29. Vectorize two text columns in a C...
33. Use FunctionTransformer to conver...
45. Create feature interactions using...
Coding Practices
10. Set a "random_state" to make your...
36. Most parameters should be passed ...
Conclusion
Request your certificate of completion
Take another course from Data School!
Earn money by promoting Data School's...
Lesson unavailable
Please
login to your account
or
buy the course
.