Max Kuhn, Kjell Johnson
This is a book on data analysis with a specific focus on the practice of predictive modeling. The term predictive modeling may stir associations such as machine learning, pattern recognition, and data mining. Indeed, these associations are appropriate and the methods implied by these terms are an integral piece of the predictive modeling process. But predictive modeling encompasses much more than the tools and techniques for uncovering patterns within data. The practice of predictive modeling defines the process of developing a model in a way that we can understand and quantify the model’s prediction accuracy on future, yet-to-be-seen data. The entire process is the focus of this book.
- Introduction
- A Short Tour of the Predictive Modeling Process
- Data Pre-processing
- Over-Fitting and Model Tuning
- Measuring Performance in Regression Models
- Linear Regression and Its Cousins
- Nonlinear Regression Models
- Regression Trees and Rule-Based Models
- A Summary of Solubility Models
- Case Study: Compressive Strength of Concrete Mixtures
- Measuring Performance in Classification Models
- Discriminant Analysis and Other Linear Classification Models
- Nonlinear Classification Models
- Classification Trees and Rule-Based Models
- A Summary of Grant Application Models
- Remedies for Severe Class Imbalance
- Case Study: Job Scheduling
- Measuring Predictor Importance
- An Introduction to Feature Selection
- Factors That Can Affect Model Performance
20181201 First view
20190525 Second review