An Introduction to Statistical Learning with Applications in Python

Transform your data science career by mastering statistical learning, the definitive skill set for the modern data professional.

(STATS-PYTHON.AU1)
Lessons
Lab
AI Tutor (Add-on)

About This Course

Are you ready to move beyond basic data manipulation and truly leverage machine learning in Python to revolutionize your decision-making process? The role of the data analyst is undergoing a fundamental shift, requiring specialized knowledge of how to model complex systems strategically. This ISLP course moves you past simple summary statistics and dives deep into the art and science of supervised learning and high-dimensional data analysis.

You will master the foundational mathematical frameworks, learn professional cross-validation techniques for model selection, and explore unsupervised learning to uncover hidden patterns in unlabeled data. Whether you are aiming for precise predictions using linear regression, building robust classifiers with support vector machines, or exploring the frontier of deep learning, this program provides the practical, hands-on knowledge to design and launch advanced models. From the bias-variance trade-off to modern resampling methods, you will learn to build systems that are both accurate and interpretable.
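The bias-variance trade-off mentioned above is easy to see in miniature. The sketch below fits polynomials of increasing degree to noisy synthetic data with scikit-learn; the library choice, dataset, and degrees are illustrative assumptions, not the course's own lab code.

```python
# Hedged sketch of the bias-variance trade-off on synthetic data.
# All names and settings here are illustrative assumptions.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=200)  # nonlinear truth + noise

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

test_mse = {}
for degree in (1, 3, 12):
    # Polynomial basis expansion followed by ordinary least squares
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    test_mse[degree] = mean_squared_error(y_test, model.predict(X_test))

# A degree-1 fit underfits (high bias); a very high degree risks overfitting
# (high variance); an intermediate degree usually scores best on held-out data.
print(test_mse)
```

Evaluating on a held-out test set, rather than training error, is what exposes the trade-off: training error always falls as the degree grows, while test error does not.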

Skills You’ll Get

  • Foundations & Linear Models: Master the core of statistical learning, building from basic matrix algebra to multiple linear regression and logistic regression for powerful predictive modeling.
  • Resampling & Regularization: Assess model accuracy with cross-validation and the bootstrap, and optimize high-dimensional performance using ridge and lasso shrinkage methods.
  • Tree-Based & Support Vector Machines: Move beyond simple linearity with Decision Trees, Random Forests, and Support Vector Machines to handle complex, non-linear datasets with precision.
  • Deep & Unsupervised Learning: Explore the power of Neural Networks alongside Unsupervised Learning techniques like Clustering and PCA to find insights in data without predefined labels.
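As a taste of the regularization skills listed above, here is a hedged sketch contrasting ridge and lasso in scikit-learn on synthetic sparse data; the dataset, coefficients, and penalty strengths are assumptions chosen for illustration, not the course's own lab code.

```python
# Illustrative contrast of ridge and lasso shrinkage on sparse synthetic data.
import numpy as np
from sklearn.linear_model import Ridge, Lasso

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 10))
beta = np.zeros(10)
beta[:3] = [3.0, -2.0, 1.5]          # only 3 of 10 predictors truly matter
y = X @ beta + rng.normal(scale=0.5, size=100)

ridge = Ridge(alpha=1.0).fit(X, y)   # shrinks all coefficients toward zero
lasso = Lasso(alpha=0.1).fit(X, y)   # can set some coefficients exactly to zero

ridge_nonzero = int(np.sum(ridge.coef_ != 0))
lasso_nonzero = int(np.sum(lasso.coef_ != 0))
print("ridge nonzero coefficients:", ridge_nonzero)
print("lasso nonzero coefficients:", lasso_nonzero)
```

The qualitative difference is the point: ridge keeps every predictor with a damped coefficient, while the lasso's L1 penalty performs variable selection by zeroing out weak ones.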

Lessons

1. Preface

2. Introduction

  • An Overview of Statistical Learning
  • A Brief History of Statistical Learning
  • This Course
  • Who Should Read This Course?
  • Notation and Simple Matrix Algebra
  • Organization of This Course
  • Data Sets Used in Labs and Exercises

3. Statistical Learning

  • What is Statistical Learning?
  • Assessing Model Accuracy
  • Lab: Introduction to Python
  • Exercises

4. Linear Regression

  • Simple Linear Regression
  • Multiple Linear Regression
  • Other Considerations in the Regression Model
  • The Marketing Plan
  • Comparison of Linear Regression with K-Nearest Neighbors
  • Lab: Linear Regression
  • Exercises

5. Classification

  • An Overview of Classification
  • Why Not Linear Regression?
  • Logistic Regression
  • Generative Models for Classification
  • A Comparison of Classification Methods
  • Generalized Linear Models
  • Lab: Logistic Regression, LDA, QDA, and KNN
  • Exercises

6. Resampling Methods

  • Cross-Validation
  • The Bootstrap
  • Lab: Cross-Validation and the Bootstrap
  • Exercises

7. Linear Model Selection and Regularization

  • Subset Selection
  • Shrinkage Methods
  • Dimension Reduction Methods
  • Considerations in High Dimensions
  • Lab: Linear Models and Regularization Methods
  • Exercises

8. Moving Beyond Linearity

  • Polynomial Regression
  • Step Functions
  • Basis Functions
  • Regression Splines
  • Smoothing Splines
  • Local Regression
  • Generalized Additive Models
  • Lab: Non-Linear Modeling
  • Exercises

9. Tree-Based Methods

  • The Basics of Decision Trees
  • Bagging, Random Forests, Boosting, and Bayesian Additive Regression Trees
  • Lab: Tree-Based Methods
  • Exercises

10. Support Vector Machines

  • Maximal Margin Classifier
  • Support Vector Classifiers
  • Support Vector Machines
  • SVMs with More than Two Classes
  • Relationship to Logistic Regression
  • Lab: Support Vector Machines
  • Exercises

11. Deep Learning

  • Single Layer Neural Networks
  • Multilayer Neural Networks
  • Convolutional Neural Networks
  • Document Classification
  • Recurrent Neural Networks
  • When to Use Deep Learning
  • Fitting a Neural Network
  • Interpolation and Double Descent
  • Lab: Deep Learning
  • Exercises

12. Survival Analysis and Censored Data

  • Survival and Censoring Times
  • A Closer Look at Censoring
  • The Kaplan-Meier Survival Curve
  • The Log-Rank Test
  • Regression Models With a Survival Response
  • Shrinkage for the Cox Model
  • Additional Topics
  • Lab: Survival Analysis
  • Exercises

13. Unsupervised Learning

  • The Challenge of Unsupervised Learning
  • Principal Components Analysis
  • Missing Values and Matrix Completion
  • Clustering Methods
  • Lab: Unsupervised Learning
  • Exercises

14. Multiple Testing

  • A Quick Review of Hypothesis Testing
  • The Challenge of Multiple Testing
  • The Family-Wise Error Rate
  • The False Discovery Rate
  • A Re-Sampling Approach to p-Values and False Discovery Rates
  • Lab: Multiple Testing
  • Exercises

Labs

1. Introduction

  • Analyzing the Wage Dataset
  • Analyzing Stock Market Trends Using the Smarket Dataset

2. Statistical Learning

  • Implementing the Bayes Classifier
  • Implementing the Bias-Variance Trade-Off
  • Indexing the Data

3. Linear Regression

  • Implementing Qualitative Predictors Using the Credit Dataset
  • Implementing Non-Linear Transformations of Predictors
  • Performing Multiple Linear Regression
  • Implementing Simple Linear Regression

4. Classification

  • Implementing Multiple Logistic Regression
  • Implementing Multinomial Logistic Regression
  • Generating and Visualizing a Multivariate Gaussian Distribution
  • Implementing GLM
  • Implementing Poisson Regression
  • Implementing KNN on the Caravan Dataset
  • Implementing Naive Bayes Classification
  • Implementing QDA
  • Implementing LDA

5. Resampling Methods

  • Implementing LOOCV
  • Implementing Bootstrapping Techniques on the Portfolio Dataset
  • Implementing K-Fold Cross-Validation
  • Implementing the Validation Set Approach

6. Linear Model Selection and Regularization

  • Implementing Forward and Backward Stepwise Selection
  • Improving Predictions with PCR
  • Implementing PLS
  • Implementing Lasso Regression
  • Implementing Ridge Regression
  • Implementing Subset Selection Methods Using the Hitters Dataset

7. Moving Beyond Linearity

  • Implementing Splines
  • Implementing a Step Function
  • Implementing GAMs
  • Implementing Polynomial Regression

8. Tree-Based Methods

  • Building and Analyzing a Classification Tree Using the Carseats Dataset
  • Improving Model Performance Using Boosting
  • Implementing Bagging and Random Forests
  • Fitting Regression Trees

9. Support Vector Machines

  • Implementing the Maximal Margin Classifier
  • Creating and Analyzing an ROC Curve
  • Implementing SVM with Multiple Classes
  • Implementing SVC

10. Deep Learning

  • Implementing RNN for Time Series Prediction
  • Creating an Image Classifier Using CNNs

11. Survival Analysis and Censored Data

  • Implementing the Kaplan-Meier Survival Curve
  • Applying the Log-Rank Test
  • Incorporating Shrinkage Techniques into the Cox Model

12. Unsupervised Learning

  • Implementing a Dendrogram
  • Analyzing the NCI60 Dataset
  • Implementing K-Means Clustering

13. Multiple Testing

  • Implementing Holm's Step-Down Procedure
  • Implementing the BH Procedure
  • Implementing FDR
  • Implementing FWER

Any questions?
Check out the FAQs

Still have unanswered questions and need to get in touch?

Contact Us Now

Who is this course for?

This program is ideal for data scientists, statisticians, and software developers who want to master statistical learning using Python. It is perfect for those transitioning from basic analytics to advanced predictive modeling.

Does the course cover modern topics beyond classical statistics?

Yes! Beyond classical models, the course features dedicated modules on deep learning (CNNs and RNNs) and unsupervised learning techniques like clustering and matrix completion.

How deeply are Support Vector Machines covered?

We go deep into the mechanics of Support Vector Machines, covering everything from Maximal Margin Classifiers to kernels and ROC curves, ensuring you can handle even the most complex classification boundaries.
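As a flavor of that material, here is a minimal sketch of a kernel SVM and its ROC AUC, using scikit-learn on a synthetic two-class dataset; the dataset and hyperparameters are illustrative assumptions, not the course's lab code.

```python
# Hedged sketch: RBF-kernel SVM plus an AUC computed from its decision function.
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import roc_auc_score

# Two interleaving half-moons: a classic non-linearly-separable toy problem
X, y = make_moons(n_samples=400, noise=0.25, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

svm = SVC(kernel="rbf", C=1.0, gamma="scale")   # non-linear decision boundary
svm.fit(X_train, y_train)

scores = svm.decision_function(X_test)          # signed distances to the boundary
auc = roc_auc_score(y_test, scores)
print(f"test AUC: {auc:.3f}")
```

Using the continuous decision-function scores, rather than hard class labels, is what lets the ROC curve trace out every possible classification threshold.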

Is the course theoretical or practical?

It is a balanced approach. While we cover the mathematical notation, the core of the course is heavily focused on practice, featuring extensive labs on linear regression, resampling methods, and validation strategies like cross-validation.
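For instance, the cross-validation workflow taught in the resampling labs can be sketched in a few lines of scikit-learn; the synthetic dataset and settings below are assumptions for illustration, not the course's own labs.

```python
# Hedged sketch of 5-fold cross-validation for a linear regression model.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

# Synthetic regression problem with 5 informative predictors
X, y = make_regression(n_samples=150, n_features=5, n_informative=5,
                       noise=10.0, random_state=0)

# 5-fold CV: fit on 4 folds, score on the held-out fold, repeat 5 times
cv_r2 = cross_val_score(LinearRegression(), X, y, cv=5, scoring="r2")
print("per-fold R^2:", np.round(cv_r2, 3))
print("mean R^2:", cv_r2.mean())
```

Averaging the five held-out scores gives a lower-variance estimate of test performance than a single train/validation split.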
