From the course: Complete Guide to AI and Data Science for SQL Developers: From Beginner to Advanced
Unlock the full course today
Join today to access over 23,200 courses taught by industry experts.
Checking assumptions for linear regression - SQL Tutorial
From the course: Complete Guide to AI and Data Science for SQL Developers: From Beginner to Advanced
Checking assumptions for linear regression
- [Instructor] What's up? And welcome back. After your model refinement in the last step, step 16, where you trimmed away less significant variables to create a more efficient model, the next logical step is to ensure your model is performing as expected. So why are we doing this now? Well, think of it as similar to testing a newly built car to make sure it runs smoothly. When you've honed your model down to a more streamlined version, you want to ensure it behaves reliably. To do that, you assess a set of fundamental assumptions that have become a crucial part of linear regression analysis, and here they are. Starting with assumption one, check for mean residuals. Imagine your predictions as darts thrown on a dartboard. You aim for the bullseye, and ideally, you want your darts to land evenly around it. If on average you consistently miss the mark, it suggests your model needs some adjustment. Assumption number two, check…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
-
-
-
-
-
-
-
(Locked)
Creating the linear regression model and model summary: Part 19m 33s
-
(Locked)
Creating the linear regression model and model summary: Part 27m 16s
-
(Locked)
Creating the linear regression model and model summary: Part 35m 33s
-
(Locked)
Dropping insignificant variables and re-creating the model7m 57s
-
(Locked)
Checking assumptions for linear regression3m 18s
-
(Locked)
Assumption 1: Checking for mean residuals2m 47s
-
(Locked)
Assumption 2: Checking homoscedasticity3m 13s
-
(Locked)
Assumption 3: Checking linearity2m 12s
-
(Locked)
Assumption 4: Checking normality of error terms3m 24s
-
(Locked)
Q-Q plot for checking the normality of error terms3m 14s
-
(Locked)
Model performance comparison on train and test data6m 7s
-
(Locked)
Applying cross-validation and evaluation4m 40s
-
(Locked)
Challenge: Model building48s
-
(Locked)
Solution: Model building1m 16s
-
(Locked)
-
-
-