
Evaluating the Fit of Linear Models


1 / 50

Etherpad



https://etherpad.wikimedia.org/p/607-lm-2022

2 / 50

Putting Linear Regression Into Practice with Pufferfish

  • Pufferfish are toxic/harmful to predators

  • Batesian mimics gain protection from predation - why?

  • Evolved response to appearance?

  • Researchers tested with mimics varying in toxic pufferfish resemblance

3 / 50

Question of the day: Does Resembling a Pufferfish Reduce Predator Visits?

4 / 50

Digging Deeper into Regression

  1. Assumptions: Is our fit valid?

  2. How did we fit this model?

5 / 50

You are now a Statistical Wizard. Be Careful. Your Model is a Golem.

(sensu Richard McElreath)

6 / 50

A Case of "Great" versus "Not as Great" Fits...

7 / 50

The Two Fits





8 / 50

Assumptions (in rough descending order of importance)

  1. Validity

  2. Representativeness

  3. Model captures features in the data

  4. Additivity and Linearity

  5. Independence of Errors

  6. Equal Variance of Errors

  7. Normality of Errors

  8. Minimal Outlier Influence

9 / 50

Validity: Do X and Y Reflect the Concepts I'm Interested In?

What if predator approaches are not a good measure of recognition? Or what if the mimics just don't look like the pufferfish?

10 / 50

Solution to lack of validity:

Reframe your question! Change your framing! Question your life choices!

11 / 50

Representativeness: Does Your Data Represent the Population?

For example, say this is your result...

12 / 50

But is that all there is to X in nature?

13 / 50

Representativeness: Does Your Data Represent the Population?

What if you are looking at only a piece of the variation in X in your population?

14 / 50

Representativeness: Does Your Data Represent the Population?

How should you have sampled this population for a representative result?

15 / 50

Representativeness: Does Your Data Represent the Population?

It's better to have more variation in X than just a bigger N

16 / 50

Representativeness: Does Your Data Represent the Population?

  • Always question if you did a good job sampling

  • Use natural history and the literature to get the bounds of values

  • If experimenting, make sure your treatment levels are representative

  • If you realize post-hoc they are not, qualify your conclusions

17 / 50

Model captures features in the data

Does the model seem to fit the data? Are there any deviations? This can be hard to see...

18 / 50

Simulating implications from the model to see if we match features in the data

Is anything off?
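One way to check, sketched in base R below (the puffer data frame and its predators/resemblance columns are hypothetical stand-ins for the pufferfish data): simulate new responses from the fitted model and see whether their distribution matches the observed one.

fit  <- lm(predators ~ resemblance, data = puffer)  # hypothetical names
sims <- simulate(fit, nsim = 100)                   # 100 simulated response vectors

plot(density(puffer$predators), lwd = 2, main = "Observed vs. simulated")
for (i in 1:100) lines(density(sims[[i]]), col = "grey80")
lines(density(puffer$predators), lwd = 2)           # redraw observed on top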

19 / 50

But what do wolves say to you?

20 / 50

Additivity and Linearity: the model should account for all systematic variation, leaving no pattern between residuals and fitted values - this is what you want

21 / 50

Additivity and Linearity: Wolf Problems?

Solutions: Nonlinear transformations or a better model!

22 / 50

Independence of Errors

  • Are all replicates TRULY independent?

  • Did they come from the same space, time, etc.?

  • Non-independence can introduce BIAS

    • SEs too small (at the least)
    • Causal inference invalid
  • Incorporate non-independence into models (many methods - see the sketch below)
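A minimal sketch of one such method, assuming replicates are ordered in time in a hypothetical data frame dat: generalized least squares with an AR(1) correlation structure from the nlme package.

library(nlme)

# Allow residuals at successive time points to be correlated (AR1)
fit_ar1 <- gls(y ~ x, data = dat,
               correlation = corAR1(form = ~ time))
summary(fit_ar1)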

23 / 50

Equal Variance of Errors: No Pattern to Residuals and Fitted Values
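In R, a sketch of this check (assuming the hypothetical puffer fit from before):

fit <- lm(predators ~ resemblance, data = puffer)  # hypothetical fit
plot(fit, which = 1)  # residuals vs. fitted; we want a patternless cloud around 0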

24 / 50

Equal Variance of Errors: What Is Up with Intermediate Wolf Values?

25 / 50

Equal Variance of Errors: Problems and Solutions

  • Shapes (cones, footballs, etc.) with no bias in fitted v. residual relationship

  • A linear relationship indicates an additivity problem

  • Can solve with a better model (more predictors)

  • Can solve with weighting by X values, if the source of heteroskedasticity is known - see the sketch below

    • This actually means we model the variance as a function of X
    • $\epsilon_i \sim N(0, f(x_i))$
  • Minor problem for coefficient estimates

  • Major problem for doing inference and prediction, as it changes error
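A sketch of both options, assuming residual spread grows with x in a hypothetical data frame dat: weighted least squares with lm(), or modeling the variance as a power of x with nlme::gls().

# Weighted least squares: downweight high-variance (here, high-x) points
fit_wls <- lm(y ~ x, data = dat, weights = 1 / x)

# Or model the variance explicitly as a function of x
library(nlme)
fit_vp <- gls(y ~ x, data = dat,
              weights = varPower(form = ~ x))  # variance grows as a power of x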

26 / 50

Normality of errors: Did we fit the error generating process that we observed?

  • We assumed $\epsilon_i \sim N(0, \sigma)$ - but is that right?

  • Can assess with a QQ-plot - see the sketch below

    • Do quantiles of the residuals match quantiles of a normal distribution?
  • Again, a minor problem for coefficient estimates

  • A major problem for doing inference and prediction, as it changes error
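In base R, a sketch of that QQ check (hypothetical puffer fit, as before):

fit <- lm(predators ~ resemblance, data = puffer)  # hypothetical fit
qqnorm(residuals(fit))  # sample quantiles vs. theoretical normal quantiles
qqline(residuals(fit))  # points should fall close to this line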

27 / 50

Equal Variance of Errors: Puffers

28 / 50

Equal Variance of Errors: Wolves Underpredict at High Levels

29 / 50

Outliers: Cook's D
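A sketch of how to extract these values in base R (hypothetical puffer fit; the 4/n cutoff is just a common rule of thumb):

fit <- lm(predators ~ resemblance, data = puffer)  # hypothetical fit
cd  <- cooks.distance(fit)   # one value per observation
plot(fit, which = 4)         # Cook's distance by observation
which(cd > 4 / length(cd))   # rough rule-of-thumb flag for influence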

30 / 50

Leverage: Cook's D Scaled by Value

31 / 50

Leverage: Cook's D - wolves OK

32 / 50

Everyone worries about outliers, but...

  • Are they real?

  • Do they indicate a problem or a nonlinearity?

  • Remove only as a dead last resort

  • If from a nonlinearity, consider transformation

33 / 50

Assumptions (in rough descending order of importance)

  1. Validity: only you know!

  2. Representativeness: look at nature

  3. Model captures features in the data: compare model v. data!

  4. Additivity and Linearity: compare model v. data!

  5. Independence of Errors: consider sampling design

  6. Equal Variance of Errors: evaluate residual-fitted plots

  7. Normality of Errors: evaluate a QQ plot and Levene's test

  8. Minimal Outlier Influence: evaluate Cook's D

34 / 50

Digging Deeper into Regression

  1. Assumptions: Is our fit valid?

  2. How did we fit this model?

  3. How do we draw inference from this model?

35 / 50

So, uh.... How would you fit a line here?

36 / 50

Lots of Possible Lines - How would you decide?

37 / 50

Method of Model Fitting

  1. Least Squares

    • Conceptually Simple
    • Minimizes the sum of squared distances between fit and data
    • Approximations of quantities based on frequentist logic
  2. Likelihood

    • Flexible to many models
    • Produces likelihood surface of different parameters
    • Equivalent to LS for Gaussian likelihood
    • Approximations of quantities based on frequentist logic
  3. Bayesian

    • Incorporates prior knowledge
    • Probability of any parameter is proportional to likelihood × prior
    • Superior for quantifying uncertainty
    • With "flat" priors, equivalent to least squares/likelihood
    • Analytic or simulated calculation of quantities
38 / 50

Basic Principles of Least Squares Regression

$\hat{Y} = \beta_0 + \beta_1 X + \epsilon$ where $\beta_0$ = intercept, $\beta_1$ = slope

Minimize residuals, defined as $SS_{residuals} = \sum_i (Y_i - \hat{Y}_i)^2$
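To make the idea concrete, a sketch that minimizes this sum of squares numerically with optim() (hypothetical puffer data; lm() solves the same problem analytically):

ss_resid <- function(par, x, y) {
  yhat <- par[1] + par[2] * x   # par = c(intercept, slope)
  sum((y - yhat)^2)             # the quantity least squares minimizes
}

best <- optim(par = c(0, 0), fn = ss_resid,
              x = puffer$resemblance, y = puffer$predators)
best$par  # compare to coef(lm(predators ~ resemblance, data = puffer))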

39 / 50

Let's try it out!

40 / 50

Analytic Solution: Solving for Slope

$b = \frac{s_{xy}}{s_x^2} = \frac{cov(x,y)}{var(x)} = r_{xy}\frac{s_y}{s_x}$

41 / 50

Analytic Solution: Solving for Intercept

The least squares regression line always goes through the means of X and Y:

$\bar{Y} = \beta_0 + \beta_1 \bar{X}$

so

$\beta_0 = \bar{Y} - \beta_1 \bar{X}$
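Both formulas are easy to verify directly in R (hypothetical puffer data again):

b1 <- cov(puffer$resemblance, puffer$predators) / var(puffer$resemblance)
b0 <- mean(puffer$predators) - b1 * mean(puffer$resemblance)
c(intercept = b0, slope = b1)
# should match: coef(lm(predators ~ resemblance, data = puffer))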

42 / 50

Least Squares Visualized

43 / 50

Likelihood

  • Flexible to many models
  • Produces likelihood surface of different parameters
  • Equivalent to LS for Gaussian likelihood
  • Approximations of quantities based on frequentist logic

$\mathcal{L} = p(Data | parameters)$

$\mathcal{L}(\theta | D) = \prod_i dnorm(y_i, \mu = \beta_0 + \beta_1 x_i, \sigma)$

Deviance = -2 × Log Likelihood
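A sketch of that deviance as an R function, minimized by search with optim() (hypothetical puffer data; sigma is optimized on the log scale to keep it positive):

deviance_fun <- function(par, x, y) {
  mu <- par[1] + par[2] * x  # par = c(intercept, slope, log_sigma)
  -2 * sum(dnorm(y, mean = mu, sd = exp(par[3]), log = TRUE))
}

mle <- optim(par = c(1, 1, 0), fn = deviance_fun,
             x = puffer$resemblance, y = puffer$predators)
c(mle$par[1:2], sigma = exp(mle$par[3]))  # maximum likelihood estimates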

44 / 50

Likelihood: Minimizing Deviance (Maximizing Likelihood) by Search

Preliminary iteration .. Done
Profiling for parameter (Intercept) ... Done
Profiling for parameter resemblance ... Done

45 / 50

Bayesian

  • Incorporates prior knowledge
  • Probability of any parameter is proportional to likelihood × prior
  • Superior for quantifying uncertainty
  • With "flat" priors, equivalent to least squares/likelihood
  • Analytic or simulated calculation of quantities

$p(H|D) = \frac{p(D|H)\,p(H)}{p(D)}$
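A sketch of that formula as a brute-force grid approximation for the slope alone, holding the intercept and sigma fixed at their least squares estimates for clarity (all names hypothetical):

fit       <- lm(predators ~ resemblance, data = puffer)  # hypothetical fit
b0_hat    <- coef(fit)[1]
sigma_hat <- summary(fit)$sigma

slopes <- seq(0, 6, length.out = 200)        # grid of candidate slopes
prior  <- dnorm(slopes, mean = 0, sd = 10)   # a weak normal prior
lik <- sapply(slopes, function(b1) {
  mu <- b0_hat + b1 * puffer$resemblance
  prod(dnorm(puffer$predators, mean = mu, sd = sigma_hat))
})
posterior <- lik * prior / sum(lik * prior)  # likelihood * prior, normalized
plot(slopes, posterior, type = "l")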

46 / 50

Bayes: Creating a Posterior Probability Distribution

Searches $p(H|D) = \frac{p(D|H)\,p(H)}{p(D)}$

47 / 50

Bayes: Creating a Posterior Probability Distribution

48 / 50

Linear Regression - the Core of Everything

  • Make sure you meet assumptions
    • Don't burn down Prague
  • Many ways to fit
    • We will talk about inference later
    • The key is looking at estimated values and their implications
    • Look at precision - do you feel comfortable with inference?
49 / 50
