BUT - what if the effect of one variable depends on the level of another?
This is an INTERACTION and is quite common
BUT - what if the effect of one variable depends on the level of another?
This is an INTERACTION and is quite common
BUT - what if the effect of one variable depends on the level of another?
This is an INTERACTION and is quite common
Biology: The science of "It depends..."
This is challenging to think about and visualize, but if you can master it, you will go far!
^yi=β0+β1x1i+β2x2i yi∼N(^yi,σ)
^yi=β0+β1x1i+β2x2i yi∼N(^yi,σ)Could become...
^yi=β0+β1x1i+β2x21i
^yi=β0+β1x1i+β2x2i yi∼N(^yi,σ)Could become...
^yi=β0+β1x1i+β2x21iCould be...
^yi=β0+β1x1i+β2x2i+β3x1ix2i
^yi=β0+β1x1i+β2x2i yi∼N(^yi,σ)Could become...
^yi=β0+β1x1i+β2x21iCould be...
^yi=β0+β1x1i+β2x2i+β3x1ix2iIt is ALL additive terms
Replicating Categorical Variable Combinations: Factorial Models
Evaluating Interaction Effects
How to Look at Means and Differences with an Interaction Effect
Continuous Variables and Interaction Effects
Note: You can have as many treatment types or observed category combinations as you want (and then 3-way, 4-way, etc. interactions)
yijk=β0+∑βixi+∑βjxj+∑βijxij+ϵijk
ϵijk∼N(0,σ2) xi=0,1,xj=0,1,xij=0,1
yijk=β0+∑βixi+∑βjxj+∑βijxij+ϵijk
ϵijk∼N(0,σ2) xi=0,1,xj=0,1,xij=0,1
yijk=β0+∑βixi+∑βjxj+∑βijxij+ϵijk
ϵijk∼N(0,σ2) xi=0,1,xj=0,1,xij=0,1
Note the new last term
Deviation due to combination of categories i and j
yijk=β0+∑βixi+∑βjxj+∑βijxij+ϵijk
ϵijk∼N(0,σ2) xi=0,1,xj=0,1,xij=0,1
Note the new last term
Deviation due to combination of categories i and j
\Large \boldsymbol{Y} = \boldsymbol{\beta X} + \boldsymbol{\epsilon}
height | herbivores | sqrtarea |
---|---|---|
low | minus | 9.4055728 |
low | plus | 11.9767608 |
mid | minus | 0.7071068 |
mid | plus | 0.7071068 |
(Intercept) | heightmid | herbivoresplus | heightmid:herbivoresplus |
---|---|---|---|
1 | 0 | 0 | 0 |
1 | 0 | 1 | 0 |
1 | 1 | 0 | 0 |
1 | 1 | 1 | 1 |
graze_int <- lm(sqrtarea ~ height + herbivores + height:herbivores, data=algae)## ORgraze_int <- lm(sqrtarea ~ height*herbivores, data=algae)
Replicating Categorical Variable Combinations: Factorial Models
Evaluating Interaction Effects
How to Look at Means and Differences with an Interaction Effect
Continuous Variables and Interaction Effects
term | estimate | std.error |
---|---|---|
(Intercept) | 32.91450 | 3.855532 |
heightmid | -10.43090 | 5.452546 |
herbivoresplus | -22.51075 | 5.452546 |
heightmid:herbivoresplus | 25.57809 | 7.711064 |
term | estimate | std.error |
---|---|---|
(Intercept) | 32.91450 | 3.855532 |
heightmid | -10.43090 | 5.452546 |
herbivoresplus | -22.51075 | 5.452546 |
heightmid:herbivoresplus | 25.57809 | 7.711064 |
term | estimate | std.error |
---|---|---|
(Intercept) | 32.91450 | 3.855532 |
heightmid | -10.43090 | 5.452546 |
herbivoresplus | -22.51075 | 5.452546 |
heightmid:herbivoresplus | 25.57809 | 7.711064 |
Intercept chosen as basal condition (low, herbivores -)
Changing height to mid is associated with a loss of 10 units of algae relative to low/-
term | estimate | std.error |
---|---|---|
(Intercept) | 32.91450 | 3.855532 |
heightmid | -10.43090 | 5.452546 |
herbivoresplus | -22.51075 | 5.452546 |
heightmid:herbivoresplus | 25.57809 | 7.711064 |
Intercept chosen as basal condition (low, herbivores -)
Changing height to mid is associated with a loss of 10 units of algae relative to low/-
term | estimate | std.error |
---|---|---|
(Intercept) | 32.91450 | 3.855532 |
heightmid | -10.43090 | 5.452546 |
herbivoresplus | -22.51075 | 5.452546 |
heightmid:herbivoresplus | 25.57809 | 7.711064 |
Intercept chosen as basal condition (low, herbivores -)
Changing height to mid is associated with a loss of 10 units of algae relative to low/-
Adding herbivores is associated with a loss of 22 units of algae relative to low/-
BUT - if you add herbivores and mid, that's also associated with an additional increase of 25 units of algae relative to mid and + alone
term | estimate | std.error |
---|---|---|
(Intercept) | 32.91450 | 3.855532 |
heightmid | -10.43090 | 5.452546 |
herbivoresplus | -22.51075 | 5.452546 |
heightmid:herbivoresplus | 25.57809 | 7.711064 |
NEVER TRY AND INTERPRET ADDITIVE EFFECTS ALONE WHEN AN INTERACTION IS PRESENT
that way lies madness
# R2 for Linear Regression R2: 0.228 adj. R2: 0.190
Eh, not great, not bad...
# R2 for Linear Regression R2: 0.228 adj. R2: 0.190
Eh, not great, not bad...
Replicating Categorical Variable Combinations: Factorial Models
Evaluating Interaction Effects
How to Look at Means and Differences with an Interaction Effect
Continuous Variables and Interaction Effects
herbivores emmean SE df lower.CL upper.CL minus 27.7 2.73 60 22.2 33.2 plus 18.0 2.73 60 12.5 23.4Results are averaged over the levels of: height Confidence level used: 0.95
height emmean SE df lower.CL upper.CL low 21.7 2.73 60 16.2 27.1 mid 24.0 2.73 60 18.6 29.5Results are averaged over the levels of: herbivores Confidence level used: 0.95
contrast estimate SE df lower.CL upper.CL minus - plus 9.72 3.86 60 2.01 17.4Results are averaged over the levels of: height Confidence level used: 0.95
height herbivores emmean SE df lower.CL upper.CL low minus 32.9 3.86 60 25.20 40.6 mid minus 22.5 3.86 60 14.77 30.2 low plus 10.4 3.86 60 2.69 18.1 mid plus 25.6 3.86 60 17.84 33.3Confidence level used: 0.95
contrast | estimate | SE | df | t.ratio | p.value |
---|---|---|---|---|---|
low minus - mid minus | 10.430905 | 5.452546 | 60 | 1.913034 | 0.0605194 |
low minus - low plus | 22.510748 | 5.452546 | 60 | 4.128484 | 0.0001146 |
low minus - mid plus | 7.363559 | 5.452546 | 60 | 1.350481 | 0.1819337 |
mid minus - low plus | 12.079843 | 5.452546 | 60 | 2.215450 | 0.0305355 |
mid minus - mid plus | -3.067346 | 5.452546 | 60 | -0.562553 | 0.5758352 |
low plus - mid plus | -15.147189 | 5.452546 | 60 | -2.778003 | 0.0072896 |
contrast | estimate | SE | df | t.ratio | p.value |
---|---|---|---|---|---|
low minus - mid minus | 10.430905 | 5.452546 | 60 | 1.913034 | 0.0605194 |
low minus - low plus | 22.510748 | 5.452546 | 60 | 4.128484 | 0.0001146 |
low minus - mid plus | 7.363559 | 5.452546 | 60 | 1.350481 | 0.1819337 |
mid minus - low plus | 12.079843 | 5.452546 | 60 | 2.215450 | 0.0305355 |
mid minus - mid plus | -3.067346 | 5.452546 | 60 | -0.562553 | 0.5758352 |
low plus - mid plus | -15.147189 | 5.452546 | 60 | -2.778003 | 0.0072896 |
height = low: herbivores emmean SE df lower.CL upper.CL minus 32.9 3.86 60 25.20 40.6 plus 10.4 3.86 60 2.69 18.1height = mid: herbivores emmean SE df lower.CL upper.CL minus 22.5 3.86 60 14.77 30.2 plus 25.6 3.86 60 17.84 33.3Confidence level used: 0.95
It Depends is a rule in biology
Context dependent interactions everywhere
It Depends is a rule in biology
Context dependent interactions everywhere
Using categorical predictors in a factorial design is an elegant way to see interactions without worrying about shapes of relationships
It Depends is a rule in biology
Context dependent interactions everywhere
Using categorical predictors in a factorial design is an elegant way to see interactions without worrying about shapes of relationships
BUT - it all comes down to a general linear model! And the same inferential frameworks we have been dealing with since linear regression!
Replicating Categorical Variable Combinations: Factorial Models
Evaluating Interaction Effects
How to Look at Means and Differences with an Interaction Effect.
Continuous Variables and Interaction Effects
y_i = \beta_0 + \beta_{1}x_{1i} + \beta_{2}x_{2i}+ \beta_{3}x_{1i}x_{2i} + \epsilon_{i}
\epsilon_{i} \sim \mathcal{N}(0,\sigma^2)
Or in code:
keeley_lm_int <- lm(firesev ~ age * elev, data=keeley)
# Check for MulticollinearityLow Correlation Term VIF VIF 95% CI Increased SE Tolerance Tolerance 95% CI age 3.20 [2.37, 4.52] 1.79 0.31 [0.22, 0.42]Moderate Correlation Term VIF VIF 95% CI Increased SE Tolerance Tolerance 95% CI elev 5.52 [3.95, 7.92] 2.35 0.18 [0.13, 0.25] age:elev 8.29 [5.83, 11.98] 2.88 0.12 [0.08, 0.17]
# Check for MulticollinearityLow Correlation Term VIF VIF 95% CI Increased SE Tolerance Tolerance 95% CI age 3.20 [2.37, 4.52] 1.79 0.31 [0.22, 0.42]Moderate Correlation Term VIF VIF 95% CI Increased SE Tolerance Tolerance 95% CI elev 5.52 [3.95, 7.92] 2.35 0.18 [0.13, 0.25] age:elev 8.29 [5.83, 11.98] 2.88 0.12 [0.08, 0.17]
Collinearities between additive predictors and interaction effects are not problematic.
However, you should make sure your ADDITIVE predictors do not have VIF problems in a model with no interactions.
# Check for MulticollinearityLow Correlation Term VIF VIF 95% CI Increased SE Tolerance Tolerance 95% CI age 3.20 [2.37, 4.52] 1.79 0.31 [0.22, 0.42]Moderate Correlation Term VIF VIF 95% CI Increased SE Tolerance Tolerance 95% CI elev 5.52 [3.95, 7.92] 2.35 0.18 [0.13, 0.25] age:elev 8.29 [5.83, 11.98] 2.88 0.12 [0.08, 0.17]
Collinearities between additive predictors and interaction effects are not problematic.
However, you should make sure your ADDITIVE predictors do not have VIF problems in a model with no interactions.
\huge X_i - \bar{X}
\huge X_i - \bar{X}
\huge X_i - \bar{X}
Additive coefficients are the effect of a predictor at the mean value of the other predictors
Intercepts are at the mean value of all predictors
\huge X_i - \bar{X}
Additive coefficients are the effect of a predictor at the mean value of the other predictors
Intercepts are at the mean value of all predictors
Visualization will keep you from getting confused!
y = \beta_0 + \beta_{1}(x_{1}-\bar{x_{1}}) + \beta_{2}(x_{2}-\bar{x_{2}})+ \beta_{3}(x_{1}-\bar{x_{1}})(x_{2}-\bar{x_{2}})
Variance Inflation Factors for Centered Model:
# Check for MulticollinearityLow Correlation Term VIF VIF 95% CI Increased SE Tolerance Tolerance 95% CI age_c 1.02 [1.00, 2376.13] 1.01 0.98 [0.00, 1.00] elev_c 1.04 [1.00, 7.04] 1.02 0.96 [0.14, 1.00] age_c:elev_c 1.04 [1.00, 9.78] 1.02 0.96 [0.10, 1.00]
term | estimate | std.error | statistic | p.value |
---|---|---|---|---|
(Intercept) | 1.8132153 | 0.6156070 | 2.945411 | 0.0041484 |
age | 0.1206292 | 0.0208618 | 5.782298 | 0.0000001 |
elev | 0.0030852 | 0.0013329 | 2.314588 | 0.0230186 |
age:elev | -0.0001472 | 0.0000431 | -3.416029 | 0.0009722 |
R2 = 0.3235187.
Note that additive coefficients signify the effect of one predictor in the abscence of all others.
term | estimate | std.error | statistic | p.value |
---|---|---|---|---|
(Intercept) | 4.6091266 | 0.1463029 | 31.503991 | 0.0000000 |
age_c | 0.0581123 | 0.0117591 | 4.941901 | 0.0000038 |
elev_c | -0.0006786 | 0.0005792 | -1.171587 | 0.2445985 |
age_c:elev_c | -0.0001472 | 0.0000431 | -3.416029 | 0.0009722 |
R2 = 0.3235187
Note that additive coefficients signify the effect of one predictor at the average level of all others.
What the heck does a continuous interaction effect mean?
We can look at the effect of one variable at different levels of the other
What the heck does a continuous interaction effect mean?
We can look at the effect of one variable at different levels of the other
We can look at a surface
What the heck does a continuous interaction effect mean?
We can look at the effect of one variable at different levels of the other
We can look at a surface
We can construct counterfactual plots showing how changing both variables influences our outcome
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |