This extension to the dolphin data analysis is for those who may be a bit ahead and would like to :
I recommend you continue with your previous R script for the GLM_2 exercise, in your RStudio Project.
13. To address the limitations of the previous analysis, fit a new
model with categorical predictors only, and interactions between them,
two by two:
fTide4 + fMonth + fTime6 + fTide4:fMonth + fTide4:fTime6 + fMonth:fTime6
.
What hypotheses do these interactions correspond to?
14. Perform model selection “by hand” using the AIC, and construct an
AIC table, using the example in the lecture. “By hand” means without
using drop1
or step
functions. You can get the
AIC value from the model summary summary(YourModel)
or by
typing AIC(YourModel)
. There are 18 possible models in
total, including the full model above. You can choose to evaluate all 18
models in a completely exploratory (the “ignorant and brave”) approach,
or only a selection of models based on more specific research questions
or predictions of your own (the “clear thinker” approach).
15. Do the validation of your best model, using the approach taken in question 7.
16. Interpret the model, using plots of the predictions (use the approach taken in question 9) .
End of the optional extension to Binomial (Bernoulli) GLM exercise
For info, the publication here offers a different approach to analysing these data, using slightly fancier GLMs with smooth terms (called GAMs, for Generalized Additive Models), and a few additional refinements: [https://www.nature.com/articles/s41598-019-38900-4]. What assumptions differ between this and your approach?