12.3 A simple additive model
Now that we have tools available to estimate smooth curves and surfaces, linear regression models can be extended to additive models as The are functions whose shapes are unrestricted, apart from an assumption of smoothness and the constraint, for identifiability, that for each . As a consequence, we usually estimate by . This gives a very flexible set of modelling tools.
To see one approach to how these models can be fitted, consider the case of only two covariates, A rearrangement of this as suggests that an estimate of component can then be obtained by smoothing the residuals of the data after fitting , and that, similarly, subsequent estimates of can be obtained as These smoothing operations are repeated until convergence. This is called the backfitting algorithm.
However, the regression and p-spline approaches described above allow models to be fitted directly. Each term can be represented by its own set of basis functions to create the usual linear model form , where now denotes a collections of several sets of coefficients, one for each term, and the columns of similarly cover the basis functions for all terms in the model. Combined with a set of penalty matrices and associated penalty parameters, the direct solution still applies, where now denotes the sum of the individual penalty matrices weighted by their associated penalty parameters.
Here is an example of an additive model fitted to the trawl data. This uses the gam
function in the mgcv
package in R
, which is a very powerful set of tools comprehensively described by Wood (2017). There are many options for how the model is fitted but here we will simply accept the defaults.
library(sm)
library(mgcv)
library(gratia)
ind <- (trawl$Year==0 & trawl$Zone==1)
trawl.model <- gam(Score1 ~ s(Latitude) + s(Longitude), data = trawl, subset = ind)
draw(trawl.model, residuals = TRUE)
The non-linear effect of Longitude is apparent, as we saw in earlier examples. Here the model selection methods included in mgcv
have led to a proposed linear relationship with Latitude.