12.3 A simple additive model

Now that we have tools available to estimate smooth curves and surfaces, linear regression models can be extended to additive models as yi=β0+s1(x1i)++sp(xpi)+εi,i=1,,n. The mi are functions whose shapes are unrestricted, apart from an assumption of smoothness and the constraint, for identifiability, that isj(xji)=0 for each j. As a consequence, we usually estimate β0 by y¯. This gives a very flexible set of modelling tools.

To see one approach to how these models can be fitted, consider the case of only two covariates, yi=β0+s1(x1i)+s2(x2i)+εi,i=1,,n. A rearrangement of this as yiβ0s2(x2i)=s1(x1i)+εi suggests that an estimate of component m1 can then be obtained by smoothing the residuals of the data after fitting s^2, m^1=S1(yy¯s^2) and that, similarly, subsequent estimates of s2 can be obtained as m^2=S2(yy¯s^1). These smoothing operations are repeated until convergence. This is called the backfitting algorithm.

However, the regression and p-spline approaches described above allow models to be fitted directly. Each term s1,s2, can be represented by its own set of basis functions to create the usual linear model form y=Bβ, where β now denotes a collections of several sets of coefficients, one for each term, and the columns of B similarly cover the basis functions for all terms in the model. Combined with a set of penalty matrices and associated penalty parameters, the direct solution β^=(BTB+P)1BTy still applies, where P now denotes the sum of the individual penalty matrices weighted by their associated penalty parameters.

Here is an example of an additive model fitted to the trawl data. This uses the gam function in the mgcv package in R, which is a very powerful set of tools comprehensively described by Wood (2017). There are many options for how the model is fitted but here we will simply accept the defaults.

library(sm)
library(mgcv)
library(gratia)
ind         <- (trawl$Year==0 & trawl$Zone==1)
trawl.model <- gam(Score1 ~ s(Latitude) + s(Longitude), data = trawl, subset = ind)
draw(trawl.model, residuals = TRUE)

The non-linear effect of Longitude is apparent, as we saw in earlier examples. Here the model selection methods included in mgcv have led to a proposed linear relationship with Latitude.