4 The GLM Model

Let \(y_{1},\ldots,y_{n}\) denote \(n\) independent observations on a response. We treat \(y_{i}\) as a realisation of a random variable \(Y_{i}\). In the general linear model we assume that \(Y_{i}\) has a normal distribution with mean \(\mu_{i}\) and variance \(\sigma^{2}\) \[Y_{i}\sim N(\mu_{i},\sigma^{2}),\] and we further assume that the expected value \(\mu_{i}\) is a linear function of \(p\) predictors that take values \(\mathbf{x}'_{i}=(x_{i1},\ldots,x_{ip})\) for the \(i^{th}\) case, so that \[\mu_{i}=\mathbf{x}_{i}\mathbf{\beta}\] where \(\mathbf{\beta}\) is a vector of unknown parameters. We will generalise this in two steps, dealing with the stochastic and systematic components of the model.