Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
If you read my post about Fast Bayesian Inference with INLA you might wonder which models are included within the class of latent Gaussian models (LGM), and can therefore be fitted with INLA. Next I will give a general definition about LGM and later I will describe three completely different examples that belong to this class. The first example will be a mixed effects model, the second will be useful in a time series context while the third will incorporate spatial dependence. All these examples, among others, can be found on the examples and tutorials page in the INLA website.
Latent Gaussian Models
Generally speaking, the class of LGM can be represented by a hierarchical structure containing three stages. The first stage is formed by the conditionally independent likelihood function
,
where
The second stage is formed by the latent Gaussian field, where we attribute a Gaussian distribution with mean
.
Finally, the third stage is formed by the prior distribution assigned to the hyperparameters,
.
That is the general definition and if you can fit your model into this hierarchical structure you can say it is a LGM. Sometimes it is not easy to identify that a particular model can be written in the above hierarchical form. In some cases, you can even rewrite your model so that it can then fit into this class. Some examples can better illustrate the importance and generality of LGM.
Mixed-effects model
The description of the EPIL example can be found on the OpenBUGS example manual. It is an example of repeated measures of Poisson counts. The mixed model is given by
- First stage: We have
individuals, each with 4 successive seizure counts. We assume the counts follow a conditionally independent Poisson likelihood function.
- Second stage: We account for linear effects on some covariates
for each individual, as well as random effects on the individual level, , and on the observation level, .So, in this case
. A Gaussian prior was assigned for each element of the latent field, so that is Gaussian distributed. - Third stage:
, where
Here you can find the data and INLA code to fit this model.
Smoothing time series of binomial data
The number of occurrences of rainfall over 1 mm in the Tokyo area for each calendar year during two years (1983-84) are registered. It is of interest to estimate the underlying probability
- First stage: A conditionally independent binomial likelihood function
with logit link function
- Second stage: We assume that
, where follows a circular random walk 2 (RW2) model with precision . The RW2 model is defined byThe fact that we use a circular model here means that in this case,
is a neighbor of , since it makes sense to assume that the last day of the year has a similar effect when compared with the first day. So, in this case and again is Gaussian distributed. - Third stage:
, where
Here you can find the data and INLA code to fit this model, and below is the posterior mean of
Disease mapping in Germany
Larynx cancer mortality counts are observed in the 544 district of Germany from 1986 to 1990. Together with the counts, for each district, the level of smoking consumption c is registered. The model is given by
- First stage: We assume the data to be conditionally independent Poisson random variables with mean
, where is fixed and accounts for demographic variation, and is the log-relative risk. - Second stage: The linear predictor
is formed by an intercept , an smooth function of the smoking consumption covariate , an spatial model over different district locations , and an unstructured random effect to account for overdispersion. So thatand the detailed description of each of the above model components can be found here. But again (surprise surprise)
is Gaussian distributed. - Third stage:
is formed by the precision parameters associated with the smooth model for the covariate , the spatial model and the random effect , all of which we assign gamma priors.
Here you can find the data and INLA code to fit this model, and below is the posterior mean of the spatial effect in each of the 544 districts in Germany.
Further reading:
– Examples and tutorials page in the INLA website.
– Slides of a Bayesian computation with INLA short-course I gave in the AS2013 conference in Ribno, Slovenia.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.