LGCPs - Multiple Likelihoods
Fabian E. Bachl
Generated on 2024-11-18
Source: vignettes/articles/2d_lgcp_multilikelihood.Rmd
Introduction
For this vignette we are going to be working with inlabru's `gorillas_sf` dataset, which was originally obtained from the R package spatstat. The dataset contains two types of gorilla nests, marked as either major or minor. We will set up a multi-likelihood model for these nests, consisting of two spatial LGCPs that share a common intercept but employ different spatial smoothers.
Get the data
For the next few practicals we are going to be working with a dataset obtained from the R package spatstat, which contains the locations of 647 gorilla nests. We load the dataset like this:
data(gorillas_sf, package = "inlabru")
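The code below also uses the inlabru, INLA, and ggplot2 packages (patchwork is attached later for combining plots); a minimal setup sketch, assuming these packages are installed:
library(inlabru)
library(INLA)
library(ggplot2)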
Plot the nests and visualize the group membership (major/minor) by color:
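A minimal sketch of such a plot, using plain ggplot2 geom_sf() layers rather than inlabru's plotting helpers:
ggplot() +
  geom_sf(data = gorillas_sf$boundary, fill = NA) +
  geom_sf(data = gorillas_sf$nests, aes(colour = group), size = 0.5) +
  ggtitle("Gorilla nests by group")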
Fitting the model
First, we define all components that enter the joint model: the intercept that is common to both LGCPs, and two spatial SPDE components, a Common smoother shared by both nest groups and a Difference smoother that captures how the two groups deviate from it.
# PC-prior Matern SPDE model, used by both spatial components:
# P(range < 0.1) = 0.01 and P(sigma > 1) = 0.01
matern <- inla.spde2.pcmatern(gorillas_sf$mesh,
  prior.range = c(0.1, 0.01),
  prior.sigma = c(1, 0.01)
)

cmp <- ~
  Common(geometry, model = matern) +
  Difference(geometry, model = matern) +
  Intercept(1)
Given these components we define the linear predictor for each of the likelihoods. (Using "." indicates a pure additive model, and one can use the include/exclude options of like() to indicate which components are actively involved in each model.)
fml.major <- geometry ~ Intercept + Common + Difference / 2
fml.minor <- geometry ~ Intercept + Common - Difference / 2
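As a side note, a hypothetical sketch of the include mechanism mentioned above (not used in this vignette, since the scaled Difference / 2 term requires an explicit predictor expression):
# Hypothetical: a purely additive predictor written as "geometry ~ .",
# with the active components selected via 'include'
lik_sketch <- like("cp",
  formula = geometry ~ .,
  include = c("Intercept", "Common"),
  data = gorillas_sf$nests,
  samplers = gorillas_sf$boundary,
  domain = list(geometry = gorillas_sf$mesh)
)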
Setting up the Cox process likelihoods is easy in this example. Both nest types were observed within the same window:
lik_major <- like("cp",
  formula = fml.major,
  data = gorillas_sf$nests[gorillas_sf$nests$group == "major", ],
  samplers = gorillas_sf$boundary,
  domain = list(geometry = gorillas_sf$mesh)
)
lik_minor <- like("cp",
  formula = fml.minor,
  data = gorillas_sf$nests[gorillas_sf$nests$group == "minor", ],
  samplers = gorillas_sf$boundary,
  domain = list(geometry = gorillas_sf$mesh)
)
… which we provide to the `bru()` function.
# Use "eb" integration and a single linearisation iteration to keep the vignette fast
jfit <- bru(cmp, lik_minor, lik_major,
  options = list(
    control.inla = list(
      int.strategy = "eb"
    ),
    bru_max_iter = 1
  )
)
library(patchwork)
# Plot exp() of the posterior means of the Common and Difference fields,
# on a log colour scale
pl.major <- ggplot() +
  gg(gorillas_sf$mesh,
    mask = gorillas_sf$boundary,
    col = exp(jfit$summary.random$Common$mean)
  )
pl.minor <- ggplot() +
  gg(gorillas_sf$mesh,
    mask = gorillas_sf$boundary,
    col = exp(jfit$summary.random$Difference$mean)
  )
(pl.major + scale_fill_continuous(trans = "log")) +
  (pl.minor + scale_fill_continuous(trans = "log")) &
  theme(legend.position = "right")
Rerunning
Rerunning with the previous estimate as starting point sometimes improves the accuracy of the posterior distribution estimation.
jfit0 <- jfit
jfit <- bru_rerun(jfit)
pl.major <- ggplot() +
  gg(gorillas_sf$mesh,
    mask = gorillas_sf$boundary,
    col = exp(jfit$summary.random$Common$mean)
  )
pl.minor <- ggplot() +
  gg(gorillas_sf$mesh,
    mask = gorillas_sf$boundary,
    col = exp(jfit$summary.random$Difference$mean)
  )
(pl.major + scale_fill_continuous(trans = "log")) +
  (pl.minor + scale_fill_continuous(trans = "log")) &
  theme(legend.position = "right")
summary(jfit0)
#> inlabru version: 2.11.1.9025
#> INLA version: 24.11.17
#> Components:
#> Common: main = spde(geometry), group = exchangeable(1L), replicate = iid(1L), NULL
#> Difference: main = spde(geometry), group = exchangeable(1L), replicate = iid(1L), NULL
#> Intercept: main = linear(1), group = exchangeable(1L), replicate = iid(1L), NULL
#> Likelihoods:
#> Family: 'cp'
#> Tag: ''
#> Data class: 'sf', 'data.frame'
#> Response class: 'numeric'
#> Predictor: geometry ~ Intercept + Common - Difference/2
#> Used components: effects[Common, Difference, Intercept], latent[]
#> Family: 'cp'
#> Tag: ''
#> Data class: 'sf', 'data.frame'
#> Response class: 'numeric'
#> Predictor: geometry ~ Intercept + Common + Difference/2
#> Used components: effects[Common, Difference, Intercept], latent[]
#> Time used:
#> Pre = 0.652, Running = 23.3, Post = 0.164, Total = 24.1
#> Fixed effects:
#> mean sd 0.025quant 0.5quant 0.975quant mode kld
#> Intercept -0.346 1.375 -3.041 -0.346 2.349 -0.346 0
#>
#> Random effects:
#> Name Model
#> Common SPDE2 model
#> Difference SPDE2 model
#>
#> Model hyperparameters:
#> mean sd 0.025quant 0.5quant 0.975quant mode
#> Range for Common 2.935 0.572 1.994 2.872 4.239 2.739
#> Stdev for Common 2.047 0.333 1.483 2.016 2.789 1.949
#> Range for Difference 2.067 2.943 0.128 1.176 9.477 0.337
#> Stdev for Difference 0.158 0.096 0.034 0.137 0.397 0.093
#>
#> Deviance Information Criterion (DIC) ...............: 589.78
#> Deviance Information Criterion (DIC, saturated) ....: 588.14
#> Effective number of parameters .....................: -773.97
#>
#> Watanabe-Akaike information criterion (WAIC) ...: 1628.66
#> Effective number of parameters .................: 104.96
#>
#> Marginal log-Likelihood: -1203.99
#> is computed
#> Posterior summaries for the linear predictor and the fitted values are computed
#> (Posterior marginals needs also 'control.compute=list(return.marginals.predictor=TRUE)')
summary(jfit)
#> inlabru version: 2.11.1.9025
#> INLA version: 24.11.17
#> Components:
#> Common: main = spde(geometry), group = exchangeable(1L), replicate = iid(1L), NULL
#> Difference: main = spde(geometry), group = exchangeable(1L), replicate = iid(1L), NULL
#> Intercept: main = linear(1), group = exchangeable(1L), replicate = iid(1L), NULL
#> Likelihoods:
#> Family: 'cp'
#> Tag: ''
#> Data class: 'sf', 'data.frame'
#> Response class: 'numeric'
#> Predictor: geometry ~ Intercept + Common - Difference/2
#> Used components: effects[Common, Difference, Intercept], latent[]
#> Family: 'cp'
#> Tag: ''
#> Data class: 'sf', 'data.frame'
#> Response class: 'numeric'
#> Predictor: geometry ~ Intercept + Common + Difference/2
#> Used components: effects[Common, Difference, Intercept], latent[]
#> Time used:
#> Pre = 0.509, Running = 10.2, Post = 0.13, Total = 10.9
#> Fixed effects:
#> mean sd 0.025quant 0.5quant 0.975quant mode kld
#> Intercept -0.342 1.365 -3.018 -0.342 2.334 -0.342 0
#>
#> Random effects:
#> Name Model
#> Common SPDE2 model
#> Difference SPDE2 model
#>
#> Model hyperparameters:
#> mean sd 0.025quant 0.5quant 0.975quant mode
#> Range for Common 2.936 0.573 2.001 2.872 4.246 2.732
#> Stdev for Common 2.046 0.332 1.486 2.015 2.791 1.944
#> Range for Difference 2.034 2.772 0.142 1.195 9.077 0.377
#> Stdev for Difference 0.158 0.093 0.036 0.139 0.386 0.097
#>
#> Deviance Information Criterion (DIC) ...............: 590.34
#> Deviance Information Criterion (DIC, saturated) ....: 588.70
#> Effective number of parameters .....................: -773.55
#>
#> Watanabe-Akaike information criterion (WAIC) ...: 1629.18
#> Effective number of parameters .................: 105.26
#>
#> Marginal log-Likelihood: -1204.06
#> is computed
#> Posterior summaries for the linear predictor and the fitted values are computed
#> (Posterior marginals needs also 'control.compute=list(return.marginals.predictor=TRUE)')
Single-likelihood version
In this particular model, we can also rewrite the problem as a single point process over a product domain over space and group. In inlabru versions <= 2.7.0, the integration domain had to be numeric, so we convert the group variable to a 0/1 variable, group_major <- group == "major", which is also useful in the predictor expression:
fml.joint <-
  geometry + group_major ~ Intercept + Common + (group_major - 0.5) * Difference
gorillas_sf$nests$group_major <- gorillas_sf$nests$group == "major"
lik_joint <- like("cp",
  formula = fml.joint,
  data = gorillas_sf$nests,
  samplers = gorillas_sf$boundary,
  domain = list(
    geometry = gorillas_sf$mesh,
    group_major = c(0, 1)
  )
)
# Approximate with "eb" for faster vignette
jfit_joint <- bru(cmp, lik_joint,
  options = list(
    control.inla = list(
      int.strategy = "eb"
    ),
    bru_max_iter = 1
  )
)
Plotting the ratios of exp(Common) and exp(Difference) between the new fit and the old confirms that the results are the same up to small numerical differences.
library(patchwork)
pl.major <- ggplot() +
  gg(gorillas_sf$mesh,
    mask = gorillas_sf$boundary,
    col = exp(jfit_joint$summary.random$Common$mean -
      jfit$summary.random$Common$mean)
  )
pl.minor <- ggplot() +
  gg(gorillas_sf$mesh,
    mask = gorillas_sf$boundary,
    col = exp(jfit_joint$summary.random$Difference$mean -
      jfit$summary.random$Difference$mean)
  )
(pl.major + scale_fill_continuous(trans = "log")) +
  (pl.minor + scale_fill_continuous(trans = "log")) &
  theme(legend.position = "right")