STA 310 - Spring 2024 - Multilevel models

Announcements

HW 03 due TODAY at 11:59pm
DataFest: March 1 - 3 in Penn Pavilion
- See Slack for more information
Please fill out Campus Culture Survey by March 1
- You should have received an email with a personalized link

Topics

Interpret and inference for multilevel model coefficients
Calculate and interpret intraclass correlation coefficient
Maximum likelihood (ML) and restricted maximum likelihood (REML) estimation approaches
General process for fitting and comparing multilevel models

Data: Music performance anxiety

Today’s data come from the study by Sadler and Miller (2010) of the emotional state of musicians before performances. The data set contains information collected from 37 undergraduate music majors who completed the Positive Affect Negative Affect Schedule (PANAS), an instrument produces a measure of anxiety (negative affect) and a measure of happiness (positive affect). This analysis will focus on negative affect as a measure of performance anxiety.

The primary variables we’ll use are

id: unique musician identification number
na: negative affect score on PANAS (the response variable)
perform_type: type of performance (Solo, Large Ensemble, Small Ensemble)
instrument: type of instrument (Voice, Orchestral, Piano)

Fit model in R

library(lme4)
music_model <- lmer(na ~ orchestra + large_ensemble +
       orchestra:large_ensemble + (large_ensemble|id),
       REML = TRUE, data = music)

na ~ orchestra + large_ensemble + orchestra:large_ensemble: Represents the fixed effects
(large_ensemble|id): Represents the error terms and associated variance components
- Specifies two error terms: $u_{i}$ corresponding to the intercepts, $v_{i}$ corresponding to effect of large ensemble
- Use (1|id) for models with random intercepts and all other effects fixed.

Model results

effect	group	term	estimate	std.error	statistic
fixed	NA	(Intercept)	15.930	0.641	24.833
fixed	NA	orchestra1	1.693	0.945	1.791
fixed	NA	large_ensemble1	-0.911	0.845	-1.077
fixed	NA	orchestra1:large_ensemble1	-1.424	1.099	-1.295
ran_pars	id	sd__(Intercept)	2.378	NA	NA
ran_pars	id	cor__(Intercept).large_ensemble1	-0.635	NA	NA
ran_pars	id	sd__large_ensemble1	0.672	NA	NA
ran_pars	Residual	sd__Observation	4.670	NA	NA

Interpretation

Select the best interpretation for orchestra1:large_ensemble1.

For students who play an orchestral instrument, the mean performance anxiety is expected to be 1.424 points lower for large ensemble performances compared to solo and small ensembles.
The mean decrease in performance anxiety from large ensemble performances versus solos or small ensembles is expected to be 1.424 points greater for students who play orchestral instruments than the expected decrease for soloists and pianists.
The mean performance anxiety for students who play orchestral instruments in large ensembles is expected to be -1.424 points.

Interpretation

Select the best interpretation for sd__(Intercept).

The estimated standard deviation of performance anxiety score for students playing in solos and small ensembles is 2.378 points.
The estimated standard deviation of performance anxiety score for vocalists and pianists is 2.378 points.
The estimated standard deviation of performance anxiety score for students playing in solos and small ensemble is 2.378, after adjusting for instrument.

Inference for fixed effects

Notice the R model output has test statistic but no p-values for each coefficient
- Exact $t$ distribution under the null hypothesis (no fixed effects) and the associated degrees of freedom are not known
We can generally conclude coefficients with test statistic with absolute value greater than 2 are statistically significant
Some software will produce p-values by making several assumptions, large sample results , or approximate p-values
We will introduce a parametric bootstrap approach in the next chapter.

Unconditional means model

The unconditional means model (also known as random intercepts model) is the multilevel model with no predictors at either level

These models are used to estimate between and within group variability

Level One:

$Y_{i j} = a_{i} + ϵ_{i j} ϵ_{i j} \sim N (0, σ^{2})$

Level Two:

$a_{i} = α_{0} + u_{i} u_{i} \sim (N, σ_{u}^{2})$

Fitting the unconditional means model

uncond_means_model <- lmer(na ~ 1 + (1 | id), 
                           REML = TRUE, data = music)

tidy(uncond_means_model) |> kable(digits = 3)

effect	group	term	estimate	std.error	statistic
fixed	NA	(Intercept)	16.237	0.428	37.943
ran_pars	id	sd__(Intercept)	2.225	NA	NA
ran_pars	Residual	sd__Observation	4.739	NA	NA

Intraclass correlation coefficient

The intraclass correlation coefficient $ρ$ is

$ρ = \frac{Between group variability}{Total variability} = \frac{σ_{u}^{2}}{σ_{u}^{2} + σ^{2}}$

In this analysis, $\hat{ρ} = 0.182$ . This value means…

About 18.2% of the variability in performance anxiety can be explained by musician to musician differences
The correlation of performance anxiety scores within a musician is 0.182

Note

$\hat{ρ}$ is calculated based on the variance components from the unconditional means model.

Interpreting $ρ$

Which of the following values indicates the individual observations are essentially independent?

$ρ \approx 1$
$ρ \approx 0$

When $ρ \approx 0$ , the effective sample size (how many pieces of independent information we have) approaches $n$ , the sample size of the data
- Accounting for multilevel structure of the data is less important when modeling
When $ρ \approx 1$ , the effective sample size is close to the number of groups
- Accounting for multilevel structure of the data is very important when modeling

Multilevel models

Announcements

Topics

Data: Music performance anxiety

Fit model in R

Model results

Interpretation

Interpretation

Inference for fixed effects

Unconditional means model

Fitting the unconditional means model

Intraclass correlation coefficient

Interpreting $ρ$

Estimation

ML and REML

ML and REML

Illustration of ML vs. REML

ML or REML?

Comparing ML and REML results

Strategy for building multilevel models

Saddler and Miller (2010) strategy

Application exercise

References