Sample Size Planning for MLM

PSYC 575

Winnie Tse, Mark Lai

University of Southern California

Updated: 2021-11-13

Week Learning Objectives

Describe the importance of having sufficient sample size for scientific research
Describe conceptually the steps for sample size planning: precision analysis and power analysis
Perform power analysis for MLM using the PowerUpR application and the simr package
Understand the effect of uncertainty in parameter values and explore alternative approaches for sample size planning

Why Sample Size?

Small Sample Size is a Problem Because . . .

Low power

Misleading and noisy results¹

When coupled with publication bias (statistical significance filter)²,³

Nonreproducible findings

[1] See Maxwell (2004)

[2] See the graph on this blog post

[3] See also Vasishth et al. (2018)

Review: Sampling distributions

Test yourself! -- Week 13 Quiz (ungraded)

What is the null distribution?

Suppose we examine the effect of a therapy on eating disorder
We test against the null hypothesis $H_{0} : γ_{01} = 0$, where $γ_{01}$ is the fixed effect of the therapy on eating disorder

What is the alternative distribution?

Assume that the true effect of this therapy is $γ_{01} = .1$

Sampling Distribution as a Function of Sample Size

Assume the true effect is $γ_{01} = 0.10$

Let's say

when $N = 20$ , $p < .05$ when ${\hat{γ}}_{01} \geq 0.82$
when $N = 200$ , $p < .05$ when ${\hat{γ}}_{01} \geq 0.26$

Add the 0 line, the 0.1 line, and the cutoff lines

Steps for Sample Size Planning

1. Write down your model equations
2. List out all parameters in the model
3. Determine if you want to achieve a desired level of
   a. Power, or
   b. Precision

Step 1: Write down model equations

Group-based therapy for eating disorder (cluster-randomized trial)

Level-1

$\begin{aligned} Y_{ij} & = β_{0j} + β_{1j} X_{\text{cmc}, ij} + e_{ij} \\ e_{ij} & \sim N (0, σ) \end{aligned}$

Level-2

$\begin{aligned} β_{0j} & = γ_{00} + γ_{01} W_{j} + u_{0j} \\ β_{1j} & = γ_{10} + γ_{11} W_{j} + u_{1j} \\ \begin{bmatrix} u_{0j} \\ u_{1j} \end{bmatrix} & \sim N \left( \begin{bmatrix} 0 \\ 0 \end{bmatrix}, \begin{bmatrix} τ_{0}^{2} & \\ τ_{01} & τ_{1}^{2} \end{bmatrix} \right) \end{aligned}$

$γ_{10}$ : $X$ (purely level-1 with ICC = 0)
$γ_{01}$ : $W$ (level-2)
$γ_{11}$ : $W \times X$ (cross-level interaction)

Step 2: List out all parameters

Fixed effects: $γ_{00}$ , $γ_{01}$ , $γ_{10}$ , $γ_{11}$
Random effects: $τ_{0}^{2}$ , $τ_{1}^{2}$ , $τ_{01}$ , $σ^{2}$
Number of clusters: $J$
Cluster size: $n$

Standard Error and Precision Analysis

Sample Size and SE/Post. SD

In the previous graph, when $N = 20$ , the sample estimate is likely to be anywhere between -0.4 and 0.6

$S E \propto \frac{1}{\sqrt{N}}$

One goal of sample size planning is to

Have sufficient sample size to get precise (low SE) sample estimates of an effect
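As a quick check of the $1 / \sqrt{N}$ rule against the cutoffs quoted earlier: if the SE (and with it the significance cutoff) shrinks with $\sqrt{N}$, the cutoff of 0.82 at $N = 20$ should drop to roughly 0.26 at $N = 200$. The Python sketch below illustrates this; the helper name `rescale_cutoff` is hypothetical, and the calculation ignores the small change in the critical $t$ value as $N$ grows.

```python
import math


def rescale_cutoff(cutoff_old, n_old, n_new):
    """Rescale a significance cutoff, assuming it is proportional to 1 / sqrt(N)."""
    return cutoff_old * math.sqrt(n_old / n_new)


# Cutoff at N = 20 taken from the sampling-distribution example.
cutoff_200 = rescale_cutoff(0.82, n_old=20, n_new=200)
print(f"Approximate cutoff at N = 200: {cutoff_200:.2f}")
```

A tenfold increase in $N$ narrows the cutoff by a factor of about $\sqrt{10} \approx 3.2$, not 10.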

Analytic Formulas of SE

$J$ = Number of clusters; $n$ = Cluster size

E.g., $J = 100$ schools; $n = 10$ students per school

Assuming $τ_{01} = 0$

$\begin{aligned} S E (γ_{01}) & = \sqrt{\frac{1}{S_{W}^{2}} (\frac{τ_{0}^{2}}{J} + \frac{σ^{2}}{J n})} \\ S E (γ_{10}) & = \sqrt{\frac{τ_{1}^{2}}{J} + \frac{σ^{2}}{J n S_{X}^{2}}} \\ S E (γ_{11}) & = \sqrt{\frac{1}{S_{W}^{2}} (\frac{τ_{1}^{2}}{J} + \frac{σ^{2}}{J n S_{X}^{2}})} \end{aligned}$
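The three formulas translate directly into small functions. This is a minimal Python sketch, assuming $τ_{01} = 0$ as above; the function and argument names are hypothetical, and the variance values in the usage lines ($τ_{0}^{2} = 0.3$, $σ^{2} = 0.7$, $S_{W}^{2} = 0.25$) are illustrative inputs, not estimates from data.

```python
import math


def se_gamma01(J, n, tau0_sq, sigma_sq, s_w_sq):
    """SE of the level-2 predictor effect (gamma_01)."""
    return math.sqrt((tau0_sq / J + sigma_sq / (J * n)) / s_w_sq)


def se_gamma10(J, n, tau1_sq, sigma_sq, s_x_sq):
    """SE of the level-1 predictor effect (gamma_10)."""
    return math.sqrt(tau1_sq / J + sigma_sq / (J * n * s_x_sq))


def se_gamma11(J, n, tau1_sq, sigma_sq, s_x_sq, s_w_sq):
    """SE of the cross-level interaction (gamma_11)."""
    return math.sqrt((tau1_sq / J + sigma_sq / (J * n * s_x_sq)) / s_w_sq)


# J = 100 schools with n = 10 students each, as in the example above.
se_school = se_gamma01(J=100, n=10, tau0_sq=0.3, sigma_sq=0.7, s_w_sq=0.25)
print(f"SE(gamma_01) with J = 100, n = 10: {se_school:.4f}")
```

Note that `se_gamma11` is `se_gamma10` divided by $\sqrt{S_{W}^{2}}$, which mirrors the structure of the formulas.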

Precision Analysis

Group-based therapy for eating disorder (cluster-randomized trial)

Intervention at group level
10 participants per group
Outcome standardized (i.e., SD = $\sqrt{τ_{0}^{2} + σ^{2}} = 1$ )
- $γ$ = Cohen's $d$
ICC = .3 (i.e., $τ_{0}^{2} = .3$ )
Goal: estimate $J$ such that $S E (γ_{01}) \leq .1$
- E.g., if we estimated the sample effect size to be $d = .25$ , the 95% CI would be approximately [.05, .45].
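The interval in this example follows from the usual normal approximation (estimate $\pm$ 1.96 SE), assuming the SE sits exactly at the target of .1:

$\begin{aligned} .25 \pm 1.96 \times .1 & = [.25 - .196, \; .25 + .196] \\ & = [.054, .446] \approx [.05, .45] \end{aligned}$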

Calculating $J$

When the predictor is binary (e.g., treatment-control), if half of the groups are in one condition, $S_{W}^{2} = 0.5 \times 0.5 = 0.25$

Otherwise, if 30% are in one condition, $S_{W}^{2} = 0.3 \times 0.7 = 0.21$
$τ_{0}^{2} = 0.3$ , $σ^{2} = 0.7$ , $n = 10$

E.g., if $J = 30$,

$S E (γ_{01}) = \sqrt{\frac{1}{S_{W}^{2}} (\frac{τ_{0}^{2}}{J} + \frac{σ^{2}}{J n})} = \sqrt{\frac{1}{0.25} (\frac{0.3}{30} + \frac{0.7}{(30) (10)})} = 0.2221111$

Keep trying, and you'll find ...

When $J$ = 148, $S E (γ_{01}) = 0.1$

So you'll need 148 groups (74 treatment, 74 control)
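The "keep trying" step can be automated with a simple search. The Python sketch below assumes the same inputs as the example ($τ_{0}^{2} = 0.3$, $σ^{2} = 0.7$, $n = 10$, $S_{W}^{2} = 0.25$); the function names are hypothetical, and stepping through even values of $J$ is a choice made here so that the two arms are equal in size.

```python
import math


def se_gamma01(J, n, tau0_sq, sigma_sq, s_w_sq):
    """Analytic SE of the level-2 effect, assuming tau_01 = 0."""
    return math.sqrt((tau0_sq / J + sigma_sq / (J * n)) / s_w_sq)


def smallest_J(target_se, n, tau0_sq, sigma_sq, s_w_sq, max_J=100_000):
    """Return the smallest even J whose SE is at or below target_se."""
    for J in range(2, max_J + 1, 2):
        # Small tolerance so floating-point rounding does not reject an exact hit.
        if se_gamma01(J, n, tau0_sq, sigma_sq, s_w_sq) <= target_se + 1e-9:
            return J
    raise ValueError("no J up to max_J reaches the target SE")


se_30 = se_gamma01(30, n=10, tau0_sq=0.3, sigma_sq=0.7, s_w_sq=0.25)
J_needed = smallest_J(0.1, n=10, tau0_sq=0.3, sigma_sq=0.7, s_w_sq=0.25)
print(f"SE at J = 30: {se_30:.4f}; smallest J for SE <= .1: {J_needed}")
```

Because the SE formula is monotone in $J$, the first value that meets the target is also the smallest one.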

Power Analysis

Two-tailed test, $α = .05$

$H_{0} : γ_{01} = 0$

Critical region: ${\hat{γ}}_{01} \leq - 0.45$ or ${\hat{γ}}_{01} \geq 0.45$

$H_{1} : γ_{01} = 0.3$

Power¹ $\approx P ({\hat{γ}}_{01} \leq - 0.45) + P ({\hat{γ}}_{01} \geq 0.45) = 0.2465731$

[1] In practice, we need to incorporate the sampling variability of the standard error as well, so this power calculation is only a rough approximation.

Two-tailed test, $α = .05$

$H_{0} : γ_{01} = 0$

Critical region: ${\hat{γ}}_{01} \leq - 0.2$ or ${\hat{γ}}_{01} \geq 0.2$

$H_{1} : γ_{01} = 0.3$

Power $\approx P ({\hat{γ}}_{01} \leq - 0.2) + P ({\hat{γ}}_{01} \geq 0.2) = 0.8461551$
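The two power values can be approximated with a normal sampling distribution centered at the true effect. The Python sketch below assumes that the two critical regions go with the SEs from the precision example (about 0.222 for $J = 30$ and 0.1 for $J = 148$); that pairing is an inference made here, not something stated on the slides, and the function names are hypothetical.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def approx_power(effect, se, cutoff):
    """P(estimate <= -cutoff) + P(estimate >= cutoff) when estimate ~ N(effect, se)."""
    lower_tail = normal_cdf((-cutoff - effect) / se)
    upper_tail = 1.0 - normal_cdf((cutoff - effect) / se)
    return lower_tail + upper_tail


power_wide = approx_power(effect=0.3, se=0.2221111, cutoff=0.45)
power_narrow = approx_power(effect=0.3, se=0.1, cutoff=0.2)
print(f"cutoff 0.45: {power_wide:.3f}; cutoff 0.20: {power_narrow:.3f}")
```

Under these assumptions the results come out at about .25 and .84, close to but not identical with the values above, because the cutoffs are rounded and this sketch uses a normal rather than a $t$ reference distribution.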

Tools for Power Analysis

Stand-alone programs
- Optimal Design
- PinT
R packages
- simr
Spreadsheet/Webapp
- PowerUp!

See more discussion in Arend & Schäfer (2019)

PowerUpR Shiny App

https://powerupr.shinyapps.io/index/

Monte Carlo Simulation for Power Analysis

Simulate a large number (e.g., $R$ = 1,000) of data sets based on the given effect size, ICC, etc.
Fit an MLM to each simulated data set
Power $\approx$ Proportion of data sets in which $p < α$

See sample R code for using `simr`
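The three steps can also be sketched without any packages. The Python example below is not the `simr` workflow; it is a simplified stand-in that assumes a balanced cluster-randomized design with equal cluster sizes, simulates cluster means directly, and replaces the MLM fit with a two-group $z$ test on those means (critical value 1.96). The function name and the seed are hypothetical choices.

```python
import math
import random
import statistics


def simulate_power(J, n, gamma01, tau0_sq, sigma_sq, reps=1000, z_crit=1.96, seed=575):
    """Monte Carlo power for the treatment effect in a balanced two-arm cluster trial."""
    rng = random.Random(seed)
    # A cluster mean varies by tau0^2 (between clusters) plus sigma^2 / n (within).
    sd_mean = math.sqrt(tau0_sq + sigma_sq / n)
    half = J // 2
    rejections = 0
    for _ in range(reps):
        # Step 1: simulate one data set (cluster means per arm).
        control = [rng.gauss(0.0, sd_mean) for _ in range(half)]
        treated = [rng.gauss(gamma01, sd_mean) for _ in range(J - half)]
        # Step 2: estimate the effect and its SE from the simulated data.
        estimate = statistics.fmean(treated) - statistics.fmean(control)
        se = math.sqrt(statistics.variance(treated) / len(treated)
                       + statistics.variance(control) / len(control))
        # Step 3: count the data sets that reach significance.
        if abs(estimate / se) >= z_crit:
            rejections += 1
    return rejections / reps


power_148 = simulate_power(J=148, n=10, gamma01=0.3, tau0_sq=0.3, sigma_sq=0.7)
print(f"Estimated power with J = 148: {power_148:.3f}")
```

With 1,000 replications the estimate carries Monte Carlo error of roughly one percentage point, so small differences between runs are expected.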

Uncertainty in Parameter Values

In the PowerUpR demo, to calculate the number of clusters $J$ needed to achieve 80% power, we set

Type I error rate = .05
Two tailed test = TRUE
g2, r21, r22 = 0, as we did not include any covariates
p = .5, for a balanced design (half treatment, half control)

However, we need to guess the values of

Effect size = .3?
ICC = .3?

The Effect of Uncertainty in Power

Ignoring uncertainty

The more uncertainty there is about a parameter value, the more power we lose by ignoring it (red curve)
Uncertainty in both effect size and ICC can further reduce our power
The more uncertainty we have, the more samples we need to achieve 80% power
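One way to see the power loss is to average power over plausible effect sizes instead of plugging in a single guess. The Python sketch below assumes a hypothetical normal distribution for the effect size with mean .3 and SD .1, holds the SE fixed at .1 (the $J = 148$ design), and uses a normal approximation to power; the function names and seed are illustrative choices.

```python
import math
import random


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def power_two_tailed(effect, se, z_crit=1.96):
    """Normal-approximation power of a two-tailed test for a given true effect."""
    ratio = effect / se
    return normal_cdf(-z_crit - ratio) + 1.0 - normal_cdf(z_crit - ratio)


def expected_power(mean_effect, sd_effect, se, draws=20_000, seed=575):
    """Average power over effect sizes drawn from N(mean_effect, sd_effect)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(draws):
        total += power_two_tailed(rng.gauss(mean_effect, sd_effect), se)
    return total / draws


point_power = power_two_tailed(0.3, se=0.1)
avg_power = expected_power(0.3, 0.1, se=0.1)
print(f"Power at the point guess: {point_power:.3f}; averaged over uncertainty: {avg_power:.3f}")
```

Under these assumptions the averaged power falls well below the power computed at the point guess, so a design planned for 80% power at a single guessed value may deliver less.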

Hybrid Classical-Bayesian approach

Incorporates uncertainty for sample size planning
Instead of plugging in a point value of a guess, we can specify how much uncertainty we have (e.g., standard error of $γ_{01}$ from a previous study)

$δ \sim N (.3, .1)$

$ρ \sim \text{Beta} (a, b)$

Here $δ$ is the effect size and $ρ$ is the ICC; $a$ and $b$ can be calculated from $\hat{ρ} = .3$ and $σ_{ρ} = .1$ (the estimate of $ρ$ and the uncertainty about it)
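One common way to obtain $a$ and $b$ is moment matching: choose the Beta distribution whose mean and SD equal $\hat{ρ}$ and $σ_{ρ}$. The Python sketch below assumes that approach; whether the hcbr app uses exactly this calculation is not confirmed here, and the function name is hypothetical.

```python
def beta_from_mean_sd(mean, sd):
    """Return (a, b) of the Beta distribution with the given mean and SD."""
    if not 0.0 < mean < 1.0:
        raise ValueError("mean must lie strictly between 0 and 1")
    variance = sd ** 2
    if variance >= mean * (1.0 - mean):
        raise ValueError("sd is too large for a Beta distribution with this mean")
    # For Beta(a, b): mean = a / (a + b), variance = mean * (1 - mean) / (a + b + 1).
    total = mean * (1.0 - mean) / variance - 1.0  # this is a + b
    return mean * total, (1.0 - mean) * total


a, b = beta_from_mean_sd(mean=0.3, sd=0.1)
print(f"a = {a:.2f}, b = {b:.2f}")
```

For $\hat{ρ} = .3$ and $σ_{ρ} = .1$ this gives approximately $a = 6$ and $b = 14$, whose mean is $6 / 20 = .3$.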

hcbr Shiny App

http://winnie-wy-tse.shinyapps.io/hcb_shiny

Additional Notes on Power

Increasing $J$ usually leads to higher power than increasing $n$
Balanced designs generally have higher power than unbalanced designs
Larger sample size required for testing level-2 predictors
Testing an interaction requires a much larger sample size
- E.g., 16 times larger than for a main effect

Doubling $J$ is better than doubling $n$
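The analytic SE for a level-2 predictor makes this concrete. The Python sketch below reuses the inputs from the precision example ($τ_{0}^{2} = 0.3$, $σ^{2} = 0.7$, $S_{W}^{2} = 0.25$) and starts from $J = 30$, $n = 10$; the function name is hypothetical, and the comparison applies to $γ_{01}$ under these particular values only.

```python
import math


def se_gamma01(J, n, tau0_sq=0.3, sigma_sq=0.7, s_w_sq=0.25):
    """Analytic SE of the level-2 effect, assuming tau_01 = 0."""
    return math.sqrt((tau0_sq / J + sigma_sq / (J * n)) / s_w_sq)


se_base = se_gamma01(J=30, n=10)
se_double_J = se_gamma01(J=60, n=10)  # twice as many clusters
se_double_n = se_gamma01(J=30, n=20)  # twice as many people per cluster
print(f"base: {se_base:.3f}; double J: {se_double_J:.3f}; double n: {se_double_n:.3f}")
```

Both options double the total number of participants, but doubling $n$ only shrinks the $σ^{2} / (J n)$ term, while doubling $J$ shrinks the $τ_{0}^{2} / J$ term as well.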
