Unstratified and Stratified Miettinen and Nurminen Test

Overview

Binary outcome is a commonly used endpoint in clinical trials. This page illustrates how to conduct the unstratified or stratified analysis with the Miettinen and Nurminen (M&N) method (Miettinen and Nurminen 1985) for risk difference analysis in R. The following statistics can be calculated with the function rate_compare():

Estimated risk difference.
Test statistic.
Confidence interval for the risk difference.
p-value for risk difference.

Statistical methods

Unstratified analysis of M&N method

Assume the data includes two independent binomial samples with binary response variables to be analyzed/summarized and the data collected in a clinical design without stratification. Also this approach is applicable to the case when the data are collected using a stratified clinical design and the statistician would like to ignore stratification by pooling the data over strata assuming two independent binomial samples. Assume P_i is the proportion of success responses in the test (i = 1) or control (i = 0) group.

Confidence interval

The confidence interval is based on the M&N method and given by the roots for PD = P₁ − P₀ of the equation:

$$\chi_\alpha^2 = \frac{(\hat{p}_1-\hat{p}_0-PD)^2}{\tilde{V}}$$,

where p̂₁ and p̂₀ are the observed values of P₁ and P₀, respectively;

χ_α² = the upper cut point of size α from the central chi-square distribution with 1 degree of freedom (χ_α² = 3.84 for 95% confidence interval);
PD = the difference between two population proportions (PD = P₁ − P₀);

$$\tilde{V}=\bigg[\frac{\tilde{p}_1(1-\tilde{p}_1)}{n_1}+ \frac{\tilde{p}_0(1-\tilde{p}_0)}{n_0}\bigg]\frac{n_1+n_0}{n_1+n_0-1}$$;

n₁ and n₀ are the sample sizes for the test and control group, respectively;
p̃₁ = maximum likelihood estimate of proportion on test computed as p̃₀ + PD;
p̃₀ = maximum likelihood estimate of proportion on control under the constraint p̃₁ − p̃₀ = PD.

As stated above the 2-sided 100(1 − α)% CI is given by the roots for PD = P₁ − P₀. The bisection algorithm is used in the function to obtain the two roots (confidence interval) for PD.

p-value and Z-statistic

The Z-statistic is computed as:

$$Z_\text{diff}=\frac{\hat{p}_1-\hat{p}_0+S_0}{\sqrt{\tilde{V}}}$$ where p̂₁ and p̂₀ are the observed values for P₁ and P₀ respectively, S₀ is pre-specified proportion difference under the null;

p̃₁ = maximum likelihood estimate of proportion on test computed as p̃₀ + S₀;
p̃₀ = maximum likelihood estimate of proportion on control under the constraint p̃₁ − p̃₀ = S₀.
For non-inferiority or one-sided equivalence hypothesis with S₀ > 0, the p-value, Pr (Z ≥ Z_diff | H₀), is computed based on Z_diff using the standard normal distribution.
For non-inferiority or one-sided equivalence hypothesis with S₀ < 0, the p-value, Pr (Z ≤ Z_diff | H₀), is computed based on Z_diff using the standard normal distribution.
For two-sided superiority test, the p-value Pr (χ_diff² ≤ χ₁² | H₀), is computed based on χ_diff² using the chi-square distribution with 1 degree of freedom, where χ_diff² = Z_diff².

Stratified analysis of M&N method

Assume the data includes two treatment groups, test and control, and collected based on a stratified design. Within each stratum there are two independent binomial samples with binary response variables to be analyzed/summarized. The parameter of interest is the difference between the population proportions of the test and the control groups. The analysis and summaries need to be performed while adjusting for the stratifying variables.

Confidence interval

The confidence interval is based on the M&N method and given by the roots for PD = P₁ − P₀ of the equation:

$$\chi_\alpha^2 = \frac{(\hat{p}_1^*-\hat{p}_0^*-PD)^2}{\sum_{i=1}^I(W_i/\sum_{k=1}^{K}W_k)^2\tilde{V}_i}$$,

where $\hat{p}_s^* = \sum_{i=1}^I(W_i/\sum_{k=1}^KW_k)\hat{p}_{s i}$ for s = 0, 1;

$$\tilde{V}_i=\bigg[\frac{\tilde{p}_{1i}(1-\tilde{p}_{1i})}{n_{1i}}+\frac{\tilde{p}_{0i}(1-\tilde{p}_{0i})}{n_{0i}}\bigg]\frac{n_{1i}+n_{0i}}{n_{1i}+n_{0i}-1}$$;

W_i is the weight for the i-th strata;
$I = K = $ number of strata, i = k= strata;
n_1i and n_0i are the sample sizes in i-th strata for the test and control group, respectively;
p̂_1i and p̂_0i = observed proportion in i-th strata for the test and control, respectively;
p̃_0i and p̃_1i are MLE for P_0i and P_1i, respectively, computed under the constraint p̃_1i = p̃_0i + PD.

Similarly as for unstratified analysis,the 2-sided 100(1 − α)% CI is given by the roots for PD = P₁ − P₀, and the bisection algorithm is used in the function to obtain the two roots (confidence interval) for PD.

p-value and Z-statistic

The Z-statistic is computed as:

$$Z_\text{diff}=\frac{\hat{p}_1^*-\hat{p}^*_0+S_0}{\sqrt{\sum_{i=1}^I(W_i/\sum_{k=1}^{K}W_k)^2\tilde{V}_i}}$$ where S₀ is pre-specified proportion difference under the null;

p̃_0i and p̃_1i are MLE for P_0i and P_1i, respectively, computed under the constraint p̃_1i = p̃_0i + S₀.

The p-value can be calculated as stated above.

Example

Load package

library(metalite.ae)

Data simulation

We simulated a dataset with 2 treatment group for binary output. If stratum is used, we considered 4 stratum.

ana <- data.frame(
  treatment = c(rep(0, 100), rep(1, 100)),
  response  = c(rep(0, 80), rep(1, 20), rep(0, 40), rep(1, 60)),
  stratum   = c(rep(1:4, 12), 1, 3, 3, 1, rep(1:4, 12), rep(1:4, 25))
)

head(ana)
#>   treatment response stratum
#> 1         0        0       1
#> 2         0        0       2
#> 3         0        0       3
#> 4         0        0       4
#> 5         0        0       1
#> 6         0        0       2

Unstratified analysis

The function computes the risk difference, Z-statistic, p-value given the type of test, and two-sided 100(1 − α)% confidence interval of difference between two rates.

rate_compare(response ~ treatment, data = ana)
#>   est  z_score            p    lower     upper
#> 1 0.4 5.759051 4.229411e-09 0.269662 0.5165743

Stratified analysis

The sample size weighting is often used in the clinical trial. Below is the function to conduct stratified MN analysis with sample size weights.

We also support weight in "equal" and "cmh". More details can be found in the rate_compare() documentation.

rate_compare(
  formula = response ~ treatment, strata = stratum, data = ana,
  weight = "ss"
)
#>         est  z_score            p     lower     upper
#> 1 0.3998397 5.712797 5.556727e-09 0.2684383 0.5172779

References

Miettinen, Olli, and Markku Nurminen. 1985. “Comparative Analysis of Two Rates.” Statistics in Medicine 4 (2): 213–26.