Skip to main content

Power Calculations and Critical Values for Two-Stage Nonparametric Testing Regimes

  • Chapter
  • First Online:
Robust and Multivariate Statistical Methods
  • 627 Accesses

Abstract

Interim analysis techniques for clinical trials provide improved power with smaller average sample sizes. These techniques crucially require multivariate probability calculations for determining critical values. Most existing techniques rely on multivariate normal approximations to the joint null distribution of test statistics evaluated on potential interim and full data sets. More accurate critical values for nonparametric testing with an interim analysis are given, using a new multivariate Cornish–Fisher expansion. While earlier authors demonstrated that such an expansion is possible, it has never been implemented before this manuscript. Generally, the superior accuracy of power calculations via an Edgeworth series is demonstrated. Example calculations giving sample sizes from desired power are provided. Calculations are implemented in an R package.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
eBook
USD 149.00
Price excludes VAT (USA)
Softcover Book
USD 199.99
Price excludes VAT (USA)
Hardcover Book
USD 199.99
Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

Download references

Acknowledgements

The authors thank Yuzo Maruyama for an English summary of Takeuchi (1978) and Todd Kuffner for pointers to references Takemura and Takeuchi (1988) and Takemura and Takeuchi (1988). John Kolassa thanks NSF DMS 1712839 for partial support for this project. The authors thank editors and referees for helpful comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to John Kolassa .

Editor information

Editors and Affiliations

Appendices

Appendix 1: A Bivariate Recursion for Exact Probabilities

The univariate recursion is constructed by counting the number of data set orderings leading to the statistic value and decomposing them into orderings based on one fewer value from the first group and one fewer value from the second group. The bivariate recursion is similar to the above univariate recursion and is similar to that of Wilding et al. (2011).

Let b(u1, u2, m1, n1, m2, n2) represent the number of orderings of (4) for which U1 = u1 and U2 = u2. This number is zero if any of the sample sizes m1, n1, m2, n2 is negative, if either statistic value is negative, or if either statistic value is larger than its maximum value. It is also zero if both additional sample sizes for stage 2 are zero but the second statistic value exceeds the first. If all sample sizes are zero, then the sums in (1) and (3) are empty, and both statistic values are zero; hence b(0, 0, 0, 0, 0, 0) = 1. These end conditions are given by

$$\displaystyle \begin{aligned} b(u_1,u_2,m_1,n_1,m_2,n_2):=\begin{cases} 0,& \text{if any of }m_1, m_2, n_1, n_2, u_1, u_2\text{ is negative or}\\ &\text{ if }u_1>m_1 n_1\text{ or }u_2>(m_1+m_2)(n_1+n_2)\\ 0,& \text{if }n_1=0\text{ or }m_1=0,\text{ and }u_1>0\\ 1,& \text{if }m_1=0\text{ and } n_1= 0\text{ and } u_1= 0\\ &\text{ and } m_2= 0\text{ and } n_2=0\text{ and }u_2=0. \end{cases}\end{aligned} $$
(22)

Otherwise, the number of rearrangements of the data (4) giving rise to statistic values u1 and u2 are the sum of four contributions. First, add those with sample sizes m1 − 1, n1, m2, n2, with an additional value from the first group in the first sample that exceeds all values in the sample, and hence leaves the statistic value unchanged. Second, add those with sample sizes m1, n1 − 1, m2, n2, with an additional value from the second group in the first sample that exceeds all values in the sample, and hence increases the first statistic by m1 and increases the second statistic by m1 + m2. Third, add those with sample sizes m1, n1, m2 − 1, n2, with an additional value from the first group in the second sample added that exceeds all values in the sample, and hence leaves the statistic value unchanged. Fourth, add those with sample sizes m1, n1, m2, n2 − 1, with an additional value from the second group in the second sample added that exceeds all values in the sample, and hence leaves U1 unchanged, and increases the second statistic by m1 + m2. This leads to the following recursion:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} b(u_1,u_2,m_1,n_1,m_2,n_2)& :=&\displaystyle m_1 b(u_1,u_2,m_1-1,n_1,m_2,n_2)\\ & +&\displaystyle n_1 b(u_1-m_1,u_2-m_1-m_2,m_1,n_1-1,m_2,n_2)\\ & +&\displaystyle m_2 b(u_1,u_2,m_1,n_1,m_2-1,n_2)\\ & +&\displaystyle n_2 b(u_1,u_2-m_1-m_2,m_1,n_1,m_2,n_2-1). \end{array} \end{aligned} $$
(23)

Numerical examples in Figs. 1 and 2 exhibit comparisons of probability approximations to bivariate probabilities and quantiles for U = (U1, U2), to the exact values.

Appendix 2: A Continuous Example with Nonzero Skewness

Our aim in determining the expansion for c2 is to apply the techniques to Wilcoxon testing, but the same quantile approximation may be used more generally. Before application to the Wilcoxon statistic, which is somewhat atypical, because the third cumulants are zero, leading to a less dramatic effect, and because the Wilcoxon statistic is discrete, and hence lacks the continuity that the technique was developed for. Instead, we present a more general example consisting of a continuous distribution with nonzero third order cumulants.

Consider Y1, Y2, Y3 independent exponentials. Let U1 = Y1 + Y3, and U2 = Y2 + Y3. Figure 3 compares Edgeworth (E) or Cornish–Fisher (CF), normal (N), and Monte Carlo (MC, taken with 500,000 samples and treated as the truth). Panels are:

  1. (a)

    compares difference of E and N upper tail univariate probability approximation from the MC approximation, as a function of the MC approximation.

    Fig. 3
    4-line graphs a through d. Graph a plots error versus alpha and has increasing line for normal and a declining line for Edgeworth. Graphs b and c plot second ordinate versus final ordinate and has concave down decreasing parallel lines. Graph d plots approximate ordinate versus true ordinate and has increasing lines for multivariate, normal, and now.

    Exponential example, sample size 1. (a) Absolute error for approximations. (b) Difference in absolute error. (c) True bivariate probability. (d) Second ordinate approximation range

  2. (b)

    compares error of E and N upper tail bivariate probability approximation, as a function of the ordinate.

  3. (c)

    Gives contours of MC upper tail probabilities.

  4. (d)

    Represents CF and N approximation to upper tail, vs. MC value. CF and N values exhibit some dependence on target for first univariate tail, and so are represented as a range.

Note from panel b that the Edgeworth approximation fails to dominate the normal approximation only for a narrow band between the contours marked 0 in the middle of the plot.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Kolassa, J., Chen, X., Seifu, Y., Zhong, D. (2023). Power Calculations and Critical Values for Two-Stage Nonparametric Testing Regimes. In: Yi, M., Nordhausen, K. (eds) Robust and Multivariate Statistical Methods. Springer, Cham. https://doi.org/10.1007/978-3-031-22687-8_19

Download citation

Publish with us

Policies and ethics