Skip to main content

2020 | OriginalPaper | Buchkapitel

Applications of Sequential Methods in Multiple Hypothesis Testing

verfasst von : Anthony Almudevar

Erschienen in: Statistical Modeling for Biological Systems

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

One of the main computational burdens in genome-wide statistical applications is the evaluation of large scale multiple hypothesis tests. Such tests are often implemented using replication-based methods, such as the permutation test or bootstrap procedure. While such methods are widely applicable, they place a practical limit on the computational complexity of the underlying test procedure. In such cases it would seem natural to apply sequential procedures. For example, suppose we observe the first ten replications of an upper-tailed statistic under a null distribution generated by random permutations, and of those ten, five exceed the observed value. It would seem reasonable to conclude that the P-value will not be small enough to be of interest, and further replications should not be needed.
While such methods have been proposed in the literature, for example by Hall in 1983, by Besag and Clifford in 1991 and by Lock in 1991, they have not been widely applied in multiple testing applications generated by high dimensional data sets, where they would likely be of some benefit. In this article related methods will first be reviewed. It will then be shown how commonly used multiple testing procedures may be modified so as to introduce sequential procedures while preserving the validity of reported error rates. A number of examples will show how such procedures can reduce computation time by an order of magnitude with little loss in power.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Almudevar, A. (2000). Exact confidence regions for species assignment based on DNA markers. The Canadian Journal of Statistics, 28, 81–95.MathSciNetCrossRef Almudevar, A. (2000). Exact confidence regions for species assignment based on DNA markers. The Canadian Journal of Statistics, 28, 81–95.MathSciNetCrossRef
2.
Zurück zum Zitat Almudevar, A. (2010). A hypothesis test for equality of Bayesian network models. EURASIP Journal on Bioinformatics and Systems Biology, 2010, 10.CrossRef Almudevar, A. (2010). A hypothesis test for equality of Bayesian network models. EURASIP Journal on Bioinformatics and Systems Biology, 2010, 10.CrossRef
3.
Zurück zum Zitat Benjamini, Y., & Yekutieli, D. (2001). The control of the false discovery rate in multiple testing under dependency. The Annals of Statistics, 29, 1165–1188.MathSciNetCrossRef Benjamini, Y., & Yekutieli, D. (2001). The control of the false discovery rate in multiple testing under dependency. The Annals of Statistics, 29, 1165–1188.MathSciNetCrossRef
4.
6.
Zurück zum Zitat Dudoit, S., Shaffer, J. P., & Boldrick, J. C. (2003). Multiple hypothesis testing in microarray experiments. Statistical Science, 18, 71–103.MathSciNetCrossRef Dudoit, S., Shaffer, J. P., & Boldrick, J. C. (2003). Multiple hypothesis testing in microarray experiments. Statistical Science, 18, 71–103.MathSciNetCrossRef
7.
Zurück zum Zitat Dudoit, S., & van der Laan, M. J. (2008). Multiple testing procedures with applications to genomics. New York: Springer.CrossRef Dudoit, S., & van der Laan, M. J. (2008). Multiple testing procedures with applications to genomics. New York: Springer.CrossRef
8.
Zurück zum Zitat Fay, M. P., & Follmann, D. A. (2002). Designing Monte Carlo implementations of permutation or bootstrap hypothesis tests. The American Statistician, 56, 63–70.MathSciNetCrossRef Fay, M. P., & Follmann, D. A. (2002). Designing Monte Carlo implementations of permutation or bootstrap hypothesis tests. The American Statistician, 56, 63–70.MathSciNetCrossRef
9.
Zurück zum Zitat Hall, W. J. (1983). Some sequential tests for matched pairs: A sequential permutation test. In P. K. Sen (ed.), Contributions to statistics: essays in honour of Norman L. Johnson, (pp. 211–228). Amsterdam: North-Holland. Hall, W. J. (1983). Some sequential tests for matched pairs: A sequential permutation test. In P. K. Sen (ed.), Contributions to statistics: essays in honour of Norman L. Johnson, (pp. 211–228). Amsterdam: North-Holland.
10.
Zurück zum Zitat Ljung, L. (2007). Strong convergence of a stochastic approximation algorithm. The American Statistician, 6, 680–696.MathSciNetMATH Ljung, L. (2007). Strong convergence of a stochastic approximation algorithm. The American Statistician, 6, 680–696.MathSciNetMATH
11.
Zurück zum Zitat Lock, R. H. (1991). A sequential approximation to a permutation test. Communications in Statistics: Simulation and Computation, 20, 341–363.MathSciNetCrossRef Lock, R. H. (1991). A sequential approximation to a permutation test. Communications in Statistics: Simulation and Computation, 20, 341–363.MathSciNetCrossRef
12.
Zurück zum Zitat Medland, S., Schmitt, J., Webb, B., Kuo, P.-H., & Neale, M. (2009). Efficient calculation of empirical P-values for genome-wide linkage analysis through weighted permutation. Behavior Genetics, 39, 91–100.CrossRef Medland, S., Schmitt, J., Webb, B., Kuo, P.-H., & Neale, M. (2009). Efficient calculation of empirical P-values for genome-wide linkage analysis through weighted permutation. Behavior Genetics, 39, 91–100.CrossRef
13.
Zurück zum Zitat Mootha, V. K., Lindgren, C. M., Eriksson, K. F., Subramanian, A., Sihag, S., Lehar, J., et al. (2003). PGC-1 α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nature Genetics, 100, 605–610. Mootha, V. K., Lindgren, C. M., Eriksson, K. F., Subramanian, A., Sihag, S., Lehar, J., et al. (2003). PGC-1 α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nature Genetics, 100, 605–610.
14.
Zurück zum Zitat Robbins, H., & Monro, S. (1951). A stochastic approximation method. The Annals of Mathematical Statistics, 22, 400–407.MathSciNetCrossRef Robbins, H., & Monro, S. (1951). A stochastic approximation method. The Annals of Mathematical Statistics, 22, 400–407.MathSciNetCrossRef
15.
Zurück zum Zitat Siegmund, D. (1985). Sequential analysis: tests and confidence intervals. New York: Springer-Verlag.CrossRef Siegmund, D. (1985). Sequential analysis: tests and confidence intervals. New York: Springer-Verlag.CrossRef
16.
Zurück zum Zitat Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., et al. (2005). Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences of the United States of America, 102, 15545–15550. Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., et al. (2005). Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences of the United States of America, 102, 15545–15550.
17.
Zurück zum Zitat Wald, A. (1947). Sequential analysis. New York: John Wiley and Sons.MATH Wald, A. (1947). Sequential analysis. New York: John Wiley and Sons.MATH
18.
Zurück zum Zitat Wald, A. (1948). Optimum character of the sequential probability ratio test. The Annals of Mathematical Statistics, 19, 326–339.MathSciNetCrossRef Wald, A. (1948). Optimum character of the sequential probability ratio test. The Annals of Mathematical Statistics, 19, 326–339.MathSciNetCrossRef
19.
Zurück zum Zitat Yang, H. Y., & Speed, T. (2003). Design and analysis of comparative microarray experiments. In T. Speed (ed.) Statistical analysis of gene expression microarray data (pp. 35–92). Boca Raton, FL: Chapman and Hall. Yang, H. Y., & Speed, T. (2003). Design and analysis of comparative microarray experiments. In T. Speed (ed.) Statistical analysis of gene expression microarray data (pp. 35–92). Boca Raton, FL: Chapman and Hall.
Metadaten
Titel
Applications of Sequential Methods in Multiple Hypothesis Testing
verfasst von
Anthony Almudevar
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-34675-1_6