Why do we not always use the closed testing principle for multiple comparisons?Multiple hypothesis testing and F tests?What does independence between comparisons in multiple comparisons mean?Why aren't multiple hypothesis corrections applied to all experiments since the dawn of time?FWER, FDR and multiple comparisons for BeginnersPlease convince me why I should bother correcting p values with multiple comparisons, because I don't see the pointAfter discovering alpha error inflation, how to estimate severity of the effect?

Can 35 mm film which went through a washing machine still be developed?

Generating Roman numerals with dice

Is the "spacetime" the same thing as the mathematical 4th dimension?

How to plausibly write a character with a hidden skill

Non-electric Laser

Found a minor bug, affecting 1% of users. What should QA do?

Does Hogwarts have its own anthem?

Why is the time of useful consciousness only seconds at high altitudes?

What's the correct way to determine turn order in this situation?

Does the DOJ's declining to investigate the Trump-Zelensky call ruin the basis for impeachment?

Quote to show students don't have to fear making mistakes

Did the Soviet army intentionally send troops (e.g. penal battalions) running over minefields?

How much lead is really on my hands when soldering?

Would houseruling two or more instances of resistance to the same element as immunity be overly unbalanced?

"cd" into /sys/kernel/debug/tracing causes permission change

Notation clarity question for a conglomerate of accidentals

Why do personal finance apps focus on outgoings rather than income

How Concerning Is This X-CU-modified: FAKECU Text Attack?

What does a textbook look like while you are writing it?

Was there an autocomplete utility in MS-DOS?

electricity bill doubled

What makes a character irredeemable?

Booting Ubuntu from USB drive on MSI motherboard -- EVERYTHING fails

Realistically, how much do you need to start investing?

Why do we not always use the closed testing principle for multiple comparisons?

Multiple hypothesis testing and F tests?What does independence between comparisons in multiple comparisons mean?Why aren't multiple hypothesis corrections applied to all experiments since the dawn of time?FWER, FDR and multiple comparisons for BeginnersPlease convince me why I should bother correcting p values with multiple comparisons, because I don't see the pointAfter discovering alpha error inflation, how to estimate severity of the effect?

.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty
margin-bottom:0;

From the Wikipedia entry:

Suppose there are k hypotheses $H_1,..., H_k$ to be tested and the overall
type I error rate is $alpha$. The closed testing principle allows the
rejection of any one of these elementary hypotheses, say $H_i$, if all
possible intersection hypotheses involving $H_i$ can be rejected by using
valid local level $alpha$ tests. It controls the familywise error rate for
all the k hypotheses at level α in the strong sense.

By using the closed testing principle, we ensure all the false positive rate of all hypothesis tests in a family will be controlled at $alpha$. This seems too good to be true. Why do we bother with Bonferonni or other corrections for multiple comparisons, when we can just test all the intersection hypotheses? The only downside seems to be that you need to conduct $2^n$ tests.

EDIT: I am thinking about a relatively small number of individual hypothesis tests, say $n = 3 textor 5$. Then you would have a reasonable number of tests.

edited 8 hours ago

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

2

$begingroup$
Your "only" downside is a huge one! It limits practical procedures to $nll 30$ or so, and in cases where the tests are onerous to compute (think bootstrapping, for instance) to $nll 10.$
$endgroup$
– whuber♦
8 hours ago

$begingroup$
@whuber Not really true, is it? The closed testing principle is used all the time.
$endgroup$
– Björn
8 hours ago

$begingroup$
@whuber, I had in mind a relatively small number of hypothesis tests. For example, I'm curious why didn't I learn this for ANOVA with two treatments and an interaction.
$endgroup$
– EliK
8 hours ago

$begingroup$
I wonder if this sucks power from the test. Sure, you control $alpha$, but at what cost? If even the Bonferroni correction is more powerful, then I think that this would be an inadmissable testing procedure. (Confession: I don't know this to be the case, and it might be totaly wrong.)
$endgroup$
– Dave
8 hours ago

$begingroup$
@Dave I don't think Bonferroni is more powerful, but I'm not sure.
$endgroup$
– EliK
7 hours ago

|
show 1 more comment

From the Wikipedia entry:

Suppose there are k hypotheses $H_1,..., H_k$ to be tested and the overall
type I error rate is $alpha$. The closed testing principle allows the
rejection of any one of these elementary hypotheses, say $H_i$, if all
possible intersection hypotheses involving $H_i$ can be rejected by using
valid local level $alpha$ tests. It controls the familywise error rate for
all the k hypotheses at level α in the strong sense.

EDIT: I am thinking about a relatively small number of individual hypothesis tests, say $n = 3 textor 5$. Then you would have a reasonable number of tests.

edited 8 hours ago

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

2

$begingroup$
Your "only" downside is a huge one! It limits practical procedures to $nll 30$ or so, and in cases where the tests are onerous to compute (think bootstrapping, for instance) to $nll 10.$
$endgroup$
– whuber♦
8 hours ago

$begingroup$
@whuber Not really true, is it? The closed testing principle is used all the time.
$endgroup$
– Björn
8 hours ago

$begingroup$
@whuber, I had in mind a relatively small number of hypothesis tests. For example, I'm curious why didn't I learn this for ANOVA with two treatments and an interaction.
$endgroup$
– EliK
8 hours ago

$begingroup$
I wonder if this sucks power from the test. Sure, you control $alpha$, but at what cost? If even the Bonferroni correction is more powerful, then I think that this would be an inadmissable testing procedure. (Confession: I don't know this to be the case, and it might be totaly wrong.)
$endgroup$
– Dave
8 hours ago

$begingroup$
@Dave I don't think Bonferroni is more powerful, but I'm not sure.
$endgroup$
– EliK
7 hours ago

|
show 1 more comment

From the Wikipedia entry:

Suppose there are k hypotheses $H_1,..., H_k$ to be tested and the overall
type I error rate is $alpha$. The closed testing principle allows the
rejection of any one of these elementary hypotheses, say $H_i$, if all
possible intersection hypotheses involving $H_i$ can be rejected by using
valid local level $alpha$ tests. It controls the familywise error rate for
all the k hypotheses at level α in the strong sense.

EDIT: I am thinking about a relatively small number of individual hypothesis tests, say $n = 3 textor 5$. Then you would have a reasonable number of tests.

edited 8 hours ago

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

From the Wikipedia entry:

Suppose there are k hypotheses $H_1,..., H_k$ to be tested and the overall
type I error rate is $alpha$. The closed testing principle allows the
rejection of any one of these elementary hypotheses, say $H_i$, if all
possible intersection hypotheses involving $H_i$ can be rejected by using
valid local level $alpha$ tests. It controls the familywise error rate for
all the k hypotheses at level α in the strong sense.

EDIT: I am thinking about a relatively small number of individual hypothesis tests, say $n = 3 textor 5$. Then you would have a reasonable number of tests.

hypothesis-testing statistical-significance p-value multiple-comparisons

edited 8 hours ago

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

edited 8 hours ago

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

edited 8 hours ago

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

asked 8 hours ago

EliK

4023 silver badges15 bronze badges

2

$begingroup$
Your "only" downside is a huge one! It limits practical procedures to $nll 30$ or so, and in cases where the tests are onerous to compute (think bootstrapping, for instance) to $nll 10.$
$endgroup$
– whuber♦
8 hours ago

$begingroup$
@whuber Not really true, is it? The closed testing principle is used all the time.
$endgroup$
– Björn
8 hours ago

$begingroup$
@whuber, I had in mind a relatively small number of hypothesis tests. For example, I'm curious why didn't I learn this for ANOVA with two treatments and an interaction.
$endgroup$
– EliK
8 hours ago

$begingroup$
I wonder if this sucks power from the test. Sure, you control $alpha$, but at what cost? If even the Bonferroni correction is more powerful, then I think that this would be an inadmissable testing procedure. (Confession: I don't know this to be the case, and it might be totaly wrong.)
$endgroup$
– Dave
8 hours ago

$begingroup$
@Dave I don't think Bonferroni is more powerful, but I'm not sure.
$endgroup$
– EliK
7 hours ago

|
show 1 more comment

2

$begingroup$
Your "only" downside is a huge one! It limits practical procedures to $nll 30$ or so, and in cases where the tests are onerous to compute (think bootstrapping, for instance) to $nll 10.$
$endgroup$
– whuber♦
8 hours ago

$begingroup$
@whuber Not really true, is it? The closed testing principle is used all the time.
$endgroup$
– Björn
8 hours ago

$begingroup$
@whuber, I had in mind a relatively small number of hypothesis tests. For example, I'm curious why didn't I learn this for ANOVA with two treatments and an interaction.
$endgroup$
– EliK
8 hours ago

$begingroup$
I wonder if this sucks power from the test. Sure, you control $alpha$, but at what cost? If even the Bonferroni correction is more powerful, then I think that this would be an inadmissable testing procedure. (Confession: I don't know this to be the case, and it might be totaly wrong.)
$endgroup$
– Dave
8 hours ago

$begingroup$
@Dave I don't think Bonferroni is more powerful, but I'm not sure.
$endgroup$
– EliK
7 hours ago

Your "only" downside is a huge one! It limits practical procedures to $nll 30$ or so, and in cases where the tests are onerous to compute (think bootstrapping, for instance) to $nll 10.$

– whuber♦
8 hours ago

@whuber Not really true, is it? The closed testing principle is used all the time.

– Björn
8 hours ago

@whuber, I had in mind a relatively small number of hypothesis tests. For example, I'm curious why didn't I learn this for ANOVA with two treatments and an interaction.

– EliK
8 hours ago

I wonder if this sucks power from the test. Sure, you control $alpha$, but at what cost? If even the Bonferroni correction is more powerful, then I think that this would be an inadmissable testing procedure. (Confession: I don't know this to be the case, and it might be totaly wrong.)

– Dave
8 hours ago

@Dave I don't think Bonferroni is more powerful, but I'm not sure.

– EliK
7 hours ago

|
show 1 more comment

1 Answer
1

active

oldest

votes

It is actually used quite often (in fact, Bonferroni and Bonferroni-Holm can be shown to be valid tests using the closed testing principle - see below). Part of the reason why some simple procedures like Bonferroni are still so popular is of course, that they are very easy to implement and it is easy to communicate what you did.

E.g. even for the simple Bonferroni or Bonferroni-Holm writing down the tests for the intersection null hypotheses is a bit tedious. That's why the graphical approach to constructing closed testing procedures is very popular - at least in the clinical trials setting (where there is usually a trained statistician at hand that can help pre-specify a tailored testing procedure for a specific trial). This graphical approach provides a really easy way to construct valid testing procedures and makes communication easier. Various variants, extensions and improvements such as exploiting correlations, prioritization of certain hypotheses depending on which hypotheses are already rejected etc. have been proposed over the years.

Bonferroni vs. Bonferroni-Holm as closed testing procedures example

The Bonferroni test can be easily written down in terms of the closed testing principle, e.g. in the case of two null hypotheses as

Test for $H_1 cap H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for elementary null hypothesis $H_1$: $p_1 leq alpha/2$

Test for elementary null hypothesis $H_2$: $p_2 leq alpha/2$

with $H_1$ rejected overall, if both $H_1$ and $H_1 cap H_2$ are rejected. Thus, we know it is a valid test. You also immediately see that for the elementary null hypotheses $H_1$ and $H_2$ you are using a test that is not exhausting the significance level and that the Bonferroni-Holm test

Test for $H_1$ and $H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for $H_1$: $p_1 leq alpha$

Test for $H_2$: $p_2 leq alpha$

is uniformly more powerful.

Figure 1: Illustration of the graphical approach for displaying these two procedures
Bonferroni and Bonf.-Holm as graphical closed testing procedures

Power considerations and other considerations

Usually, there's no downside in terms of power to using a closed testing procedure - if anything it tends to be easier to make sure the test fully exhausts the significance level. It can sometimes be a bit tricky to build in some desired features (e.g. exploiting a known correlation) while keep some other desired contraints (e.g. that the closed testing procedure can be easily displayed graphically due to the consistency requirement on the testing procedure).

edited 7 hours ago

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

$begingroup$
Thank you. I have a followup question. Let's ignore changing the importance of particular test or accounting for correlation. If I can construct a uniformally most powerful test (UMP) for every hypothesis test in the family, is that the most powerful procedure I can construct that still controls the Family wise Error Rate.
$endgroup$
– EliK
7 hours ago

$begingroup$
I didn't see your addition when I asked my question. I think your addition covers my question. Thank you.
$endgroup$
– EliK
7 hours ago

add a comment
|

Your Answer

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "65"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/4.0/"u003ecc by-sa 4.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f428996%2fwhy-do-we-not-always-use-the-closed-testing-principle-for-multiple-comparisons%23new-answer', 'question_page');

);

Post as a guest

Name

Required, but never shown

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

Bonferroni vs. Bonferroni-Holm as closed testing procedures example

The Bonferroni test can be easily written down in terms of the closed testing principle, e.g. in the case of two null hypotheses as

Test for $H_1 cap H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for elementary null hypothesis $H_1$: $p_1 leq alpha/2$

Test for elementary null hypothesis $H_2$: $p_2 leq alpha/2$

Test for $H_1$ and $H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for $H_1$: $p_1 leq alpha$

Test for $H_2$: $p_2 leq alpha$

is uniformly more powerful.

Figure 1: Illustration of the graphical approach for displaying these two procedures
Bonferroni and Bonf.-Holm as graphical closed testing procedures

Power considerations and other considerations

edited 7 hours ago

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

$begingroup$
Thank you. I have a followup question. Let's ignore changing the importance of particular test or accounting for correlation. If I can construct a uniformally most powerful test (UMP) for every hypothesis test in the family, is that the most powerful procedure I can construct that still controls the Family wise Error Rate.
$endgroup$
– EliK
7 hours ago

$begingroup$
I didn't see your addition when I asked my question. I think your addition covers my question. Thank you.
$endgroup$
– EliK
7 hours ago

add a comment
|

Bonferroni vs. Bonferroni-Holm as closed testing procedures example

The Bonferroni test can be easily written down in terms of the closed testing principle, e.g. in the case of two null hypotheses as

Test for $H_1 cap H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for elementary null hypothesis $H_1$: $p_1 leq alpha/2$

Test for elementary null hypothesis $H_2$: $p_2 leq alpha/2$

Test for $H_1$ and $H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for $H_1$: $p_1 leq alpha$

Test for $H_2$: $p_2 leq alpha$

is uniformly more powerful.

Figure 1: Illustration of the graphical approach for displaying these two procedures
Bonferroni and Bonf.-Holm as graphical closed testing procedures

Power considerations and other considerations

edited 7 hours ago

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

$begingroup$
Thank you. I have a followup question. Let's ignore changing the importance of particular test or accounting for correlation. If I can construct a uniformally most powerful test (UMP) for every hypothesis test in the family, is that the most powerful procedure I can construct that still controls the Family wise Error Rate.
$endgroup$
– EliK
7 hours ago

$begingroup$
I didn't see your addition when I asked my question. I think your addition covers my question. Thank you.
$endgroup$
– EliK
7 hours ago

add a comment
|

Bonferroni vs. Bonferroni-Holm as closed testing procedures example

The Bonferroni test can be easily written down in terms of the closed testing principle, e.g. in the case of two null hypotheses as

Test for $H_1 cap H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for elementary null hypothesis $H_1$: $p_1 leq alpha/2$

Test for elementary null hypothesis $H_2$: $p_2 leq alpha/2$

Test for $H_1$ and $H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for $H_1$: $p_1 leq alpha$

Test for $H_2$: $p_2 leq alpha$

is uniformly more powerful.

Figure 1: Illustration of the graphical approach for displaying these two procedures
Bonferroni and Bonf.-Holm as graphical closed testing procedures

Power considerations and other considerations

edited 7 hours ago

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

Bonferroni vs. Bonferroni-Holm as closed testing procedures example

The Bonferroni test can be easily written down in terms of the closed testing principle, e.g. in the case of two null hypotheses as

Test for $H_1 cap H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for elementary null hypothesis $H_1$: $p_1 leq alpha/2$

Test for elementary null hypothesis $H_2$: $p_2 leq alpha/2$

Test for $H_1$ and $H_2$: $p_1 leq alpha/2$ and $p_2 leq alpha/2$

Test for $H_1$: $p_1 leq alpha$

Test for $H_2$: $p_2 leq alpha$

is uniformly more powerful.

Figure 1: Illustration of the graphical approach for displaying these two procedures
Bonferroni and Bonf.-Holm as graphical closed testing procedures

Power considerations and other considerations

edited 7 hours ago

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

edited 7 hours ago

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

answered 7 hours ago

Björn

12.7k2 gold badges12 silver badges46 bronze badges

$begingroup$
Thank you. I have a followup question. Let's ignore changing the importance of particular test or accounting for correlation. If I can construct a uniformally most powerful test (UMP) for every hypothesis test in the family, is that the most powerful procedure I can construct that still controls the Family wise Error Rate.
$endgroup$
– EliK
7 hours ago

$begingroup$
I didn't see your addition when I asked my question. I think your addition covers my question. Thank you.
$endgroup$
– EliK
7 hours ago

add a comment
|

$begingroup$
Thank you. I have a followup question. Let's ignore changing the importance of particular test or accounting for correlation. If I can construct a uniformally most powerful test (UMP) for every hypothesis test in the family, is that the most powerful procedure I can construct that still controls the Family wise Error Rate.
$endgroup$
– EliK
7 hours ago

$begingroup$
I didn't see your addition when I asked my question. I think your addition covers my question. Thank you.
$endgroup$
– EliK
7 hours ago

Thank you. I have a followup question. Let's ignore changing the importance of particular test or accounting for correlation. If I can construct a uniformally most powerful test (UMP) for every hypothesis test in the family, is that the most powerful procedure I can construct that still controls the Family wise Error Rate.

– EliK
7 hours ago

I didn't see your addition when I asked my question. I think your addition covers my question. Thank you.

– EliK
7 hours ago

add a comment
|

draft saved

draft discarded

Thanks for contributing an answer to Cross Validated!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Mfcttrf

1 Answer
1

Your Answer

Post as a guest

1 Answer
1

1 Answer
1

Post as a guest

Popular posts from this blog

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Post as a guest

1 Answer 1

1 Answer 1

Sign up or log in

Post as a guest

Post as a guest

Sign up or log in

Post as a guest

Sign up or log in

Post as a guest

Sign up or log in

Post as a guest

Popular posts from this blog

1 Answer
1

1 Answer
1

1 Answer
1