How likely is sample A and sample B is from distribution C?Learning to create samples from an unknown distributionDifference between null distribution and sampling distributionHow to make a two-tailed hypergeometric test?Why can variance be estimated from a sample taken from an alternative hypothesis?Is this sample drawn from the normal distribution ? using information from both mean and standard deviationInfer a population, and hence a sampling distribution, from a sampleWhen should I use one-sample t-test and when should I use t-test for two population means?How to combine probability plots and hypothesis tests to check normality?Testing if two distributions have the same mean by using a sample distributionHypothesis Testing - Switch hypothesis and get same result?
Should I include salary information on my CV?
How hard is it to sell a home which is currently mortgaged?
Why is Madam Hooch not a professor?
Anagram Within an Anagram!
Does ultrasonic bath cleaning damage laboratory volumetric glassware calibration?
I played my first (rapid) tournament recently and I wanted to calculate my ELO
Children's short story about material that accelerates away from gravity
Set vertical spacing between two particular items
how to remove the dotted white border around focused button text?
MH370 blackbox - is it still possible to retrieve data from it?
If a high rpm motor is run at lower rpm, will it produce more torque?
If my Scout rogue has used his full movement on his turn, can he later use the reaction from the Skirmisher feature to move again?
Are there any vegetarian astronauts?
Three column layout
can’t run a function against EXEC
Can you get infinite turns with this 2 card combo?
How fast can a ship with rotating habitats be accelerated?
Transitive action of a discrete group on a compact space
Do sudoku answers always have a single minimal clue set?
How would a order of Monks that renounce their names communicate effectively?
Difference between 'demás' and 'otros'?
Cross over of arrows in a complex diagram
The use of "I" and "we" used in the same sentence and other questions
Should I report a leak of confidential HR information?
How likely is sample A and sample B is from distribution C?
Learning to create samples from an unknown distributionDifference between null distribution and sampling distributionHow to make a two-tailed hypergeometric test?Why can variance be estimated from a sample taken from an alternative hypothesis?Is this sample drawn from the normal distribution ? using information from both mean and standard deviationInfer a population, and hence a sampling distribution, from a sampleWhen should I use one-sample t-test and when should I use t-test for two population means?How to combine probability plots and hypothesis tests to check normality?Testing if two distributions have the same mean by using a sample distributionHypothesis Testing - Switch hypothesis and get same result?
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;
$begingroup$
Let's say I have a sample A: [0,0,0,1]
and another sample B: [2,0,5,10,100,3,2,6]
I would like to know the probability that A and B are both picked from the same population C.
I tried applying a hypothesis test, but it gives me a p value of approx. 0.39 and I think it should be clear that it's very unlikely that both samples are from the same distribution.
probability hypothesis-testing distributions p-value multivariate-analysis
New contributor
$endgroup$
add a comment |
$begingroup$
Let's say I have a sample A: [0,0,0,1]
and another sample B: [2,0,5,10,100,3,2,6]
I would like to know the probability that A and B are both picked from the same population C.
I tried applying a hypothesis test, but it gives me a p value of approx. 0.39 and I think it should be clear that it's very unlikely that both samples are from the same distribution.
probability hypothesis-testing distributions p-value multivariate-analysis
New contributor
$endgroup$
$begingroup$
I'm guessing you used a pooled 2-sample t test, which is not a good choice here because sample sizes are small, 100 is a far outlier, and sample variances are hugely different. But your intuition that these data are not likely to have come from the same population is correct.
$endgroup$
– BruceET
7 hours ago
$begingroup$
As phrased the question (which contains a request for a probability), appears to be framed as a Bayesian problem. I expect that a Bayesian analysis is likely not the OP's intent, but if answers talk about hypothesis tests they should also discuss what question those answer (in place of what the question asks).
$endgroup$
– Glen_b♦
1 hour ago
add a comment |
$begingroup$
Let's say I have a sample A: [0,0,0,1]
and another sample B: [2,0,5,10,100,3,2,6]
I would like to know the probability that A and B are both picked from the same population C.
I tried applying a hypothesis test, but it gives me a p value of approx. 0.39 and I think it should be clear that it's very unlikely that both samples are from the same distribution.
probability hypothesis-testing distributions p-value multivariate-analysis
New contributor
$endgroup$
Let's say I have a sample A: [0,0,0,1]
and another sample B: [2,0,5,10,100,3,2,6]
I would like to know the probability that A and B are both picked from the same population C.
I tried applying a hypothesis test, but it gives me a p value of approx. 0.39 and I think it should be clear that it's very unlikely that both samples are from the same distribution.
probability hypothesis-testing distributions p-value multivariate-analysis
probability hypothesis-testing distributions p-value multivariate-analysis
New contributor
New contributor
New contributor
asked 8 hours ago
Franc WeserFranc Weser
113 bronze badges
113 bronze badges
New contributor
New contributor
$begingroup$
I'm guessing you used a pooled 2-sample t test, which is not a good choice here because sample sizes are small, 100 is a far outlier, and sample variances are hugely different. But your intuition that these data are not likely to have come from the same population is correct.
$endgroup$
– BruceET
7 hours ago
$begingroup$
As phrased the question (which contains a request for a probability), appears to be framed as a Bayesian problem. I expect that a Bayesian analysis is likely not the OP's intent, but if answers talk about hypothesis tests they should also discuss what question those answer (in place of what the question asks).
$endgroup$
– Glen_b♦
1 hour ago
add a comment |
$begingroup$
I'm guessing you used a pooled 2-sample t test, which is not a good choice here because sample sizes are small, 100 is a far outlier, and sample variances are hugely different. But your intuition that these data are not likely to have come from the same population is correct.
$endgroup$
– BruceET
7 hours ago
$begingroup$
As phrased the question (which contains a request for a probability), appears to be framed as a Bayesian problem. I expect that a Bayesian analysis is likely not the OP's intent, but if answers talk about hypothesis tests they should also discuss what question those answer (in place of what the question asks).
$endgroup$
– Glen_b♦
1 hour ago
$begingroup$
I'm guessing you used a pooled 2-sample t test, which is not a good choice here because sample sizes are small, 100 is a far outlier, and sample variances are hugely different. But your intuition that these data are not likely to have come from the same population is correct.
$endgroup$
– BruceET
7 hours ago
$begingroup$
I'm guessing you used a pooled 2-sample t test, which is not a good choice here because sample sizes are small, 100 is a far outlier, and sample variances are hugely different. But your intuition that these data are not likely to have come from the same population is correct.
$endgroup$
– BruceET
7 hours ago
$begingroup$
As phrased the question (which contains a request for a probability), appears to be framed as a Bayesian problem. I expect that a Bayesian analysis is likely not the OP's intent, but if answers talk about hypothesis tests they should also discuss what question those answer (in place of what the question asks).
$endgroup$
– Glen_b♦
1 hour ago
$begingroup$
As phrased the question (which contains a request for a probability), appears to be framed as a Bayesian problem. I expect that a Bayesian analysis is likely not the OP's intent, but if answers talk about hypothesis tests they should also discuss what question those answer (in place of what the question asks).
$endgroup$
– Glen_b♦
1 hour ago
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
You don't say what kind of hypothesis test you used.
Doing inference on such small samples as these is always
going to be difficult. However, a nonparametric Kolmogorov-Smirnov test (in R) does reject the null hypothesis that these
two samples were randomly sampled from the same population.
There is a warning message that (on account of the ties), the P-value is not exact, but 0.034 seems sufficiently smaller than 0.05 to say that we can reject at the 5% level.
x1 = c(0,0,0,1)
x2 = c(2,0,5,10,100,3,2,6)
ks.test(x1, x2)
Two-sample Kolmogorov-Smirnov test
data: x1 and x2
D = 0.875, p-value = 0.0337
alternative hypothesis: two-sided
Warning message:
In ks.test(x1, x2) : cannot compute exact p-value with ties
Similar data without ties gives a 'cleaner' test--rejecting the null hypothesis with no warning messages.
y1 = c(.01, .02, .03, .9)
y2 = c(2,0,5,10,100,3,2.1,6)
ks.test(y1, y2)
Two-sample Kolmogorov-Smirnov test
data: y1 and y2
D = 0.875, p-value = 0.0202
alternative hypothesis: two-sided
Another possible test is the two-sample Wilcoxon (rank sum test). Its distribution theory is also somewhat disturbed by ties, but it does find a significant difference between your two samples. Looking just at the P-value, we have:
wilcox.test(x1,x2)$p.val
[1] 0.02434338
Warning message:
In wilcox.test.default(x1, x2) :
cannot compute exact p-value with ties
$endgroup$
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "65"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Franc Weser is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f414335%2fhow-likely-is-sample-a-and-sample-b-is-from-distribution-c%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
You don't say what kind of hypothesis test you used.
Doing inference on such small samples as these is always
going to be difficult. However, a nonparametric Kolmogorov-Smirnov test (in R) does reject the null hypothesis that these
two samples were randomly sampled from the same population.
There is a warning message that (on account of the ties), the P-value is not exact, but 0.034 seems sufficiently smaller than 0.05 to say that we can reject at the 5% level.
x1 = c(0,0,0,1)
x2 = c(2,0,5,10,100,3,2,6)
ks.test(x1, x2)
Two-sample Kolmogorov-Smirnov test
data: x1 and x2
D = 0.875, p-value = 0.0337
alternative hypothesis: two-sided
Warning message:
In ks.test(x1, x2) : cannot compute exact p-value with ties
Similar data without ties gives a 'cleaner' test--rejecting the null hypothesis with no warning messages.
y1 = c(.01, .02, .03, .9)
y2 = c(2,0,5,10,100,3,2.1,6)
ks.test(y1, y2)
Two-sample Kolmogorov-Smirnov test
data: y1 and y2
D = 0.875, p-value = 0.0202
alternative hypothesis: two-sided
Another possible test is the two-sample Wilcoxon (rank sum test). Its distribution theory is also somewhat disturbed by ties, but it does find a significant difference between your two samples. Looking just at the P-value, we have:
wilcox.test(x1,x2)$p.val
[1] 0.02434338
Warning message:
In wilcox.test.default(x1, x2) :
cannot compute exact p-value with ties
$endgroup$
add a comment |
$begingroup$
You don't say what kind of hypothesis test you used.
Doing inference on such small samples as these is always
going to be difficult. However, a nonparametric Kolmogorov-Smirnov test (in R) does reject the null hypothesis that these
two samples were randomly sampled from the same population.
There is a warning message that (on account of the ties), the P-value is not exact, but 0.034 seems sufficiently smaller than 0.05 to say that we can reject at the 5% level.
x1 = c(0,0,0,1)
x2 = c(2,0,5,10,100,3,2,6)
ks.test(x1, x2)
Two-sample Kolmogorov-Smirnov test
data: x1 and x2
D = 0.875, p-value = 0.0337
alternative hypothesis: two-sided
Warning message:
In ks.test(x1, x2) : cannot compute exact p-value with ties
Similar data without ties gives a 'cleaner' test--rejecting the null hypothesis with no warning messages.
y1 = c(.01, .02, .03, .9)
y2 = c(2,0,5,10,100,3,2.1,6)
ks.test(y1, y2)
Two-sample Kolmogorov-Smirnov test
data: y1 and y2
D = 0.875, p-value = 0.0202
alternative hypothesis: two-sided
Another possible test is the two-sample Wilcoxon (rank sum test). Its distribution theory is also somewhat disturbed by ties, but it does find a significant difference between your two samples. Looking just at the P-value, we have:
wilcox.test(x1,x2)$p.val
[1] 0.02434338
Warning message:
In wilcox.test.default(x1, x2) :
cannot compute exact p-value with ties
$endgroup$
add a comment |
$begingroup$
You don't say what kind of hypothesis test you used.
Doing inference on such small samples as these is always
going to be difficult. However, a nonparametric Kolmogorov-Smirnov test (in R) does reject the null hypothesis that these
two samples were randomly sampled from the same population.
There is a warning message that (on account of the ties), the P-value is not exact, but 0.034 seems sufficiently smaller than 0.05 to say that we can reject at the 5% level.
x1 = c(0,0,0,1)
x2 = c(2,0,5,10,100,3,2,6)
ks.test(x1, x2)
Two-sample Kolmogorov-Smirnov test
data: x1 and x2
D = 0.875, p-value = 0.0337
alternative hypothesis: two-sided
Warning message:
In ks.test(x1, x2) : cannot compute exact p-value with ties
Similar data without ties gives a 'cleaner' test--rejecting the null hypothesis with no warning messages.
y1 = c(.01, .02, .03, .9)
y2 = c(2,0,5,10,100,3,2.1,6)
ks.test(y1, y2)
Two-sample Kolmogorov-Smirnov test
data: y1 and y2
D = 0.875, p-value = 0.0202
alternative hypothesis: two-sided
Another possible test is the two-sample Wilcoxon (rank sum test). Its distribution theory is also somewhat disturbed by ties, but it does find a significant difference between your two samples. Looking just at the P-value, we have:
wilcox.test(x1,x2)$p.val
[1] 0.02434338
Warning message:
In wilcox.test.default(x1, x2) :
cannot compute exact p-value with ties
$endgroup$
You don't say what kind of hypothesis test you used.
Doing inference on such small samples as these is always
going to be difficult. However, a nonparametric Kolmogorov-Smirnov test (in R) does reject the null hypothesis that these
two samples were randomly sampled from the same population.
There is a warning message that (on account of the ties), the P-value is not exact, but 0.034 seems sufficiently smaller than 0.05 to say that we can reject at the 5% level.
x1 = c(0,0,0,1)
x2 = c(2,0,5,10,100,3,2,6)
ks.test(x1, x2)
Two-sample Kolmogorov-Smirnov test
data: x1 and x2
D = 0.875, p-value = 0.0337
alternative hypothesis: two-sided
Warning message:
In ks.test(x1, x2) : cannot compute exact p-value with ties
Similar data without ties gives a 'cleaner' test--rejecting the null hypothesis with no warning messages.
y1 = c(.01, .02, .03, .9)
y2 = c(2,0,5,10,100,3,2.1,6)
ks.test(y1, y2)
Two-sample Kolmogorov-Smirnov test
data: y1 and y2
D = 0.875, p-value = 0.0202
alternative hypothesis: two-sided
Another possible test is the two-sample Wilcoxon (rank sum test). Its distribution theory is also somewhat disturbed by ties, but it does find a significant difference between your two samples. Looking just at the P-value, we have:
wilcox.test(x1,x2)$p.val
[1] 0.02434338
Warning message:
In wilcox.test.default(x1, x2) :
cannot compute exact p-value with ties
edited 7 hours ago
answered 7 hours ago
BruceETBruceET
9,5581 gold badge8 silver badges24 bronze badges
9,5581 gold badge8 silver badges24 bronze badges
add a comment |
add a comment |
Franc Weser is a new contributor. Be nice, and check out our Code of Conduct.
Franc Weser is a new contributor. Be nice, and check out our Code of Conduct.
Franc Weser is a new contributor. Be nice, and check out our Code of Conduct.
Franc Weser is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Cross Validated!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f414335%2fhow-likely-is-sample-a-and-sample-b-is-from-distribution-c%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
I'm guessing you used a pooled 2-sample t test, which is not a good choice here because sample sizes are small, 100 is a far outlier, and sample variances are hugely different. But your intuition that these data are not likely to have come from the same population is correct.
$endgroup$
– BruceET
7 hours ago
$begingroup$
As phrased the question (which contains a request for a probability), appears to be framed as a Bayesian problem. I expect that a Bayesian analysis is likely not the OP's intent, but if answers talk about hypothesis tests they should also discuss what question those answer (in place of what the question asks).
$endgroup$
– Glen_b♦
1 hour ago