Where is the bomb: How to estimate the probability, given row and column totals?An inconsistency between the concept of “subindependence” and the chi-square test for independence?How to find probability with multiple pieces of evidence?Probability of independent events given the historyProbability that two random letters from a language will be the same?LLR with Positive and Negative Values vs. Dunning method with Entropy-based CalculationWhat is the probability that a sequence of events completes within a given time interval?Given hit probability and number of projectiles, randomly determine number of hitsProbability of getting the exact same letters in Scrabble 2 turns in a row?

How to prevent pickpocketing in busy bars?

Why does Captain Marvel in the MCU not have her sash?

After viewing logs with journalctl, how do I exit the screen that says "lines 1-2/2 (END)"?

How to convert what I'm singing to notes

How to justify getting additional team member when the current team is doing well?

How can I find Marin?

Is the order of words purely based on convention?

Garage door sticks on a bolt

What does it mean by "my days-of-the-week underwear only go to Thursday" in this context?

How fast can a LN payment be over TOR?

Movie called "Predator" or "Predator 5" with Predator-like creatures inside of people

Why is the Common Agricultural Policy unfavourable to the UK?

When did Unix stop storing passwords in clear text?

I transpose the source code, you transpose the input!

What can Thomas Cook customers who have not yet departed do now it has stopped operating?

Why, even after his imprisonment, people keep calling Hannibal Lecter "Doctor"?

How to stop the death waves in my city?

Why does my browser attempt to download pages from http://clhs.lisp.se instead of viewing them normally?

Convert a string of digits from words to an integer

Where to find the Arxiv endorsement code?

Notebook with version-dependent cells

Where's the mandatory argument of sectioning commands

How come it seems the best way to make a living is to invest in real estate?

Can I exile my opponent's Progenitus/True-Name Nemesis with Teferi, Hero of Dominaria's emblem?



Where is the bomb: How to estimate the probability, given row and column totals?


An inconsistency between the concept of “subindependence” and the chi-square test for independence?How to find probability with multiple pieces of evidence?Probability of independent events given the historyProbability that two random letters from a language will be the same?LLR with Positive and Negative Values vs. Dunning method with Entropy-based CalculationWhat is the probability that a sequence of events completes within a given time interval?Given hit probability and number of projectiles, randomly determine number of hitsProbability of getting the exact same letters in Scrabble 2 turns in a row?






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;








4












$begingroup$


This question is inspired by a mini-game from Pokemon Soulsilver:



Imagine there are 15 bombs hidden on this 5x6 area:



Sums



Now, how would you estimate the probability to find a bomb on a specific field, given the row/column totals?



If you look at column 5 (total bombs = 5), then you might think: Within this column the chance to find a bomb in row 2 is double the chance to find one in row 1.



This (wrong) assumption of direct proportionality, which basically can be described as drawing standard independence-test operations (like in Chi-Square) into the wrong context, would lead to the following estimations:



Chi-Square



As you can see, direct proportionality leads to probability estimates over 100%, and even before that, would be wrong.



So I performed a computational simulation of all possible permutations which led to 276 unique possibilities of placing 15 bombs. (given row and column totals)



Here is the average over the 276 solutions:
Computational solution



This is the correct solution, but due to the exponential computational work, I would like to find an estimation method.



My question is now: Is there an established statistical method to estimating this?
I was wondering if this was a known problem, how it is called and if there are papers/websites you could recommend!










share|cite|improve this question







New contributor



KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$













  • $begingroup$
    Fast and easy approach: For higher number of rows & columns, you could conduct a Monte Carlo simulation, where you would check the random subsample of the possible configurations that is lower then then total number of possibilities. It'd give you an approximate solution.
    $endgroup$
    – Tim
    8 hours ago










  • $begingroup$
    Thanks for your comment! Unfortunately I see a dilemma in this approach: If I just randomly place 15 bombs, I need 268 million tries to get just 70 (non-unique!) solutions. -> It is super unlikely that random placement leads to a valid solution. (=correct totals)
    $endgroup$
    – KaPy3141
    7 hours ago










  • $begingroup$
    The dilemma is that, if I now help the simulation, by using rules how to place the subsequent bombs after the first one. Then I would distort the result! (Every field I start produces a solution -> 50:50 bias)
    $endgroup$
    – KaPy3141
    7 hours ago






  • 1




    $begingroup$
    I dont understand your computational solution. What are the numbers in cells? They certainly don't add up to 100%, it's not PMF. They also dont look like CDF, the right/bottom cell is not 100%
    $endgroup$
    – Aksakal
    4 hours ago






  • 1




    $begingroup$
    @Aksakal These are the marginal probabilities that any given cell contains a bomb. The numbers add to 15, the number of total bombs on the board.
    $endgroup$
    – Dougal
    3 hours ago


















4












$begingroup$


This question is inspired by a mini-game from Pokemon Soulsilver:



Imagine there are 15 bombs hidden on this 5x6 area:



Sums



Now, how would you estimate the probability to find a bomb on a specific field, given the row/column totals?



If you look at column 5 (total bombs = 5), then you might think: Within this column the chance to find a bomb in row 2 is double the chance to find one in row 1.



This (wrong) assumption of direct proportionality, which basically can be described as drawing standard independence-test operations (like in Chi-Square) into the wrong context, would lead to the following estimations:



Chi-Square



As you can see, direct proportionality leads to probability estimates over 100%, and even before that, would be wrong.



So I performed a computational simulation of all possible permutations which led to 276 unique possibilities of placing 15 bombs. (given row and column totals)



Here is the average over the 276 solutions:
Computational solution



This is the correct solution, but due to the exponential computational work, I would like to find an estimation method.



My question is now: Is there an established statistical method to estimating this?
I was wondering if this was a known problem, how it is called and if there are papers/websites you could recommend!










share|cite|improve this question







New contributor



KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$













  • $begingroup$
    Fast and easy approach: For higher number of rows & columns, you could conduct a Monte Carlo simulation, where you would check the random subsample of the possible configurations that is lower then then total number of possibilities. It'd give you an approximate solution.
    $endgroup$
    – Tim
    8 hours ago










  • $begingroup$
    Thanks for your comment! Unfortunately I see a dilemma in this approach: If I just randomly place 15 bombs, I need 268 million tries to get just 70 (non-unique!) solutions. -> It is super unlikely that random placement leads to a valid solution. (=correct totals)
    $endgroup$
    – KaPy3141
    7 hours ago










  • $begingroup$
    The dilemma is that, if I now help the simulation, by using rules how to place the subsequent bombs after the first one. Then I would distort the result! (Every field I start produces a solution -> 50:50 bias)
    $endgroup$
    – KaPy3141
    7 hours ago






  • 1




    $begingroup$
    I dont understand your computational solution. What are the numbers in cells? They certainly don't add up to 100%, it's not PMF. They also dont look like CDF, the right/bottom cell is not 100%
    $endgroup$
    – Aksakal
    4 hours ago






  • 1




    $begingroup$
    @Aksakal These are the marginal probabilities that any given cell contains a bomb. The numbers add to 15, the number of total bombs on the board.
    $endgroup$
    – Dougal
    3 hours ago














4












4








4


2



$begingroup$


This question is inspired by a mini-game from Pokemon Soulsilver:



Imagine there are 15 bombs hidden on this 5x6 area:



Sums



Now, how would you estimate the probability to find a bomb on a specific field, given the row/column totals?



If you look at column 5 (total bombs = 5), then you might think: Within this column the chance to find a bomb in row 2 is double the chance to find one in row 1.



This (wrong) assumption of direct proportionality, which basically can be described as drawing standard independence-test operations (like in Chi-Square) into the wrong context, would lead to the following estimations:



Chi-Square



As you can see, direct proportionality leads to probability estimates over 100%, and even before that, would be wrong.



So I performed a computational simulation of all possible permutations which led to 276 unique possibilities of placing 15 bombs. (given row and column totals)



Here is the average over the 276 solutions:
Computational solution



This is the correct solution, but due to the exponential computational work, I would like to find an estimation method.



My question is now: Is there an established statistical method to estimating this?
I was wondering if this was a known problem, how it is called and if there are papers/websites you could recommend!










share|cite|improve this question







New contributor



KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$




This question is inspired by a mini-game from Pokemon Soulsilver:



Imagine there are 15 bombs hidden on this 5x6 area:



Sums



Now, how would you estimate the probability to find a bomb on a specific field, given the row/column totals?



If you look at column 5 (total bombs = 5), then you might think: Within this column the chance to find a bomb in row 2 is double the chance to find one in row 1.



This (wrong) assumption of direct proportionality, which basically can be described as drawing standard independence-test operations (like in Chi-Square) into the wrong context, would lead to the following estimations:



Chi-Square



As you can see, direct proportionality leads to probability estimates over 100%, and even before that, would be wrong.



So I performed a computational simulation of all possible permutations which led to 276 unique possibilities of placing 15 bombs. (given row and column totals)



Here is the average over the 276 solutions:
Computational solution



This is the correct solution, but due to the exponential computational work, I would like to find an estimation method.



My question is now: Is there an established statistical method to estimating this?
I was wondering if this was a known problem, how it is called and if there are papers/websites you could recommend!







probability estimation chi-squared independence games






share|cite|improve this question







New contributor



KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.










share|cite|improve this question







New contributor



KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








share|cite|improve this question




share|cite|improve this question






New contributor



KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








asked 8 hours ago









KaPy3141KaPy3141

211 bronze badge




211 bronze badge




New contributor



KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.




New contributor




KaPy3141 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • $begingroup$
    Fast and easy approach: For higher number of rows & columns, you could conduct a Monte Carlo simulation, where you would check the random subsample of the possible configurations that is lower then then total number of possibilities. It'd give you an approximate solution.
    $endgroup$
    – Tim
    8 hours ago










  • $begingroup$
    Thanks for your comment! Unfortunately I see a dilemma in this approach: If I just randomly place 15 bombs, I need 268 million tries to get just 70 (non-unique!) solutions. -> It is super unlikely that random placement leads to a valid solution. (=correct totals)
    $endgroup$
    – KaPy3141
    7 hours ago










  • $begingroup$
    The dilemma is that, if I now help the simulation, by using rules how to place the subsequent bombs after the first one. Then I would distort the result! (Every field I start produces a solution -> 50:50 bias)
    $endgroup$
    – KaPy3141
    7 hours ago






  • 1




    $begingroup$
    I dont understand your computational solution. What are the numbers in cells? They certainly don't add up to 100%, it's not PMF. They also dont look like CDF, the right/bottom cell is not 100%
    $endgroup$
    – Aksakal
    4 hours ago






  • 1




    $begingroup$
    @Aksakal These are the marginal probabilities that any given cell contains a bomb. The numbers add to 15, the number of total bombs on the board.
    $endgroup$
    – Dougal
    3 hours ago

















  • $begingroup$
    Fast and easy approach: For higher number of rows & columns, you could conduct a Monte Carlo simulation, where you would check the random subsample of the possible configurations that is lower then then total number of possibilities. It'd give you an approximate solution.
    $endgroup$
    – Tim
    8 hours ago










  • $begingroup$
    Thanks for your comment! Unfortunately I see a dilemma in this approach: If I just randomly place 15 bombs, I need 268 million tries to get just 70 (non-unique!) solutions. -> It is super unlikely that random placement leads to a valid solution. (=correct totals)
    $endgroup$
    – KaPy3141
    7 hours ago










  • $begingroup$
    The dilemma is that, if I now help the simulation, by using rules how to place the subsequent bombs after the first one. Then I would distort the result! (Every field I start produces a solution -> 50:50 bias)
    $endgroup$
    – KaPy3141
    7 hours ago






  • 1




    $begingroup$
    I dont understand your computational solution. What are the numbers in cells? They certainly don't add up to 100%, it's not PMF. They also dont look like CDF, the right/bottom cell is not 100%
    $endgroup$
    – Aksakal
    4 hours ago






  • 1




    $begingroup$
    @Aksakal These are the marginal probabilities that any given cell contains a bomb. The numbers add to 15, the number of total bombs on the board.
    $endgroup$
    – Dougal
    3 hours ago
















$begingroup$
Fast and easy approach: For higher number of rows & columns, you could conduct a Monte Carlo simulation, where you would check the random subsample of the possible configurations that is lower then then total number of possibilities. It'd give you an approximate solution.
$endgroup$
– Tim
8 hours ago




$begingroup$
Fast and easy approach: For higher number of rows & columns, you could conduct a Monte Carlo simulation, where you would check the random subsample of the possible configurations that is lower then then total number of possibilities. It'd give you an approximate solution.
$endgroup$
– Tim
8 hours ago












$begingroup$
Thanks for your comment! Unfortunately I see a dilemma in this approach: If I just randomly place 15 bombs, I need 268 million tries to get just 70 (non-unique!) solutions. -> It is super unlikely that random placement leads to a valid solution. (=correct totals)
$endgroup$
– KaPy3141
7 hours ago




$begingroup$
Thanks for your comment! Unfortunately I see a dilemma in this approach: If I just randomly place 15 bombs, I need 268 million tries to get just 70 (non-unique!) solutions. -> It is super unlikely that random placement leads to a valid solution. (=correct totals)
$endgroup$
– KaPy3141
7 hours ago












$begingroup$
The dilemma is that, if I now help the simulation, by using rules how to place the subsequent bombs after the first one. Then I would distort the result! (Every field I start produces a solution -> 50:50 bias)
$endgroup$
– KaPy3141
7 hours ago




$begingroup$
The dilemma is that, if I now help the simulation, by using rules how to place the subsequent bombs after the first one. Then I would distort the result! (Every field I start produces a solution -> 50:50 bias)
$endgroup$
– KaPy3141
7 hours ago




1




1




$begingroup$
I dont understand your computational solution. What are the numbers in cells? They certainly don't add up to 100%, it's not PMF. They also dont look like CDF, the right/bottom cell is not 100%
$endgroup$
– Aksakal
4 hours ago




$begingroup$
I dont understand your computational solution. What are the numbers in cells? They certainly don't add up to 100%, it's not PMF. They also dont look like CDF, the right/bottom cell is not 100%
$endgroup$
– Aksakal
4 hours ago




1




1




$begingroup$
@Aksakal These are the marginal probabilities that any given cell contains a bomb. The numbers add to 15, the number of total bombs on the board.
$endgroup$
– Dougal
3 hours ago





$begingroup$
@Aksakal These are the marginal probabilities that any given cell contains a bomb. The numbers add to 15, the number of total bombs on the board.
$endgroup$
– Dougal
3 hours ago











1 Answer
1






active

oldest

votes


















3














$begingroup$

There's no unique solution



I don't think that true discrete probability distribution can be recovered, unless you make some additional assumptions. Your situation is basically a problem of recovering the joint distribution from marginals. It is sometimes solved by using copulas in the industry, for example financial risk management, but usually for continuous distributions.



The special case of independent rows and columns is easy: $P_i^j=P_itimes P^j$
where $P_i$ and $P^j$ are marginals of rows and columns. For instance, row $P_6=3/15=0.2$ and column $P^3=3/15=0.2$, hence the probability that a bomb is in row 6 and column 3 is $P_6^3=0.04$. This is assuming that more than one bomb can be placed in a cell, of course. You actually produced this distribution in your first table.



Discrete Copulas



There are discrete copulas, but they have issues: they're not unique. It doesn't make them useless. So, I'd try applying discrete copulas. You can find a good overview of them in Genest, C. and J. Nešlehová (2007). A primer on copulas for count data. Astin Bull. 37(2), 475–515.



Copulas can be especially useful, as they usually allow to explicitly induce dependence, or to estimate it from data when the data is available. I mean the dependence of row and columns when placing bombs. For instance, it could be the case when if the bomb is one the first row, then it is more likely that it will be one the first column too.



Example



Let's apply Kimeldorf and Sampson copula to your data, assuming again that more than one bomb can be placed in a cell. The copula for a dependency parameter $theta$ is defined as:
$$C(u,v)=(u^-theta+u^-theta-1)^-1/theta$$
You can think of $theta$ as an analog of the correlation coefficient.



Independent



Let's start with the case of weak dependence, $theta=0.000001$, where we have the following probabilities (PMF) and marginal PDFs are shown too on the panels on the right and at the bottom:



enter image description here



You can see how in the column 5 the second row probability does have twice higher probability than the first row. This is not wrong contrary to what you seemed to imply in your question. All probability do add up to 100%, of course, as do the marginals on the panels match the frequencies. For instance, the column 5 in the lower panel shows 1/3 which corresponds to stated 5 bombs out of total 15 as expected.



Positive Correlation



For stronger dependency (positive correlation) with $theta=10$ we have the following:



enter image description here



Negative Correlation



The same for stronger but negative correlation (dependency) $theta=-0.2$:



enter image description here



You can see that all probabilities add up to 100%, of course. Also, you can see how dependency impacts the PMF's shape. For positive dependency (correlation) you get the highest PMF concentrated on the diagonal, while for the negative dependency it is off-diagonal



No more than one bomb in a field



My copula solution will not work here without modification. We need to use copulas that hold the constrain that $P_i^jle 1$






share|cite|improve this answer











$endgroup$

















    Your Answer








    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "65"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/4.0/"u003ecc by-sa 4.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );







    KaPy3141 is a new contributor. Be nice, and check out our Code of Conduct.









    draft saved

    draft discarded
















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f428346%2fwhere-is-the-bomb-how-to-estimate-the-probability-given-row-and-column-totals%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    3














    $begingroup$

    There's no unique solution



    I don't think that true discrete probability distribution can be recovered, unless you make some additional assumptions. Your situation is basically a problem of recovering the joint distribution from marginals. It is sometimes solved by using copulas in the industry, for example financial risk management, but usually for continuous distributions.



    The special case of independent rows and columns is easy: $P_i^j=P_itimes P^j$
    where $P_i$ and $P^j$ are marginals of rows and columns. For instance, row $P_6=3/15=0.2$ and column $P^3=3/15=0.2$, hence the probability that a bomb is in row 6 and column 3 is $P_6^3=0.04$. This is assuming that more than one bomb can be placed in a cell, of course. You actually produced this distribution in your first table.



    Discrete Copulas



    There are discrete copulas, but they have issues: they're not unique. It doesn't make them useless. So, I'd try applying discrete copulas. You can find a good overview of them in Genest, C. and J. Nešlehová (2007). A primer on copulas for count data. Astin Bull. 37(2), 475–515.



    Copulas can be especially useful, as they usually allow to explicitly induce dependence, or to estimate it from data when the data is available. I mean the dependence of row and columns when placing bombs. For instance, it could be the case when if the bomb is one the first row, then it is more likely that it will be one the first column too.



    Example



    Let's apply Kimeldorf and Sampson copula to your data, assuming again that more than one bomb can be placed in a cell. The copula for a dependency parameter $theta$ is defined as:
    $$C(u,v)=(u^-theta+u^-theta-1)^-1/theta$$
    You can think of $theta$ as an analog of the correlation coefficient.



    Independent



    Let's start with the case of weak dependence, $theta=0.000001$, where we have the following probabilities (PMF) and marginal PDFs are shown too on the panels on the right and at the bottom:



    enter image description here



    You can see how in the column 5 the second row probability does have twice higher probability than the first row. This is not wrong contrary to what you seemed to imply in your question. All probability do add up to 100%, of course, as do the marginals on the panels match the frequencies. For instance, the column 5 in the lower panel shows 1/3 which corresponds to stated 5 bombs out of total 15 as expected.



    Positive Correlation



    For stronger dependency (positive correlation) with $theta=10$ we have the following:



    enter image description here



    Negative Correlation



    The same for stronger but negative correlation (dependency) $theta=-0.2$:



    enter image description here



    You can see that all probabilities add up to 100%, of course. Also, you can see how dependency impacts the PMF's shape. For positive dependency (correlation) you get the highest PMF concentrated on the diagonal, while for the negative dependency it is off-diagonal



    No more than one bomb in a field



    My copula solution will not work here without modification. We need to use copulas that hold the constrain that $P_i^jle 1$






    share|cite|improve this answer











    $endgroup$



















      3














      $begingroup$

      There's no unique solution



      I don't think that true discrete probability distribution can be recovered, unless you make some additional assumptions. Your situation is basically a problem of recovering the joint distribution from marginals. It is sometimes solved by using copulas in the industry, for example financial risk management, but usually for continuous distributions.



      The special case of independent rows and columns is easy: $P_i^j=P_itimes P^j$
      where $P_i$ and $P^j$ are marginals of rows and columns. For instance, row $P_6=3/15=0.2$ and column $P^3=3/15=0.2$, hence the probability that a bomb is in row 6 and column 3 is $P_6^3=0.04$. This is assuming that more than one bomb can be placed in a cell, of course. You actually produced this distribution in your first table.



      Discrete Copulas



      There are discrete copulas, but they have issues: they're not unique. It doesn't make them useless. So, I'd try applying discrete copulas. You can find a good overview of them in Genest, C. and J. Nešlehová (2007). A primer on copulas for count data. Astin Bull. 37(2), 475–515.



      Copulas can be especially useful, as they usually allow to explicitly induce dependence, or to estimate it from data when the data is available. I mean the dependence of row and columns when placing bombs. For instance, it could be the case when if the bomb is one the first row, then it is more likely that it will be one the first column too.



      Example



      Let's apply Kimeldorf and Sampson copula to your data, assuming again that more than one bomb can be placed in a cell. The copula for a dependency parameter $theta$ is defined as:
      $$C(u,v)=(u^-theta+u^-theta-1)^-1/theta$$
      You can think of $theta$ as an analog of the correlation coefficient.



      Independent



      Let's start with the case of weak dependence, $theta=0.000001$, where we have the following probabilities (PMF) and marginal PDFs are shown too on the panels on the right and at the bottom:



      enter image description here



      You can see how in the column 5 the second row probability does have twice higher probability than the first row. This is not wrong contrary to what you seemed to imply in your question. All probability do add up to 100%, of course, as do the marginals on the panels match the frequencies. For instance, the column 5 in the lower panel shows 1/3 which corresponds to stated 5 bombs out of total 15 as expected.



      Positive Correlation



      For stronger dependency (positive correlation) with $theta=10$ we have the following:



      enter image description here



      Negative Correlation



      The same for stronger but negative correlation (dependency) $theta=-0.2$:



      enter image description here



      You can see that all probabilities add up to 100%, of course. Also, you can see how dependency impacts the PMF's shape. For positive dependency (correlation) you get the highest PMF concentrated on the diagonal, while for the negative dependency it is off-diagonal



      No more than one bomb in a field



      My copula solution will not work here without modification. We need to use copulas that hold the constrain that $P_i^jle 1$






      share|cite|improve this answer











      $endgroup$

















        3














        3










        3







        $begingroup$

        There's no unique solution



        I don't think that true discrete probability distribution can be recovered, unless you make some additional assumptions. Your situation is basically a problem of recovering the joint distribution from marginals. It is sometimes solved by using copulas in the industry, for example financial risk management, but usually for continuous distributions.



        The special case of independent rows and columns is easy: $P_i^j=P_itimes P^j$
        where $P_i$ and $P^j$ are marginals of rows and columns. For instance, row $P_6=3/15=0.2$ and column $P^3=3/15=0.2$, hence the probability that a bomb is in row 6 and column 3 is $P_6^3=0.04$. This is assuming that more than one bomb can be placed in a cell, of course. You actually produced this distribution in your first table.



        Discrete Copulas



        There are discrete copulas, but they have issues: they're not unique. It doesn't make them useless. So, I'd try applying discrete copulas. You can find a good overview of them in Genest, C. and J. Nešlehová (2007). A primer on copulas for count data. Astin Bull. 37(2), 475–515.



        Copulas can be especially useful, as they usually allow to explicitly induce dependence, or to estimate it from data when the data is available. I mean the dependence of row and columns when placing bombs. For instance, it could be the case when if the bomb is one the first row, then it is more likely that it will be one the first column too.



        Example



        Let's apply Kimeldorf and Sampson copula to your data, assuming again that more than one bomb can be placed in a cell. The copula for a dependency parameter $theta$ is defined as:
        $$C(u,v)=(u^-theta+u^-theta-1)^-1/theta$$
        You can think of $theta$ as an analog of the correlation coefficient.



        Independent



        Let's start with the case of weak dependence, $theta=0.000001$, where we have the following probabilities (PMF) and marginal PDFs are shown too on the panels on the right and at the bottom:



        enter image description here



        You can see how in the column 5 the second row probability does have twice higher probability than the first row. This is not wrong contrary to what you seemed to imply in your question. All probability do add up to 100%, of course, as do the marginals on the panels match the frequencies. For instance, the column 5 in the lower panel shows 1/3 which corresponds to stated 5 bombs out of total 15 as expected.



        Positive Correlation



        For stronger dependency (positive correlation) with $theta=10$ we have the following:



        enter image description here



        Negative Correlation



        The same for stronger but negative correlation (dependency) $theta=-0.2$:



        enter image description here



        You can see that all probabilities add up to 100%, of course. Also, you can see how dependency impacts the PMF's shape. For positive dependency (correlation) you get the highest PMF concentrated on the diagonal, while for the negative dependency it is off-diagonal



        No more than one bomb in a field



        My copula solution will not work here without modification. We need to use copulas that hold the constrain that $P_i^jle 1$






        share|cite|improve this answer











        $endgroup$



        There's no unique solution



        I don't think that true discrete probability distribution can be recovered, unless you make some additional assumptions. Your situation is basically a problem of recovering the joint distribution from marginals. It is sometimes solved by using copulas in the industry, for example financial risk management, but usually for continuous distributions.



        The special case of independent rows and columns is easy: $P_i^j=P_itimes P^j$
        where $P_i$ and $P^j$ are marginals of rows and columns. For instance, row $P_6=3/15=0.2$ and column $P^3=3/15=0.2$, hence the probability that a bomb is in row 6 and column 3 is $P_6^3=0.04$. This is assuming that more than one bomb can be placed in a cell, of course. You actually produced this distribution in your first table.



        Discrete Copulas



        There are discrete copulas, but they have issues: they're not unique. It doesn't make them useless. So, I'd try applying discrete copulas. You can find a good overview of them in Genest, C. and J. Nešlehová (2007). A primer on copulas for count data. Astin Bull. 37(2), 475–515.



        Copulas can be especially useful, as they usually allow to explicitly induce dependence, or to estimate it from data when the data is available. I mean the dependence of row and columns when placing bombs. For instance, it could be the case when if the bomb is one the first row, then it is more likely that it will be one the first column too.



        Example



        Let's apply Kimeldorf and Sampson copula to your data, assuming again that more than one bomb can be placed in a cell. The copula for a dependency parameter $theta$ is defined as:
        $$C(u,v)=(u^-theta+u^-theta-1)^-1/theta$$
        You can think of $theta$ as an analog of the correlation coefficient.



        Independent



        Let's start with the case of weak dependence, $theta=0.000001$, where we have the following probabilities (PMF) and marginal PDFs are shown too on the panels on the right and at the bottom:



        enter image description here



        You can see how in the column 5 the second row probability does have twice higher probability than the first row. This is not wrong contrary to what you seemed to imply in your question. All probability do add up to 100%, of course, as do the marginals on the panels match the frequencies. For instance, the column 5 in the lower panel shows 1/3 which corresponds to stated 5 bombs out of total 15 as expected.



        Positive Correlation



        For stronger dependency (positive correlation) with $theta=10$ we have the following:



        enter image description here



        Negative Correlation



        The same for stronger but negative correlation (dependency) $theta=-0.2$:



        enter image description here



        You can see that all probabilities add up to 100%, of course. Also, you can see how dependency impacts the PMF's shape. For positive dependency (correlation) you get the highest PMF concentrated on the diagonal, while for the negative dependency it is off-diagonal



        No more than one bomb in a field



        My copula solution will not work here without modification. We need to use copulas that hold the constrain that $P_i^jle 1$







        share|cite|improve this answer














        share|cite|improve this answer



        share|cite|improve this answer








        edited 2 hours ago

























        answered 7 hours ago









        AksakalAksakal

        41.9k4 gold badges55 silver badges129 bronze badges




        41.9k4 gold badges55 silver badges129 bronze badges
























            KaPy3141 is a new contributor. Be nice, and check out our Code of Conduct.









            draft saved

            draft discarded

















            KaPy3141 is a new contributor. Be nice, and check out our Code of Conduct.












            KaPy3141 is a new contributor. Be nice, and check out our Code of Conduct.











            KaPy3141 is a new contributor. Be nice, and check out our Code of Conduct.














            Thanks for contributing an answer to Cross Validated!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f428346%2fwhere-is-the-bomb-how-to-estimate-the-probability-given-row-and-column-totals%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Canceling a color specificationRandomly assigning color to Graphics3D objects?Default color for Filling in Mathematica 9Coloring specific elements of sets with a prime modified order in an array plotHow to pick a color differing significantly from the colors already in a given color list?Detection of the text colorColor numbers based on their valueCan color schemes for use with ColorData include opacity specification?My dynamic color schemes

            Invision Community Contents History See also References External links Navigation menuProprietaryinvisioncommunity.comIPS Community ForumsIPS Community Forumsthis blog entry"License Changes, IP.Board 3.4, and the Future""Interview -- Matt Mecham of Ibforums""CEO Invision Power Board, Matt Mecham Is a Liar, Thief!"IPB License Explanation 1.3, 1.3.1, 2.0, and 2.1ArchivedSecurity Fixes, Updates And Enhancements For IPB 1.3.1Archived"New Demo Accounts - Invision Power Services"the original"New Default Skin"the original"Invision Power Board 3.0.0 and Applications Released"the original"Archived copy"the original"Perpetual licenses being done away with""Release Notes - Invision Power Services""Introducing: IPS Community Suite 4!"Invision Community Release Notes

            François Viète Contents Biography Work and thought Bibliography See also Notes Further reading External links Navigation menup. 21Google Bookspp. 75–77Google BooksDe thou (from University of Saint Andrews)ArchivedGoogle BooksGoogle BooksGoogle BooksGoogle booksGoogle Bookscc-parthenay.frL'histoire universelle (fr)Universal History (en)ArchivedAdsabs.harvard.eduPagesperso-orange.frArchive.orgChikara Sasaki. Descartes' mathematical thought p.259Google BooksGoogle BooksGoogle Bookspp. 152 and onwardGoogle BooksGoogle BooksScribd.comGoogle Books1257-7979Google BooksGoogle BooksGoogle BooksGoogle BooksGoogle BooksGoogle BooksGallica.bnf.frGoogle BooksGoogle Books"François Viète"Francois Viète: Father of Modern Algebraic NotationThe Lawyer and the GamblerAbout TarporleySite de Jean-Paul GuichardL'algèbre nouvelle"About the Harmonicon"cb120511976(data)1188044800000 0001 0913 5903n82164680ola2013766880073431702w6vt1sb70287374827140948071409480