Optimal set order to maximize stochastic rewardStochastic programming MIP solversHow to determine the correct level of detail when modelling?Bin Packing with Relational PenalizationFinding an optimal set without forbidden subsetsRunning a linear programming model to maximize binned predictionsHow to modify EMSR when capacity for each fare class is differentStatic stochastic knapsack problem: unbounded versionVariable fixing based on a good feasible solution

Front hydraulic disk brake is too powerful on MTB — solutions?

Why is the empennage design of this Lockheed 1049E so complicated?

"A tin of biscuits" vs "A biscuit tin"

A Problem of Succession

XDSL Internet Connection Over Optical Fibre Telephone Line?

How can I learn to write better questions to test for conceptual understanding?

Is Schrodinger's Cat itself an observer?

Why do adjectives come before nouns in English?

Dice? = Is that so?

Which culture used no personal names?

Is it now possible to undetectably cross the Arctic Ocean on ski/kayak?

Who can change WIP limit in DoW when urgent work emerges?

Is there a push, in the United States, to use gender-neutral language and gender pronouns (when they are given)?

Optimal set order to maximize stochastic reward

Should I avoid "big words" when writing to a younger audience?

What‘s wrong with my proof of the Law of Total Variance?

Are Snap and Flatpack apps safe to install? Are they "official", approved or tested for a particular distro version?

Neither Raman nor IR Active vibrational modes

Does my protagonist need to be the most important character?

How to protect my Wi-Fi password from being displayed by Android phones when sharing it with QR code?

Incorrect mmap behavior when assembly files included in the project

Remove last letter 4 times, get a real word each time, starting word is a car model

Usefulness of Nash embedding theorem

Is anyone against the rational teachings of Maimonides?



Optimal set order to maximize stochastic reward


Stochastic programming MIP solversHow to determine the correct level of detail when modelling?Bin Packing with Relational PenalizationFinding an optimal set without forbidden subsetsRunning a linear programming model to maximize binned predictionsHow to modify EMSR when capacity for each fare class is differentStatic stochastic knapsack problem: unbounded versionVariable fixing based on a good feasible solution













5














$begingroup$


You have a ticket allowing you to visit up to $n$ of $M$ carnival booths offering games of chance. At each booth you have probability $p_i$ of winning a reward with average value $r_i$. Each booth can be visited once at most. You can visit the booths in any order but you can only win once, and you have to accept the first reward that you win. The objective is to select the subset of booths to visit, and the order in which to visit them, to maximize the expected reward.



It's easy enough to solve the problem using brute force. The expected value of the reward can be calculated for each permutation of $n$ of $M$ booths.



What would be a more computationally feasible way to formulate the problem?










share|improve this question









New contributor



sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$















  • $begingroup$
    Hi sedge and welcome to OR.SE, I couldn't understand this part: "You can visit the booths in any order but once you win a reward you are finished playing." Can you explain a little more?
    $endgroup$
    – Oguz Toragay
    9 hours ago






  • 1




    $begingroup$
    @OguzToragay You can only win once, and you have to accept the first reward that you win. Edited the description to clarify.
    $endgroup$
    – sedge
    9 hours ago















5














$begingroup$


You have a ticket allowing you to visit up to $n$ of $M$ carnival booths offering games of chance. At each booth you have probability $p_i$ of winning a reward with average value $r_i$. Each booth can be visited once at most. You can visit the booths in any order but you can only win once, and you have to accept the first reward that you win. The objective is to select the subset of booths to visit, and the order in which to visit them, to maximize the expected reward.



It's easy enough to solve the problem using brute force. The expected value of the reward can be calculated for each permutation of $n$ of $M$ booths.



What would be a more computationally feasible way to formulate the problem?










share|improve this question









New contributor



sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$















  • $begingroup$
    Hi sedge and welcome to OR.SE, I couldn't understand this part: "You can visit the booths in any order but once you win a reward you are finished playing." Can you explain a little more?
    $endgroup$
    – Oguz Toragay
    9 hours ago






  • 1




    $begingroup$
    @OguzToragay You can only win once, and you have to accept the first reward that you win. Edited the description to clarify.
    $endgroup$
    – sedge
    9 hours ago













5












5








5





$begingroup$


You have a ticket allowing you to visit up to $n$ of $M$ carnival booths offering games of chance. At each booth you have probability $p_i$ of winning a reward with average value $r_i$. Each booth can be visited once at most. You can visit the booths in any order but you can only win once, and you have to accept the first reward that you win. The objective is to select the subset of booths to visit, and the order in which to visit them, to maximize the expected reward.



It's easy enough to solve the problem using brute force. The expected value of the reward can be calculated for each permutation of $n$ of $M$ booths.



What would be a more computationally feasible way to formulate the problem?










share|improve this question









New contributor



sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$




You have a ticket allowing you to visit up to $n$ of $M$ carnival booths offering games of chance. At each booth you have probability $p_i$ of winning a reward with average value $r_i$. Each booth can be visited once at most. You can visit the booths in any order but you can only win once, and you have to accept the first reward that you win. The objective is to select the subset of booths to visit, and the order in which to visit them, to maximize the expected reward.



It's easy enough to solve the problem using brute force. The expected value of the reward can be calculated for each permutation of $n$ of $M$ booths.



What would be a more computationally feasible way to formulate the problem?







modeling combinatorial-optimization stochastic-programming dynamic-programming






share|improve this question









New contributor



sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.










share|improve this question









New contributor



sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








share|improve this question




share|improve this question



share|improve this question








edited 8 hours ago









TheSimpliFire

2,6547 silver badges39 bronze badges




2,6547 silver badges39 bronze badges






New contributor



sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








asked 9 hours ago









sedgesedge

262 bronze badges




262 bronze badges




New contributor



sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.




New contributor




sedge is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • $begingroup$
    Hi sedge and welcome to OR.SE, I couldn't understand this part: "You can visit the booths in any order but once you win a reward you are finished playing." Can you explain a little more?
    $endgroup$
    – Oguz Toragay
    9 hours ago






  • 1




    $begingroup$
    @OguzToragay You can only win once, and you have to accept the first reward that you win. Edited the description to clarify.
    $endgroup$
    – sedge
    9 hours ago
















  • $begingroup$
    Hi sedge and welcome to OR.SE, I couldn't understand this part: "You can visit the booths in any order but once you win a reward you are finished playing." Can you explain a little more?
    $endgroup$
    – Oguz Toragay
    9 hours ago






  • 1




    $begingroup$
    @OguzToragay You can only win once, and you have to accept the first reward that you win. Edited the description to clarify.
    $endgroup$
    – sedge
    9 hours ago















$begingroup$
Hi sedge and welcome to OR.SE, I couldn't understand this part: "You can visit the booths in any order but once you win a reward you are finished playing." Can you explain a little more?
$endgroup$
– Oguz Toragay
9 hours ago




$begingroup$
Hi sedge and welcome to OR.SE, I couldn't understand this part: "You can visit the booths in any order but once you win a reward you are finished playing." Can you explain a little more?
$endgroup$
– Oguz Toragay
9 hours ago




1




1




$begingroup$
@OguzToragay You can only win once, and you have to accept the first reward that you win. Edited the description to clarify.
$endgroup$
– sedge
9 hours ago




$begingroup$
@OguzToragay You can only win once, and you have to accept the first reward that you win. Edited the description to clarify.
$endgroup$
– sedge
9 hours ago










2 Answers
2






active

oldest

votes


















2
















$begingroup$

To get $O(2^M M^2)$ instead of $O(M!)$, you could modify the dynamic programming formulation of the traveling salesman problem, with a state for each subset of booths visited so far.






share|improve this answer












$endgroup$






















    2
















    $begingroup$

    As it is explained here, this problem is a portfolio selection problem. The player should select the first $n$ booths with the maximum $E(g_i)=p_i times r_i$ in which $E(g_i)$ represents the expected value of gain, then among the selected $n$ booths, the player should start playing the games from the one with maximum $r_i$. In other words, the selected booths should be sorted based on their $r_i$ value (from max to min value).



    Edit(example):



    Consider $M=3, n=2, r_1=100, p_1=0.6, r_2=50, p_2=0.95, r_3=1000, p_3=0.1$ with these data, we have:



    $E(g_1)=60 , E(g_2)=47.5 textand E(g_3)=100$



    the sorted list of choices for the player is:



    $$list=(3,1)$$



    Also consider that in this type of problem, the solution is related to the amount of risk that the player willings to take. This relation can be explained as the tradeoff between the risk value and expected total gain.






    share|improve this answer












    $endgroup$














    • $begingroup$
      Can you describe which algorithm or technique you would suggest for selecting the first n booths with the maximum $E(g_i)$? There are many listed in the linked presentation. It would be appreciated if you could narrow it down to a good starting point.
      $endgroup$
      – sedge
      7 hours ago










    • $begingroup$
      @sedge I edited my answer and included a small example. Please check it again.
      $endgroup$
      – Oguz Toragay
      7 hours ago






    • 1




      $begingroup$
      Based on the example I see what you mean now. (Although the variables are switched compared to the OP: p is now the reward and r is the probability). So, would the number of computations needed be only M, or $p_i$ x $r_i$ for each i in M? If so that is very simple indeed.
      $endgroup$
      – sedge
      7 hours ago













    Your Answer








    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "700"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/4.0/"u003ecc by-sa 4.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );







    sedge is a new contributor. Be nice, and check out our Code of Conduct.









    draft saved

    draft discarded
















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2for.stackexchange.com%2fquestions%2f2752%2foptimal-set-order-to-maximize-stochastic-reward%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown


























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    2
















    $begingroup$

    To get $O(2^M M^2)$ instead of $O(M!)$, you could modify the dynamic programming formulation of the traveling salesman problem, with a state for each subset of booths visited so far.






    share|improve this answer












    $endgroup$



















      2
















      $begingroup$

      To get $O(2^M M^2)$ instead of $O(M!)$, you could modify the dynamic programming formulation of the traveling salesman problem, with a state for each subset of booths visited so far.






      share|improve this answer












      $endgroup$

















        2














        2










        2







        $begingroup$

        To get $O(2^M M^2)$ instead of $O(M!)$, you could modify the dynamic programming formulation of the traveling salesman problem, with a state for each subset of booths visited so far.






        share|improve this answer












        $endgroup$



        To get $O(2^M M^2)$ instead of $O(M!)$, you could modify the dynamic programming formulation of the traveling salesman problem, with a state for each subset of booths visited so far.







        share|improve this answer















        share|improve this answer




        share|improve this answer



        share|improve this answer








        edited 8 hours ago

























        answered 8 hours ago









        Rob PrattRob Pratt

        1,4671 silver badge12 bronze badges




        1,4671 silver badge12 bronze badges
























            2
















            $begingroup$

            As it is explained here, this problem is a portfolio selection problem. The player should select the first $n$ booths with the maximum $E(g_i)=p_i times r_i$ in which $E(g_i)$ represents the expected value of gain, then among the selected $n$ booths, the player should start playing the games from the one with maximum $r_i$. In other words, the selected booths should be sorted based on their $r_i$ value (from max to min value).



            Edit(example):



            Consider $M=3, n=2, r_1=100, p_1=0.6, r_2=50, p_2=0.95, r_3=1000, p_3=0.1$ with these data, we have:



            $E(g_1)=60 , E(g_2)=47.5 textand E(g_3)=100$



            the sorted list of choices for the player is:



            $$list=(3,1)$$



            Also consider that in this type of problem, the solution is related to the amount of risk that the player willings to take. This relation can be explained as the tradeoff between the risk value and expected total gain.






            share|improve this answer












            $endgroup$














            • $begingroup$
              Can you describe which algorithm or technique you would suggest for selecting the first n booths with the maximum $E(g_i)$? There are many listed in the linked presentation. It would be appreciated if you could narrow it down to a good starting point.
              $endgroup$
              – sedge
              7 hours ago










            • $begingroup$
              @sedge I edited my answer and included a small example. Please check it again.
              $endgroup$
              – Oguz Toragay
              7 hours ago






            • 1




              $begingroup$
              Based on the example I see what you mean now. (Although the variables are switched compared to the OP: p is now the reward and r is the probability). So, would the number of computations needed be only M, or $p_i$ x $r_i$ for each i in M? If so that is very simple indeed.
              $endgroup$
              – sedge
              7 hours ago
















            2
















            $begingroup$

            As it is explained here, this problem is a portfolio selection problem. The player should select the first $n$ booths with the maximum $E(g_i)=p_i times r_i$ in which $E(g_i)$ represents the expected value of gain, then among the selected $n$ booths, the player should start playing the games from the one with maximum $r_i$. In other words, the selected booths should be sorted based on their $r_i$ value (from max to min value).



            Edit(example):



            Consider $M=3, n=2, r_1=100, p_1=0.6, r_2=50, p_2=0.95, r_3=1000, p_3=0.1$ with these data, we have:



            $E(g_1)=60 , E(g_2)=47.5 textand E(g_3)=100$



            the sorted list of choices for the player is:



            $$list=(3,1)$$



            Also consider that in this type of problem, the solution is related to the amount of risk that the player willings to take. This relation can be explained as the tradeoff between the risk value and expected total gain.






            share|improve this answer












            $endgroup$














            • $begingroup$
              Can you describe which algorithm or technique you would suggest for selecting the first n booths with the maximum $E(g_i)$? There are many listed in the linked presentation. It would be appreciated if you could narrow it down to a good starting point.
              $endgroup$
              – sedge
              7 hours ago










            • $begingroup$
              @sedge I edited my answer and included a small example. Please check it again.
              $endgroup$
              – Oguz Toragay
              7 hours ago






            • 1




              $begingroup$
              Based on the example I see what you mean now. (Although the variables are switched compared to the OP: p is now the reward and r is the probability). So, would the number of computations needed be only M, or $p_i$ x $r_i$ for each i in M? If so that is very simple indeed.
              $endgroup$
              – sedge
              7 hours ago














            2














            2










            2







            $begingroup$

            As it is explained here, this problem is a portfolio selection problem. The player should select the first $n$ booths with the maximum $E(g_i)=p_i times r_i$ in which $E(g_i)$ represents the expected value of gain, then among the selected $n$ booths, the player should start playing the games from the one with maximum $r_i$. In other words, the selected booths should be sorted based on their $r_i$ value (from max to min value).



            Edit(example):



            Consider $M=3, n=2, r_1=100, p_1=0.6, r_2=50, p_2=0.95, r_3=1000, p_3=0.1$ with these data, we have:



            $E(g_1)=60 , E(g_2)=47.5 textand E(g_3)=100$



            the sorted list of choices for the player is:



            $$list=(3,1)$$



            Also consider that in this type of problem, the solution is related to the amount of risk that the player willings to take. This relation can be explained as the tradeoff between the risk value and expected total gain.






            share|improve this answer












            $endgroup$



            As it is explained here, this problem is a portfolio selection problem. The player should select the first $n$ booths with the maximum $E(g_i)=p_i times r_i$ in which $E(g_i)$ represents the expected value of gain, then among the selected $n$ booths, the player should start playing the games from the one with maximum $r_i$. In other words, the selected booths should be sorted based on their $r_i$ value (from max to min value).



            Edit(example):



            Consider $M=3, n=2, r_1=100, p_1=0.6, r_2=50, p_2=0.95, r_3=1000, p_3=0.1$ with these data, we have:



            $E(g_1)=60 , E(g_2)=47.5 textand E(g_3)=100$



            the sorted list of choices for the player is:



            $$list=(3,1)$$



            Also consider that in this type of problem, the solution is related to the amount of risk that the player willings to take. This relation can be explained as the tradeoff between the risk value and expected total gain.







            share|improve this answer















            share|improve this answer




            share|improve this answer



            share|improve this answer








            edited 7 hours ago

























            answered 8 hours ago









            Oguz ToragayOguz Toragay

            4,1271 gold badge3 silver badges30 bronze badges




            4,1271 gold badge3 silver badges30 bronze badges














            • $begingroup$
              Can you describe which algorithm or technique you would suggest for selecting the first n booths with the maximum $E(g_i)$? There are many listed in the linked presentation. It would be appreciated if you could narrow it down to a good starting point.
              $endgroup$
              – sedge
              7 hours ago










            • $begingroup$
              @sedge I edited my answer and included a small example. Please check it again.
              $endgroup$
              – Oguz Toragay
              7 hours ago






            • 1




              $begingroup$
              Based on the example I see what you mean now. (Although the variables are switched compared to the OP: p is now the reward and r is the probability). So, would the number of computations needed be only M, or $p_i$ x $r_i$ for each i in M? If so that is very simple indeed.
              $endgroup$
              – sedge
              7 hours ago

















            • $begingroup$
              Can you describe which algorithm or technique you would suggest for selecting the first n booths with the maximum $E(g_i)$? There are many listed in the linked presentation. It would be appreciated if you could narrow it down to a good starting point.
              $endgroup$
              – sedge
              7 hours ago










            • $begingroup$
              @sedge I edited my answer and included a small example. Please check it again.
              $endgroup$
              – Oguz Toragay
              7 hours ago






            • 1




              $begingroup$
              Based on the example I see what you mean now. (Although the variables are switched compared to the OP: p is now the reward and r is the probability). So, would the number of computations needed be only M, or $p_i$ x $r_i$ for each i in M? If so that is very simple indeed.
              $endgroup$
              – sedge
              7 hours ago
















            $begingroup$
            Can you describe which algorithm or technique you would suggest for selecting the first n booths with the maximum $E(g_i)$? There are many listed in the linked presentation. It would be appreciated if you could narrow it down to a good starting point.
            $endgroup$
            – sedge
            7 hours ago




            $begingroup$
            Can you describe which algorithm or technique you would suggest for selecting the first n booths with the maximum $E(g_i)$? There are many listed in the linked presentation. It would be appreciated if you could narrow it down to a good starting point.
            $endgroup$
            – sedge
            7 hours ago












            $begingroup$
            @sedge I edited my answer and included a small example. Please check it again.
            $endgroup$
            – Oguz Toragay
            7 hours ago




            $begingroup$
            @sedge I edited my answer and included a small example. Please check it again.
            $endgroup$
            – Oguz Toragay
            7 hours ago




            1




            1




            $begingroup$
            Based on the example I see what you mean now. (Although the variables are switched compared to the OP: p is now the reward and r is the probability). So, would the number of computations needed be only M, or $p_i$ x $r_i$ for each i in M? If so that is very simple indeed.
            $endgroup$
            – sedge
            7 hours ago





            $begingroup$
            Based on the example I see what you mean now. (Although the variables are switched compared to the OP: p is now the reward and r is the probability). So, would the number of computations needed be only M, or $p_i$ x $r_i$ for each i in M? If so that is very simple indeed.
            $endgroup$
            – sedge
            7 hours ago












            sedge is a new contributor. Be nice, and check out our Code of Conduct.









            draft saved

            draft discarded

















            sedge is a new contributor. Be nice, and check out our Code of Conduct.












            sedge is a new contributor. Be nice, and check out our Code of Conduct.











            sedge is a new contributor. Be nice, and check out our Code of Conduct.














            Thanks for contributing an answer to Operations Research Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2for.stackexchange.com%2fquestions%2f2752%2foptimal-set-order-to-maximize-stochastic-reward%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown









            Popular posts from this blog

            Invision Community Contents History See also References External links Navigation menuProprietaryinvisioncommunity.comIPS Community ForumsIPS Community Forumsthis blog entry"License Changes, IP.Board 3.4, and the Future""Interview -- Matt Mecham of Ibforums""CEO Invision Power Board, Matt Mecham Is a Liar, Thief!"IPB License Explanation 1.3, 1.3.1, 2.0, and 2.1ArchivedSecurity Fixes, Updates And Enhancements For IPB 1.3.1Archived"New Demo Accounts - Invision Power Services"the original"New Default Skin"the original"Invision Power Board 3.0.0 and Applications Released"the original"Archived copy"the original"Perpetual licenses being done away with""Release Notes - Invision Power Services""Introducing: IPS Community Suite 4!"Invision Community Release Notes

            Canceling a color specificationRandomly assigning color to Graphics3D objects?Default color for Filling in Mathematica 9Coloring specific elements of sets with a prime modified order in an array plotHow to pick a color differing significantly from the colors already in a given color list?Detection of the text colorColor numbers based on their valueCan color schemes for use with ColorData include opacity specification?My dynamic color schemes

            Ласкавець круглолистий Зміст Опис | Поширення | Галерея | Примітки | Посилання | Навігаційне меню58171138361-22960890446Bupleurum rotundifoliumEuro+Med PlantbasePlants of the World Online — Kew ScienceGermplasm Resources Information Network (GRIN)Ласкавецькн. VI : Літери Ком — Левиправивши або дописавши її