Jump to content

Monty Hall problem

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Everyking (talk | contribs) at 05:52, 15 July 2004 (Anecdotes). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

File:Monty-hall.png
The Monty Hall problem

The Monty Hall problem is a puzzle in probability that is loosely based on the American game show Let's Make a Deal; the name comes from the show's host Monty Hall. In this puzzle a contestant is shown three closed doors; behind one is a car, and behind each of the others is a goat. The contestant chooses one door and will be allowed to keep what is behind it. Before the door is opened, however, the host opens one of the other doors and shows that there is a goat behind it. Should the contestant stick with the original choice or change to the remaining door; or does it make no difference?

The question has generated heated debate. As the standard answer appears to contradict elementary ideas of probability, it may be regarded as a paradox. As the answer relies on assumptions that are not in the statement of the puzzle and are not obvious, it may also be considered a trick question.

Problem and solution

The problem

The solution to the problem depends on what assumptions are made. Here is a famous statement of the problem, from a letter from Craig F. Whitaker to Marilyn vos Savant's column in Parade Magazine in 1990:

Suppose you're on a game show, and you're given the choice of three doors: Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what's behind the doors, opens another door, say No. 3, which has a goat. He then says to you, "Do you want to pick door No. 2?" Is it to your advantage to switch your choice? [cited from Bohl, Liberatore & Nydick]

The same puzzle appeared as the "three prisoners problem" in Martin Gardner's column in 1959. (The "Monty Hall" connection also may predate 1990.) Gardner's version is superior as a mathematical puzzle in that it makes the procedure explicit, avoiding the unstated assumptions in the version given here.

The standard solution

The answer to this problem is yes: the chance of winning the prize is doubled when the player switches to another door rather than sticking with the original choice. This is because upon the original choice, the player has only a 1/3 chance of choosing the door with the prize; there is a probability of 2/3 that it is behind one of the other doors. These probabilities do not change when Monty opens a door with a goat, but now it is clear which door has the car if the player has chosen a goat. Hence the chances of winning the prize are 1/3 if the player sticks to the original choice, and 2/3 if the player switches.

Assumptions

The classical answer presented above makes two assumptions that are rarely made explicit:

  • Monty always opens a door
  • There is always a goat behind the door Monty opens

If one of these assumptions is violated, the answer is different. In the first case, it could be that Monty does not always open a losing door. Maybe in some shows he does, while on other occasions he does not, simply giving the contestant whatever is behind his first choice door. (This in fact is more like the original procedure on the game show.) If that is the case, it all depends on Monty's character:

  • If Monty is a helpful person and wants you to win, then you should always switch, because the fact that Monty offered you a second chance means that your first choice was a goat.
  • If Monty is mean or received pressure from his TV station, you should always stick. The fact that Monty offered you a second chance only means that he wants to lure you away from your correct first choice. On the other hand, if Monty is truly devious, he could on very rare occasions open a door even if the contestant's initial choice was incorrect, just to fool future contestants.

In the second case, Monty may open an unpicked door at random, rather than always opening a door with a goat behind it. In this case, if Monty happens to open a door with a goat behind it, then both switcher and sticker have a 50% chance of winning, so it doesn't matter what you do. This is because a correct initial guess means that Monty is certain to open a door with a goat behind it, whereas if the initial guess was incorrect there is only a 50% chance that Monty will open a door with a goat behind it.

There are further assumptions implicit in the puzzle; for instance, that a goat is worth less than the car. One makes this sort of assumption automatically, but of course it could be false.

More significantly, it is assumed that when the contestant chooses the door that conceals the car, Monty will select one of the other two at random. If, instead, he always chooses door 3 in such a case, then his choosing it gives no information, and the contestant's chance is even-money with either door 1 or door 2. However if he chooses door 2 then changing is a guaranteed winning strategy. In fact regardless of Monty's (secret) method of choosing which door to open if you choose the car, switching will not make you worse off, and will on average (for a given method) gain you 1/3 of the difference between a car and a goat.

Aids to understanding

  • It may be easier for the reader to appreciate the result by considering a hundred doors instead of just three. In this case there are 99 doors with goats behind them and 1 door with a prize. The contestant picks a door; 99 out of 100 times the contestant will pick a door with a goat. Monty then opens 98 of the other doors revealing 98 goats and offers the contestant the chance to switch to the other unopened door. On 99 out of 100 occasions the door the contestant can switch to will contain the prize as 99 out of 100 times the contestant first picked a door with a goat. At this point a rational contestant should always switch.
  • Another way of phrasing why the player should switch: By switching, the player is ensuring that he will win if he originally picked a goat. The probability of picking a goat was 2/3, so the player should switch.
  • Instead of one door being opened and thus eliminated from the game, it may equivalently be regarded as combining two doors into one, as a door containing a goat is essentially the same as a door with nothing behind it. In essence, this means the player has the choice of either sticking with their original choice of door, or choosing the sum of the contents of the two other doors. Clearly, the chances of the prize being in the other two doors is twice as high. Notice how the above assumptions play a role here: The reason switching is equivalent to taking the combined contents is that Monty Hall is required to open a door with a goat.
  • It may also help to think in terms of why Monty chooses a particular door. Before Monty opens any doors, the contestant knows that each door is equally likely to have the prize behind it, therefore in about one third of the games the contestant will be lucky and choose the door with the prize. However Monty's choice of door adds information which the contestant can make use of. Monty will not choose the contestant's door, but which of the other two he chooses depends upon what is behind the contestant's door.
In about one third of the games, the contestant's chosen door will be the prize door and Monty will choose one of the other two doors at random. A sticking contestant will win and a switching contestant will lose when this happens, no matter which door Monty picks, since the unchosen door must be a losing door.
In about two thirds of the games, the contestant's door will be a losing door and Monty will have to choose the other losing door. Therefore the unchosen door is the prize door in these cases. A sticking contestant will lose and a switching contestant will win when this happens.
A contestant may now choose to stick or to switch, knowing that the unchosen door is a loser in one third of the games and a winner in two thirds of the games. Knowing this, a rational contestant will always switch.
  • For the least reliance on verbiage and the most on formal mathematics, an approach using Bayes' theorem may be best. It also makes explicit the effect of the assumptions given earlier. Consider the position when door 1 has been chosen and no door has been opened. The probability that the car is behind door 2, p(C2), is plainly 1/3, as it may equally well be in any of the three places. The probability that Monty will open door 3, p(O3), is 1/2; if there can be any doubt, enumeration of cases will confirm this. But when the car is behind door 2, Monty will certainly open door 3, by the assumptions; that is, p(O3|C2) = 1. Hence the probability that the car is behind door 2 given that Monty opens door 3 is
    p(C2|O3) = p(O3|C2) * p(C2) / p(O3)
             = 1        * 1/3   / (1/2)
             = 2/3


See Empirical proof of the Monty Hall problem for a Perl program which demonstrates the result.

  • Imagine a scenario in which Contestant A chooses a door. Monty then opens a goat door. And Contestant B opens the remaining door. Since the first contestant will choose the car door only 1 in 3 times, the second contestant will win the car 2 out of 3 times. Thus, the car is behind the remaining door 2 out of 3 times.

Variants

In the original game show, there were in fact two contestants. Both of them chose a door; they were not allowed to choose the same one. Monty then eliminated a player with a goat behind their door (if both players had a goat, one was eliminated randomly, without letting the players know about it), opened the door and then offered the remaining player a chance to switch. Should the remaining player switch?

The answer is no. The reason: a switcher in this game will lose if and only if either of two initial choices of the two contestants was correct. How likely is that? Two-thirds. A sticker will win in those 2/3 of the cases. So stickers will win twice as often as switchers.

Altenatively by enumerating possibilities (again you play 1 and the other player plays 2)

         Ejected Player  Probability    Switch Strategy     Stick Strategy
 1 2 3    
 C G G   2               1/3            Lose                Win    
 G C G   1               1/3            Lose                Lose   
 G G C   1               1/6            Win                 Lose
         2               1/6            Lose                Lose

Player 1 wins 1/3 of the time with the stick strategy, or 1/6 of the time with the switching strategy. Half the time he is eliminated. Given that he is not eliminated there is 2/3 probability of winning with the sticking strategy.

There is a generalization of the original problem to n doors: in the first step, you choose a door. Monty then opens some other door that's a loser. If you want, you may then switch your allegiance to another door. Monty will then open an as yet unopened losing door, different from your current preference. Then you may switch again, and so on. This continues until there are only two unopened doors left: your current choice and another one. How many times should you switch, and when, if at all?

The answer is: stick all the way through with your first choice but then switch at the very end. This was proved by Bapeswara Rao and Rao. Essentially the plan is to extract the maximum amount of information. Your odds of winning will be (n-1)/n.

An unrelated paradox involving infinite sequences of actions is sometimes called the Monty Hell problem.

Origins

The game used in the Monty Hall problem is similar to three card monte, a gambling game in which the player has to find a single winning card among three face-down cards. As in the Monty Hall problem, the dealer knows where the winning card is, but here the dealer always tries to trick the player into picking the wrong card. As the card is often a Queen court card, it is also known as Find the Lady.

An older puzzle in probability theory involves three prisoners, one of whom (already chosen at random but unknown to the prisoners) is to be executed in the morning. The first prisoner begs the guard to tell him which of the other two will go free, arguing that this reveals no information about whether he will be the victim; the guard responds by claiming that if the prisoner knows that a specific one of the other two prisoners will go free it will raise the first prisoner's subjective chance of being executed from 1/3 to 1/2. The question is whether the analysis of the prisoner or the guard is correct. In the version given by Martin Gardner, the guard then performs a particular randomizing procedure for selecting which name to give the prisoner; this gives the equivalent of the Monty Hall problem without the usual ambiguities in its presentation.

Anecdotes

After this problem's solution was discussed in Marilyn vos Savant's "Ask Marilyn" question-and-answer column of Parade magazine in 1990, many readers including several math professors wrote in to declare that her solution was wrong. An equally contentious discussion of Marilyn's discussion took place in Cecil Adams's column The Straight Dope.

The Monty Hall problem is elegantly discussed, from the perspective of a boy with Asperger's syndrome, in The Curious Incident of the Dog in the Night-Time, a 2003 novel by Mark Haddon.

References

  • Bapeswara Rao, V. V. and Rao, M. Bhaskara (1992). "A three-door game show and some of its variants". The Mathematical Scientist 17, no. 2, pp. 89–94
  • Bohl, Alan H.; Liberatore, Matthew J.; and Nudick, Robert L. (1995). "A Tale of Two Goats ... and a Car, or The Importance of Assumptions in Problem Solutions". Journal of Recreational Mathematics 1995, pp. 1–9.
  • Gardner, Martin (1959). "Mathematical Games" column, Scientific American, October 1959, pp. 180–182.
  • Tierney, John (1991). "Behind Monty Hall's Doors: Puzzle, Debate and Answer?", The New York Times July 21, 1991, Sunday, Section 1; Part 1; Page 1; Column 5
  • vos Savant, Marilyn (1990). "Ask Marilyn" column, Parade Magazine p. 12 (Feb. 17, 1990). [cited in Bohl et al., 1995]