Monte Carlo
Chapter 4 of the The New Turing Omnibus is about the Monte Carlo method. I spent a fair bit of time learning about Monte Carlo methods for my PhD and have had to explain it to a few people as a result. When I read this chapter, I thought back to my favorite example, and thought I would reproduce it here.
One way to think about Monte Carlo is that it is a way to use simulations to approximate mathematical functions. They are a way to use random numbers to determine information about one or more functions that might be hard to calculate some other way.
So I like to start telling someone about Monte Carlo with a really simple example. George Fishman discusses this example in depth in his book A First Course in Monte Carlo, including a way to optimize the calculation. But I am going to stick to the super simple version.
Keep in mind that this is a simple, non-exact explanation of Monte Carlo.
Suppose that you want to calculate the value \( \pi \). We can use the function for the area of a circle which is
$$areaOfCircle(radius) = \pi \times radius^2$$
or if we move things around
$$ \pi = \frac{areaOfCircle(radius)}{radius^2} $$
So if we had a circle and could figure out its radius and area we could calculate \( \pi \). But how can we know the area of a circle. This is where the idea behind Monte Carlo comes in. Consider a square with a circle drawn inside it. The area of the square will be \( side ^ 2 \). The radius of the circle will be \( \frac{side}{2} \). So if we have the side of the square, we also know the area of the square and the radius of the circle. We don't know the area of the circle, but we can use a simulation to calculate it.
Start picking random numbers that are between 0 and the length of a side of the square. Take two of these numbers and treat them as a point in the square. Now, determine if the point is in the circle. We can do this by using the formula of a circle:
$$ x^2 + y^2 = radius^2 $$
So if we take the square of each point, add them and check if they are less than the square of the radius, the point is in the circle.
If we think about this as a probability, then the chance of a random point in the square being in the circle is the same as the ratio of the area of a square to the area of a circle. This means that:
$$ \frac{points\ we\ pick}{points\ that\ hit\ the\ circle} = \frac{area\ of\ the\ square}{area\ of\ the\ circle} $$
So our simulation is to pick random points and figure out if they are in the circle, then use the number of random points and whether they hit the circle to determine the area of the circle. Putting this idea together with the forumula for the area of a circle, we can now calculate \( \pi \) by taking:
$$ \pi = \frac{points\ that\ hit\ the\ circle}{points\ that\ we\ pick} \times \frac{side ^ 2}{ (\frac{side}{2}) ^ 2 } $$
First, remember that \( \pi \) starts out as \( 3.14159 \).
Let's try it with 10, 100, 1000, 10,000 and 100,000 guesses. These are calculated in real time when you load the page, so if you refresh the page you will get different values. You can view source on the page to see the actual code if you are interested.
guess_pi(10);
guess_pi(100);
guess_pi(1000);
guess_pi(10000);
guess_pi(100000);
What you should see is that as you try more guesses you get closer the the right value of \( \pi \). But you won't necessarily see the numbers increasing monotonically. This is one of the important lessions in learning about Monte Carlo. You usually have to run a lot of simulations, which can take a long time. How many you have to run to get a required accuracy is generally calculatable, but that is beyond the scope of this post.
I do want to mention one of the real gotcha's with Monte Carlo. In my PhD we used Monte Carlo simulations to model neutrons moving through different materials. Neutrons have a chance to scatter off of each atom in the material, but the chance of scattering and potential angle is dependent on the material. If you have an alloy or compound, with several materials, you have to model which element you hit on top of how the neutron reacts. Now, suppose that you had a material that was 99.9% pure. So approximately 1 in a 1000 collisions will be with an impurity. If you want to do an accurate simulation, you need to include enough impurity collisions to get a reasonable answer. But since you dont' get these often, you have to run many many more simulations than you would for a pure material. I am simplifying here, but I wanted to be clear that the number of simulations can be complex, and there is opportunity for optimizations.
So that is Monte Carlo. Using simulations to model a function that may not even be solvable, but can at least be evaluated in some way.