The @RISK and RISKOptimizer manuals state, “We recommend using Latin Hypercube, the default sampling type setting, unless your modeling situation specifically calls for Monte Carlo sampling.” But what’s the actual difference?
Monte Carlo sampling refers to the traditional technique for using random or pseudo-random numbers to sample from a probability distribution. Monte Carlo sampling techniques are entirely random in principle — that is, any given sample value may fall anywhere within the range of the input distribution. With enough iterations, Monte Carlo sampling recreates the input distributions through sampling. A problem of clustering, however, arises when a small number of iterations are performed.
Each simulation in @RISK or RISKOptimizer represents a random sample from each input distribution. The question naturally arises, how much separation between the sample mean and the distribution mean do we expect? Or, to look at it another way, how likely are we to get a sample mean that’s a given distance away from the distribution mean?
The Central Limit Theorem of statistics (CLT) answers this question with the concept of the standard error of the mean (SEM). One SEM is the standard deviation of the input distribution, divided by the square root of the number of iterations per simulation. For example, with RiskNormal(655,20) the standard deviation is 20. If you have 100 iterations, the standard error is 20/√100 = 2. The CLT tells us that about 68% of sample means should occur within one standard error above or below the distribution mean, and 95% should occur within two standard errors above or below. In practice, sampling with the Monte Carlo sampling method follows this pattern quite closely.
By contrast, Latin Hypercube sampling stratifies the input probability distributions. With this sampling type, @RISK or RISKOptimizer divides the cumulative curve into equal intervals on the cumulative probability scale, then takes a random value from each interval of the input distribution. We no longer have pure random samples and the CLT no longer applies. Instead, we have stratified random samples. The effect is that each sample (the data of each simulation) is constrained to match the input distribution very closely.
Therefore, for even modest sample sizes, the Latin Hypercube method makes all or nearly all of the sample means fall within a small fraction of the standard error. This is usually desirable, particularly in @RISK when you are performing just one simulation. And when you’re performing multiple simulations, their means will be much closer together with Latin Hypercube than with Monte Carlo; this is how the Latin Hypercube method makes simulations converge faster than Monte Carlo..
The attached Excel workbook lets you explore the difference in the distribution of sample means between the Monte Carlo and Latin Hypercube sampling methods. Select your sample size and number of simulations and click “Run Comparison”. If you wish, you can change the mean and standard deviation of the input distribution, or even select a completely different distribution to explore. Under every combination we’ve tested, the sample means are much closer together with the Latin Hypercube sampling method than with the Monte Carlo method.
Extra material: here