# Benford’s Law

Take a collection of seemingly random numbers, for instance the gross domestic product of 212 countries, and then examine the leading digit. For instance, for the number \$435 million (the 2015 GDP of Tonga) the leading digit is 4. The leading digits will of course be from 1-9, since 0 cannot be a leading digit. How would you expect the digits 1-9 to be distributed? Randomly, with each digit appearing as a leading digit approximately 11 percent of the time? That is not the case. The distribution for many data sets follows Benford’s Law.

Above is the actual distribution of leading digits for the GDP of 212 countries, showing that it approximates what Benford’s Law predicts: 1 will appear as a leading digit 30.1% of the time, 2 will appear 17.6 % of the time, 3 is 12.5%, and the frequency continues to decrease. Here is the GDP data in table form:

17233.96%30.1%
23215.09%17.6%
32913.68%12.5%
42210.38%9.7%
5209.43%7.9%
6104.72%6.7%
7125.66%5.8%
8125.66%5.1%
931.42%4.6%

The result is not a quirk of the units used. The above numbers were in U.S. dollars, but you will get a similar result if the numbers are converted into rubles or yen. Similarly, the lengths of the rivers of the world follow Benford’s Law, whether they are measured in kilometers, miles, or feet. The law is scale invariant, so multiplying the numbers in a data set by any constant yields the same distribution of first digit frequencies.