Benford's Law Calculator

Enter a list of numbers into the data input field and this Benford's Law Calculator analyzes the leading digit distribution of your dataset. You'll see each digit's observed frequency compared against the expected Benford's Law probability, plus a Chi-Square statistic and p-value to help you determine whether your data is consistent with natural distributions — or potentially manipulated.

Paste or type your dataset here. Numbers can be separated by commas, spaces, or new lines. At least 50–100 values recommended for meaningful results.

Results

Chi-Square Statistic

--

P-Value

--

Verdict

--

Total Numbers Analyzed

--

D Statistic (Max Deviation)

--

Observed vs Expected Leading Digit Frequency (%)

Results Table

Frequently Asked Questions

What is Benford's Law?

Benford's Law (also called the First-Digit Law or the Newcomb–Benford Law) states that in many naturally occurring datasets, the leading digit is not uniformly distributed. Instead, the digit 1 appears as the first digit about 30.1% of the time, digit 2 about 17.6%, and so on down to digit 9 at just 4.6%. It was formalized by physicist Frank Benford in 1938 after observing this pattern across diverse data sources.

How do I test a dataset with Benford's Law?

Extract the first significant digit from each number in your dataset and count how many times each digit (1–9) appears. Compare these observed frequencies against the expected Benford's Law probabilities using a statistical test like the Chi-Square test. A high Chi-Square value or low p-value (below 0.05) suggests the data may not follow Benford's Law naturally.

How can Benford's Law help detect fraud?

When people fabricate numbers — such as in financial statements or tax returns — they often unconsciously distribute digits more uniformly or favor certain digits. Since real-world data tends to follow Benford's Law, a dataset that deviates significantly from it can be a red flag for manipulation, prompting further investigation by auditors or forensic accountants.

What kind of data follows Benford's Law?

Benford's Law works best on datasets spanning several orders of magnitude with no artificial constraints. Good examples include financial transactions, population figures, stock prices, river lengths, physical constants, election vote counts, and scientific measurements. The larger and more organically generated the dataset, the better it tends to conform.

What kind of data does NOT follow Benford's Law?

Data that is artificially constrained or uniformly distributed will not follow Benford's Law. Examples include phone numbers, zip codes, ID numbers, lottery results, heights of people (which cluster in a narrow range), and any dataset where numbers are assigned sequentially or randomly rather than arising naturally.

How do I statistically test Benford's Law?

The most common method is the Chi-Square goodness-of-fit test, which compares observed digit frequencies to expected Benford's Law frequencies. A p-value above 0.05 generally suggests the data is consistent with Benford's Law, while a p-value below 0.05 indicates a statistically significant deviation. The D statistic (maximum absolute deviation) is another useful measure.

How many numbers do I need for a valid Benford's Law test?

For statistically reliable results, you generally need at least 50–100 numbers, though many experts recommend 500 or more. Small samples may show apparent deviations purely by chance. The Chi-Square test becomes more reliable as your dataset grows, so larger datasets give more trustworthy conclusions.

What does a high Chi-Square value mean?

A high Chi-Square statistic means there is a large discrepancy between your data's observed leading-digit frequencies and what Benford's Law predicts. Combined with a low p-value (typically below 0.05), this is evidence that your dataset does not conform to Benford's Law, which may warrant closer scrutiny — though it doesn't automatically prove fraud.

More Statistics Tools