Thinking Statistically: A Primer

Applied Statistics (Beginners)
Applied Statistics (Intermediate)
Applied Statistics (Advanced)
Thinking statistically doesn’t start with numbers or formulas, it starts with noticing.
Author

Conor O’Driscoll

Published

August 7, 2025

Thinking Statistically: A Way of Seeing the World

Most of us meet statistics in passing — a poll on the evening news, a headline about a medical breakthrough, a sports commentator flashing charts on the screen. We may not think much of it, but statistics quietly runs through almost every part of modern life: from predicting the weather, to setting housing policy, to planning bus timetables. Behind every one of those applications is the same motivation: to make better sense of the world around us, thus helping us to make better decisions in the face of uncertainty.

For social and spatial scientists, this work happens in two worlds at once. The visible world (e.g., house prices, commuting patterns, school applicatins) helps us to come up with our own ideas and theories about how the world works. Meanwhile, the world we infer (e.g., migration patterns, economic opportunity, discrimination, and efficiency) is the world we inhabit when we try to link our theories and ideas to reality.

Statistics sits at the intersection of these two worlds. Without statistics, our ideas and theories about the world around us are little more than speculations. With statistics, we can compare our perceptions of the world to a more grounded view of reality, allowing us to test and validate our own understanding. A poor match between our ideas and reality forces us to think again and reformulate our theories based on what we observe through statistics, while a good match might signal that we have a good intuitive understanding of what is going on.

Implicit in this is the notion that, in order to be meaningful, our ideas and theories of the world around us must yield plausible predictions; predictions which we can evaluate and assess using statistics. If our theories and ideas to do not tell us what we should expect to observe, or if the predictions are so general that any data will conform with our theories (e.g., astrology), then evidently our theories and ideas are no good.

Decision Making and Asking Better Questions

With this in mind, statistics is also a valuable tool we can use to steer our way through a complex world and make decisions about the “best” actions to take. We can use statistical methods to extract information from data we have collected to describe how the world is behaving and infer how we might wish to respond. Therefore, statistics plays a fundamental role in tying observations about the world around us to our ideas and understanding of that world, providing a disciplined way to make sense of a messy reality.

The kind of thinking involved in Statistics may seem quite different from what you are used to. But once we strip away the jargon and examine what it is we are really doing, you will quickly realize that the kind of thinking involved in statistics is not entirely new to you. In fact, many of your day-to-day assumptions and decisions already depend on statistical thinking.

When we summarize our past experience in a rough-and-ready way, or generalize from previous experiences, and use this information to make predictions about what we expect, we are subconsciously making broader sense of our lived experiences. That is why thinking statistically is not about chasing certainty, it is about navigating ambiguity with discipline. We use data to challenge our assumptions, to explore patterns we might not otherwise see, and to resist being misled by what seems obvious.

Note

Suppose I tell you that I have two friends: one is around six feet tall and the other is five feet tall. What would be your best guess as to each one’s sex?

Of course, you have not seen all men, or all women, but experience probably tells you that by-and-large, men tend to be taller than women.

Therefore, I expect that you feel fairly confident in assuming that my smaller friend is female, while my taller friend is male. In other words, in the absence of any other information, you probably think it is more likely that a tall adult is male and a small adult is female.

We do this all the time. When you say: “On average, I cycle 50km per week”, or “We can expect a lot of rain at this time of year”, or “It is more likely that you will do well in your exams if you begin studying early” you are making statistical statements even though you have performed no calculations. Indeed, each of these examples is showing off one particular type of statistical thinking; answer the questions below to figure out which.

What type of statistical thinking best corresponds to each of the following scenarios:

  1. On average, I cycle 50km per week.

  2. We can expect a lot of rain at this time of year.

  3. It is more likely that you will do well in your exams if you begin studying early.

Statistics is human-made

Because statistics involve numbers, people often assume they are hard facts handed down by nature. But numbers don’t just fall from the sky — people choose what to count, how to count it, and how to present it. That means statistics are not just facts, they are interpretations.

To think statistically is to think critically. It’s to ask questions like: Is that plausible? Compared to what? How do they know? The more you practice, the more these habits become second nature — a kind of internal, healthy skepticism paired with external curiosity.

This ability to think critically is become more important everyday. We live in an age where information is instant but trust is fragile. It is becoming increasingly difficult to tell what is true and what is not. Even though misinformation has been a problem that has existed since ancient Greece, the unique problem we face today is that it has proliferated and is closely intertwined on the internet with real information, making it sometimes difficult to identify.

Some of you who might respond by saying “it is not my job to evaluate statistics critically”. I say that it may not be your job, but it is your responsibility to own your opinions. Taking ownership of these opinions requires you to think critically about the words and numbers you encounter, and scrutinize/examine them as best you can.

Test your ability to critically evaluate statistical statements using the following examples, where TRUE = Plausible and False = Not Plausible:

  1. A crystal wine glass fell off a table onto a thick carpet without breaking.

  2. A crystal wine glass fell off the roof of a forty-storey skyscraper onto the footpath without breaking.

  3. The best salesperson in Company X makes 1,000 sales every day.

  4. The cost of a telephone has decreased by 12,000 percent since the formation of the Communication Satellite Corportation.

Problems 4 and 5 were hopefully easy for most of you.

Problem 6 is a bit trickier because it is not immediately obvious where to begin. But below I take a stab at it. Not to say that this is the one-and-only way to answer this question, but it is one illustration of how you can apply statistical thinking to assess the plausibility of a statistical statement.

Note

Assuming that it takes: i) five seconds to dial a phone number, ii) another five seconds for the phone to ring, iii) 10 seconds to deliver your sales pitch, and iv) 40 seconds to collect the buyers bank details and address. That adds up to one call per minute. Finally, lets assume that every phone call ends in a sale. Even under these highly optimistic conditions, you come out sixty sales per hour, and 480 sales in an eight hour work day with no breaks. Need I say any more?

Claim 7 is difficult to verify if you do not spot the immediate contradiction. If a cost decreases by 100 percent it means that, irrespective of the starting price, the price is now zero. If a cost decreases by 200 percent, it means that someone is paying you the same amount of money that you used to pay them for their product. Thus, while a decrease in price of 100 percent is quite rare, a decrease of 12000 percent seems wildly unlikely. You might think this is obvious, but tell that to the editors of Science (a premier scientific journal) who published such a statement.

Note

Claim 7 introduces another common error when interpreting statistics and that involves making sense of percentages.

Percentages seem so simple and incorruptable, but they are often confusing. For example, if your salary drops by 50% tomorrow, you will not “break-even” by increasing your new salary by 50 percent because the baselines have changed.

If you were recieving €1,000 each week and that suddenly dropped to €500 (a 50 percent reduction), a 50 percent increase on that pay only brings you to €750.

Similarly, if interest rates rise from 3 to 4 percent, that constitutes an increase of one percentage point, or an increase of 33 percent. Meanwhile, if interest rates drop from 4 to 3 percent, that constitutes a decrease of one percentage point, but a decrease of 25 percent. Has your head exploded yet?

Working with percentages and probabilities can be hard even for those of us with advanced degrees in such subjects. So even though we all practice statistical thinking quite regularly, that does not mean we are naturally very good at it.

In one somewhat famous case, the state of New Jersey (U.S.A) adopted legislation that denied additional benefits to mothers who have children while already on welfare. Some legislators believed that women were having babies in New Jersey simply to increase the amount on their welfare checks. Within two months, legislators were declaring that the “family cap” law was a success because births had already fallen by 16 percent, as reported in the New York Times. Now, maybe things are different in New Jersey, but I believe that it takes nine months for a pregnancy to come to term, so how can this effect possibly be attributed to the policy?

But state legislators are not the only people to make basic blunders in statistical thinking. Experts do it all the time, something demonstrated in the highly impressive Thinking, Fast and Slow by Daniel Kahneman (Nobel Laureate in Economics, 2002).

Other claims, like More people have cell phones than toilets (reported by Time Magazine in 2013) are a bit trickier because the answer depends on a host of factors, like who/how we count and what we compare. One potentially reasonable place to start in this example might be the observation that many people in the developing world do not have indoor plumbing, whereas many people in the developed world may have more than one cell phone (at least over their lifetime). From this angle, the claim seems plausible. This is not to say that we should accept it, but rather that we cannot reject it out-of-hand as being ridiculous. In this case, what matters isn’t whether you instantly know the answer, but whether you know how to start thinking about solving the problem.

Shooting the messenger

Importantly, however, not all misuses of statistics are accidents. In his book A Field Guide To Lies and Statistics, Daniel Levitin (a neuroscientist) provides a number of examples where statistics are mis-used, misinterpreted, or misapplied honestly as well as dishonestly.

This hits upon a broader point that sometimes you cannot always determine whether a statistic or claim is reliable. Sure, the people who generate and report on such statistics should do this for you, but they often do not. Sometimes, this is deliberate (i.e., politically-motivated) but other times the people generating/reporting on these statistics do not understand them all that well.

That is how runaway statistics take hold, potentially poisoning public opinion and discourse on a particular topic, further emphasizing the importance of being able to think statistically.

Take the widely reported statistic (from some years ago) that, in the U.S. 150,000 girls and young women die of annorexia each year. According to the U.S. Centers for Disease Control, the annual number of deaths from all causes for girls and women between the ages of fifteen and twenty-four (a reasonable definition for “girls and young women”?) is around 8,500. If you become more conservative in your definitions and incorporate women aged twnety-five to fourty-four, you still only get 55,000. It should be obvious that annorexia deaths in one year cannot be three times the number of all reported deaths, but it clearly was not.

In these cases, it is often very easy to blame to blame the statistician or the statistics themselves, but they are usually not the ones with concealed motivations, political or otherwise.

Moreover, when a statistic is used to measure performance, people often start “playing to the metric” — sometimes in ways that make the number look better without actually improving the thing it’s meant to measure. (Economists call this Goodhart’s Law). National accounting measures like Gross Domestic Product (GDP) are becoming increasingly good examples of i) statistical measures that are gamed for concealed motives and ii) measures which do not do what they say on the tin - which brings us to a whole new level of discussion about picking and understanding your choice of measurement…the subject of another blog.

Wrapping Up

Statistics is not just about techniques. It is about developing better instincts and organising your curiosity. It helps you ask clearer questions and give more careful answers.

It will not turn you into a calculator. But it will help you see more clearly. To doubt usefully, not cynically. To ask things like: Where did that number come from? What is the comparison? What is the alternative?

If you have ever had a hunch something was not quite right, or spotted a pattern in your surroundings, congratulations. You have already started thinking statistically.

Bibliography

  1. Statistics Without Tears: An Introduction For Non-Mathematicians, by Derek Rowntree
  2. A Field Guide To Lies and Statistics: A Neuroscientist on How to Make Sense of A Complex World, by Daniel J. Levitin
  3. Elementary Statistics For Geographers by James Burt, Gerald Barber, and David Rigby