99 problems when n=1?

graphic of a pie chart — Take studies involving small sample sizes with a big grain of salt. Getty Images

Consider a coin toss.

Say you flip a quarter five times, and four of those times it comes up heads. You might chalk it up to random chance.

Now say you flip that quarter 500 times, and it lands on heads 400 times. That’s the same ratio. But here you could confidently say something was fishy with that coin (and pity the poor sucker who kept betting on tails).

We’ve entered the vexing realm of sample size, or “n” in the jargon of statistics. Any published paper on health or medical research will say how large of a population the authors studied. It might relate to patients or tissue samples, but you can always find out what n equals.

Spinning Science

Overhyped headlines, snarled statistics lead readers astray

Read our special series of articles with tips and tools to help you better understand health risks, scientific research, statistics and clinical studies.

Trouble is, sometimes journalists don’t.

We’ve all seen health-related headlines that go viral — and then turn out to have been misleading at best. Maybe a journalist writes about a study that “shows” how running does wonders for your knees, or that a cup of coffee a day sharpens your short-term memory. If those studies only involved a handful of individuals, the chance of those findings being true are reduced.

In the statistical world, size matters.

Still, small sample sizes aren’t inherently problematic, says Dr. Ted Gooley, a biostatistician at Fred Hutch. But they should give one pause — especially when interpreting results.

“Say I hear about a baseball player with a .385 batting average,” said Gooley, who falls into the not-small population of sports-loving statisticians. “If it’s over a three-game stretch, I’m not that impressed. But it becomes a different story if it’s over an entire season.”

Gooley has designed, and authored papers on, hundreds of clinical trials for patients with cancer. Many of them are early-phase trials that involve small numbers of patients. He points to a recent study he collaborated on with Fred Hutch’s Dr. Aude Chapuis. It found that engineered immune cells prevented relapse in 12 patients with high-risk acute myeloid leukemia.

That is clearly a small sample size. But in the context of high-risk AML — where most patients will relapse — the finding is striking, Gooley said.

“That is not what you would have expected given the types of patients they were treating,” he said. “A small study may not be able to offer definitive proof of an effect, but it can offer evidence of a potential signal that something’s going on, and that can be a reason to look at this further,” he said. “And the ultimate goal of these smaller studies is to get to a larger, randomized study, which is the gold standard for showing definitive proof of a meaningful treatment effect.”

Anything less than that gold standard, and there’s room for doubt. So be vigilant when reading about studies with small sample sizes, he said. And always be skeptical when something surprising is reported.

But sometimes surprising things do happen, he added:

“Who knows? Maybe the Mariners will win the World Series someday. Rare events do happen.”

Help Us Eliminate Cancer

Every dollar counts. Please support lifesaving research today.

99 problems when n=1?

Spinning Science

jake-siegel_

reprint-republish

Related News

Help Us Eliminate Cancer