Saturday, April 25, 2020

Wow. How did so many of us miss this?

I've tried to stick to my plan to write only one post about COVID-19 but it's been hard because there are so many data issues to talk about. As an aside - I think that StatNews is doing a pretty good job covering things.

But then I ran across this Wired story by Ferris Jabr. You should read it yourself but I'll be nice and quote the main point:

"Both newspapers and scientific journals frequently state three facts about the Spanish flu: it infected 500 million people (nearly one-third of the world population at the time); it killed between 50 and 100 million people; and it had a case fatality rate of 2.5 percent. This is not mathematically possible. Once a pandemic is over and all the numbers are tallied, its case fatality rate is simply the total number of deaths divided by the total number of recorded cases. Each country and city will have its own CFR, but it’s also common to calculate a global average. If the Spanish flu infected 500 million and killed 50 to 100 million, the global CFR was 10 to 20 percent. If the fatality rate was in fact 2.5 percent, and if 500 million were infected, then the death toll was 12.5 million. There were 1.8 billion people in 1918. To make 50 million deaths compatible with a 2.5 percent CFR would require at least two billion infections—more than the number of people that existed at the time."

Wow. How did we all miss this? Are we so innumerate that we didn't see 500 million and 50 million and immediately say "Hey, that's 10% not 2.5%"? Shame on us.
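
For anyone who wants to check the arithmetic in the quote, here is a minimal Python sketch. The figures come straight from the quoted passage; the function and variable names are illustrative only, and the calculation assumes the simple "deaths divided by cases" definition of CFR that Jabr describes.

```python
def case_fatality_rate(deaths, cases):
    """CFR as described in the quote: total deaths divided by total cases."""
    return deaths / cases

# Figures as stated in the quoted passage
infected = 500_000_000
deaths_low, deaths_high = 50_000_000, 100_000_000
claimed_cfr = 0.025
world_population_1918 = 1_800_000_000

# CFR implied by the infection and death counts
cfr_low = case_fatality_rate(deaths_low, infected)
cfr_high = case_fatality_rate(deaths_high, infected)

# Deaths implied by the claimed CFR and the infection count
implied_deaths = claimed_cfr * infected

# Infections needed for the low death estimate to match the claimed CFR
required_infections = deaths_low / claimed_cfr

print(f"Implied CFR: {cfr_low:.0%} to {cfr_high:.0%}")
print(f"Deaths implied by a 2.5% CFR: {implied_deaths / 1e6:.1f} million")
print(f"Infections required: {required_infections / 1e9:.1f} billion")
print(f"More than the 1918 population? {required_infections > world_population_1918}")
```

Any two of the three "facts" pin down the third, which is why they cannot all be true at once.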

Beyond pointing out that none of us are paying careful attention, Jabr digs into the history behind these numbers and uncovers a lot of uncertainty about the Spanish Flu.

So here's where we stand.

  1. COVID-19. My original post is still correct. The data would matter greatly if we had it. But we don't. It's getting better but it's still inconsistent and unclear and we're still facing extensive uncertainty.
  2. Spanish flu. This data would also matter greatly if we had it. But we don't have good data and, at this point in history, we never will.
The take-away? We need to get more comfortable with margins of error and ranges of estimates. Data literacy should emphasize the need to look beyond simple point estimates.

Along those lines, I've just started a simulation unit in one of my classes. Simulation is a great tool for dealing with high levels of uncertainty. If you want to see my opening lesson, it's right here:

Tuesday, April 14, 2020

Interesting Shifts in Consumer Spending

From one of my favorite blogs: https://flowingdata.com/2020/04/12/change-in-consumer-spending-since-the-virus/

To me, the most fascinating aspect of this graph is that "groceries" initially jumped nearly 50% but now appears to be trending back to pre-stay-at-home levels while all other categories continue to decrease.

Much of the drop may be due to the sudden increase in unemployment but we can hope that some of it is due to people simply not having opportunities to spend. If people are saving money now, then maybe they'll splurge when this is over and help revive the economy.

Sunday, April 12, 2020

If it saves just one life...

How many times have you heard someone say "If it saves just one life, then it's worth it"?

That's one of my pet peeves because it's blatantly wrong. We will not do whatever it takes to "save just one life".

Consider auto accidents. Just under 40,000 people are killed each year in traffic accidents in the US. We could eliminate nearly all of those with just two regulations:

  • Speed Limit: A strictly enforced national speed limit of 10 miles per hour on all roads of all types.
  • Safety Equipment: Require all passengers and drivers to use fire suits, helmets, and NASCAR style five-point safety harnesses.
Of course, there's no way that we're going to do that. The cost is too high. It would save tens of thousands of lives but we won't do it.

Maybe that's not enough lives or maybe the rules would be too hard to enforce. So here's another idea.

About 5,000 pedestrians and 800~900 cyclists are killed by motorized vehicles each year. Instead of those two suggested rules, we should simply outlaw motorized vehicles completely. Then we'd save about 45,000 lives combined from traffic accidents, vehicle-pedestrian accidents, and vehicle-cyclist accidents.

By banning all motorized vehicles, we'd also stop over 500 boating deaths, several hundred ATV deaths per year, and all plane crash deaths.

Now we're talking WAY more than "just one life" so surely it's worth it. Isn't it? No, it's not. If someone actually attempted to outlaw motorized vehicles, they wouldn't get very far.

Maybe people who say "it's worth it" mean "it's worth it, as long as you don't mess with our transportation system".

OK. Let's explore that.

Between 2012 and 2016, there were over 176,000 home structure fires started by cooking activities. These led to over 5,000 fire injuries and 530 deaths. That's over 100 deaths per year caused by cooking at home. Therefore, we should outlaw cooking at home. If it saves over 100 lives, then it's worth it.

We also need to outlaw sports. Obviously boxing is dangerous with over 500 deaths in the ring since 1884. Football has to go too. There's been only one death on the field in the NFL but there are several every year in youth, high school, and college football. Several. Every single year.

We can't switch over to soccer either. Or hockey. Or basketball. Fatality rates in some of these sports aren't high but they're not zero and "if it saves just one life, then it's worth it". All sports would need to be banned.

Any one of us could easily come up with more examples, but I think you get my point. There are limits to what we will do, pay, give up, etc. to save lives. There always have been and always will be.

So what do people mean when they say "if it saves just one life, then it's worth it"? In some cases, they're probably not thinking rationally. It's a statement made in an emotionally stressful moment and, if they thought about it, they wouldn't say it.

However, I think it's more often an attempt to manipulate. When they say "if it saves just one life..." what they're actually saying is "this rule/policy/expenditure is really important to me and I don't want anyone to argue with me or do any sort of cost/benefit analysis". 

It's really just an attempt to cut off discussion and ignore the data. When that happens, I call them on it because - data matters.