By Mark Dexter
29th January 2015

Or is it ‘lies, damn lies and statistics’ depending on your stance?

I’m sure most of you in the Information Management market will have read the book Freakonomics by Steven Levitt and Stephen J. Dubner. After all, it’s of general interest to anyone who works with or is interested in data, and it has been around for a while.

But just in case you haven’t read it, here’s a brief synopsis. Published in 2005 and having sold more than 5.5 million copies, this non-fiction book examines a number of every-day beliefs and applies the authors’ brand of economics to debunking the popular thinking on areas such as cheating by teachers in exams, sumo wrestling and crime figures. The underlying tenet of all of the findings is that people (and for that matter all intelligent animals) are motivated by incentives, or put another way, rewards, for their actions. And they come up with some pretty radical reappraisals of commonly held beliefs. They use data mining techniques to uncover unusual patterns and analyse this data with a view to giving answers to questions that intrigue them.

What struck me when reading this in the age of Big Data and data Science is how much fun a group of data scientists could have by coming up with their own interpretations. It’s easy to read and understand a new radical reason as to why crime figures in the US fell in the mid ‘90s; the legalisation of abortion in the 70s, natch. I thought this was an amazing conclusion but was fully convinced by the arguments put forward.

But there’s a whole host of other data to analyse here (prison population, the economy etc) and would today’s data science techniques and technologies find another pattern here? Or why can Sumo wrestlers with a less good record suddenly pull out all the stops and win their final qualifying bout against a better opponent? Match fixing, of course.

But is this the case? Did the authors’ noughties data mining and analysis techniques really get to the (ample) bottom if it? Or could their conclusion be unpicked by a re analysis of the available data? Perhaps just good old effort on the part of the underdog who really needed to win to progress? But how do you quantify this from data. Data Science anyone?

Just ask the people involved in the ‘is global warming caused by man’ debate; everyone seems to have a different answer based on whatever data they are looking at but in this case politics often plays a huge part in which side of the fence you come down on.

So I come back to my question about what today’s data scientists might uncover from the data behind Freakonomics’ answers. Would today’s technology along with modern data science thinking change the results? 

Freakonomics

What do you think? Have you read the book and/or its follow ups? Did you wonder how you might have come up with different answers? If you work in the data science field have you tried to take a new look at the results?

Comments

Currently there are no comments. Be the first to post one!

Post Comment

*
*
*

4 key questions for recent grads

It’s reaching that crunch time for students across the country, attempting to scram in every last detail as you revise for your final exams having submitted your thesis and dissertations! You’re about to graduate and enter the world of employment... Read More

Will AI take our job?

Artificial Intelligence (AI) is still in its infancy; while the image Hollywood gives us of robots roaming our cities and becoming smarter than humans seems a long while off (if it ever happens) AI is still here, mostly in the... Read More

Applications of real-time analytics

Many companies across the word are using real-time analytics and big data to make better decisions, understand their customers and stay ahead of the competition. Real-time analytics allow a business to fully understand what is happening at the time of... Read More

4 ways for an effective job search

Looking for a new job can be a stressful time, no matter the reason why you are looking. But following a few simple steps can be the difference between a stressful job search or one that is more efficient and... Read More

Where should we send our newsletter?

Close