August 4, 2014

Predicting blood alcohol concentration is tricky

Rasmus Bååth, who is doing a PhD in Cognitive Science, in Sweden, has written a web app that predicts blood alcohol concentrations using reasonably sophisticated equations from the forensic science literature.

The web page gives a picture of the whole BAC curve over time, but requires a lot of detailed inputs. Some of these are things you could know accurately: your height and weight, exactly when you had each drink and what it was. Some of them you have a reasonable idea about: is your stomach empty or full, and therefore is alcohol absorption fast or slow. You also need to specify an alcohol elimination rate, which he says averages 0.018%/hour but could be half or twice that, and you have no real clue.

If you play around with the interactive controls, you can see why the advice given along with the new legal limits is so approximate (as Campbell Live is demonstrating tonight).  Rasmus has all sorts of disclaimers about how you shouldn’t rely on the app, so he’d probably be happier if you don’t do any more than that with it.

Stat of the Week Competition: August 2 – 8 2014

Each week, we would like to invite readers of Stats Chat to submit nominations for our Stat of the Week competition and be in with the chance to win an iTunes voucher.

Here’s how it works:

  • Anyone may add a comment on this post to nominate their Stat of the Week candidate before midday Friday August 8 2014.
  • Statistics can be bad, exemplary or fascinating.
  • The statistic must be in the NZ media during the period of August 2 – 8 2014 inclusive.
  • Quote the statistic, when and where it was published and tell us why it should be our Stat of the Week.

Next Monday at midday we’ll announce the winner of this week’s Stat of the Week competition, and start a new one.

(more…)

August 2, 2014

When in doubt, randomise

The Cochrane Collaboration, the massive global conspiracy to summarise and make available the results of clinical trials, has developed ‘Plain Language Summaries‘ to make the results easier to understand (they hope).

There’s nothing terribly noticeable about a plain-language initiative; they happen all the time.  What is unusual is that the Cochrane Collaboration tested the plain-language summaries in a randomised comparison to the old format. The abstract of their research paper (not, alas, itself a plain-language summary) says

With the new PLS, more participants understood the benefits and harms and quality of evidence (53% vs. 18%, P < 0.001); more answered each of the five questions correctly (P ≤ 0.001 for four questions); and they answered more questions correctly, median 3 (interquartile range [IQR]: 1–4) vs. 1 (IQR: 0–1), P < 0.001). Better understanding was independent of education level. More participants found information in the new PLS reliable, easy to find, easy to understand, and presented in a way that helped make decisions. Overall, participants preferred the new PLS.

That is, it worked. More importantly, they know it worked.

Briefly

  • Nicely balanced Guardian article on genetic sequencing in rare diseases

Some diagnoses open up a whole world of support groups and treatments for patients who previously had no name to put to their disorder. But even when there is no treatment, no sudden rush of hope and relief, families are grateful for the diagnosis.

  • Andrew Gelman:

    The result is what I call the “scientific surprise” two-step: (1) When defending the plausibility of your results, you emphasize that they are just as expected from a well-estabilished scientific theory with a rich literature; (2) When publicizing your results… you emphasize the novelty and surprise value.

  • From the copyediting blog “Ten Minutes Past Deadline”, an example of how statistical errors avoid getting into print (on a good day, God willing and the creek don’t rise)

 

August 1, 2014

The algorithm of life

“…Unlike people, computer programs aren’t embarrassed by their prejudices and don’t try to hide them.” Thomas Lumley discusses the algorithm-based tools that sift data about us and our behaviour in this week’s New Zealand Listener. Click here for the full story.

 

July 31, 2014

Briefly

‘This is statistics’ website

The American Statistical Association is launching a public relations campaign to make people think statistics is less boring and pointless, which is good:

We want students and parents to have a better understanding of a field that is often unknown or misunderstood. Statistics is not just a collection of numbers or formulas. It’s not just lines, bars or points on a graph. It’s not just computing. Statistics is so much more. It’s an exciting—even fun—way of looking at the world and gaining insights through a scientific approach that rewards creative thinking.

That’s a quote from the  shiny new website, ThisIsStatistics. It has stories about what statisticians do, and information about salary and job trends and stuff.  There are videos of statisticians talking about their work: currently Roger Peng (Johns Hopkins, SimplyStatistics blog) and Genevra Allen (Rice University).

It’s slightly disappointing that more of the people on the site arent’ real, just stock photos, but I suppose that’s unavoidable. What’s a bit more annoying is one of the photos in particular:

About-ASA-cropped

This looks as if it was constructed specially (the cup/mat/tablet/glasses are stock, eg).  It’s a rose chart, which is an ok way to display circular data (eg wind directions), but is not so good for comparison because of the way the wedges change shape as they get larger. The numeric labels are also a slightly strange choice for a circle measured in degrees (90 isn’t a multiple of 20).

Much more importantly, given the emphasis of the site on statistics as solving real problems, this is labelled as not being real: “data A” and “data B”.  Not helpful when we’re trying to tell people “It’s not just lines, bars or points on a graph”.

 

July 30, 2014

NRL Predictions for Round 21

Team Ratings for Round 21

The basic method is described on my Department home page. I have made some changes to the methodology this year, including shrinking the ratings between seasons.

Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Sea Eagles 9.18 9.10 0.10
Rabbitohs 7.48 5.82 1.70
Roosters 7.09 12.35 -5.30
Cowboys 4.89 6.01 -1.10
Warriors 4.38 -0.72 5.10
Storm 2.94 7.64 -4.70
Broncos 0.81 -4.69 5.50
Panthers -0.02 -2.48 2.50
Bulldogs -0.76 2.46 -3.20
Knights -1.30 5.23 -6.50
Dragons -2.44 -7.57 5.10
Titans -5.06 1.45 -6.50
Raiders -6.23 -8.99 2.80
Wests Tigers -6.81 -11.26 4.50
Sharks -7.26 2.32 -9.60
Eels -8.66 -18.45 9.80

 

Performance So Far

So far there have been 144 matches played, 79 of which were correctly predicted, a success rate of 54.9%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Knights vs. Roosters Jul 25 16 – 12 -5.80 FALSE
2 Broncos vs. Storm Jul 25 8 – 30 7.40 FALSE
3 Panthers vs. Sharks Jul 26 16 – 18 14.80 FALSE
4 Titans vs. Eels Jul 26 18 – 24 11.20 FALSE
5 Bulldogs vs. Cowboys Jul 26 12 – 20 0.50 FALSE
6 Warriors vs. Sea Eagles Jul 27 12 – 22 1.90 FALSE
7 Wests Tigers vs. Dragons Jul 27 12 – 28 3.60 FALSE
8 Raiders vs. Rabbitohs Jul 28 18 – 34 -7.60 TRUE

 

Predictions for Round 21

Here are the predictions for Round 21. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Sea Eagles vs. Broncos Aug 01 Sea Eagles 12.90
2 Bulldogs vs. Panthers Aug 01 Bulldogs 3.80
3 Sharks vs. Eels Aug 02 Sharks 5.90
4 Cowboys vs. Titans Aug 02 Cowboys 14.50
5 Roosters vs. Dragons Aug 02 Roosters 14.00
6 Raiders vs. Warriors Aug 03 Warriors -6.10
7 Rabbitohs vs. Knights Aug 03 Rabbitohs 13.30
8 Wests Tigers vs. Storm Aug 04 Storm -5.20

 

Super 15 Predictions for the Super Rugby Final

Team Ratings for the Super Rugby Final

The basic method is described on my Department home page. I have made some changes to the methodology this year, including shrinking the ratings between seasons.

Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Waratahs 10.21 1.67 8.50
Crusaders 10.20 8.80 1.40
Sharks 3.91 4.57 -0.70
Hurricanes 2.89 -1.44 4.30
Bulls 2.88 4.87 -2.00
Chiefs 2.23 4.38 -2.10
Brumbies 2.20 4.12 -1.90
Stormers 1.68 4.38 -2.70
Blues 1.44 -1.92 3.40
Highlanders -2.54 -4.48 1.90
Lions -3.39 -6.93 3.50
Force -4.67 -5.37 0.70
Reds -4.98 0.58 -5.60
Cheetahs -5.55 0.12 -5.70
Rebels -9.53 -6.36 -3.20

 

Performance So Far

So far there have been 124 matches played, 82 of which were correctly predicted, a success rate of 66.1%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Crusaders vs. Sharks Jul 26 38 – 6 7.40 TRUE
2 Waratahs vs. Brumbies Jul 26 26 – 8 9.40 TRUE

 

Predictions for the Super Rugby Final

Here are the predictions for the Super Rugby Final. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Waratahs vs. Crusaders Aug 02 Waratahs 4.00

 

If you can explain anything, it proves nothing

An excellent piece from sports site Grantland (via Brendan Nyhan), on finding explanations for random noise and regression to the mean.

As a demonstration, they took ten baseball batters and ten pitchers who had apparently improved over the season so far, and searched the internet for news that would allow them to find an explanation.  They got pretty good explanations for all twenty.  Looking at past seasons, this sort of short-term improvement almost always turns out be random noise, despite the convincing stories.

Having a good explanation for a trend feels like convincing evidence the trend is real. It feels that way to statisticians as well, but it isn’t true.

It’s traditional at this point to come up with evolutionary psychology explanations for why people are so good at over-interpreting trends, but I hope the circularity of that approach is obvious.