December 5, 2018

Rugby Premiership Predictions for Round 10

Team Ratings for Round 10

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Saracens 12.69 11.19 1.50
Exeter Chiefs 11.39 11.13 0.30
Wasps 4.39 8.30 -3.90
Gloucester Rugby 3.83 1.23 2.60
Northampton Saints 2.42 3.42 -1.00
Leicester Tigers 2.41 6.26 -3.80
Harlequins 2.38 2.05 0.30
Bath Rugby 0.95 3.11 -2.20
Sale Sharks 0.10 -0.81 0.90
Worcester Warriors -2.39 -5.18 2.80
Newcastle Falcons -2.43 -3.51 1.10
Bristol -4.14 -5.60 1.50

 

Performance So Far

So far there have been 54 matches played, 40 of which were correctly predicted, a success rate of 74.1%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Harlequins vs. Exeter Chiefs Nov 30 28 – 26 -4.70 FALSE
2 Bristol vs. Leicester Tigers Dec 01 41 – 10 -3.60 FALSE
3 Gloucester Rugby vs. Worcester Warriors Dec 01 36 – 16 10.80 TRUE
4 Northampton Saints vs. Newcastle Falcons Dec 01 14 – 16 11.60 FALSE
5 Saracens vs. Wasps Dec 01 29 – 6 12.80 TRUE
6 Bath Rugby vs. Sale Sharks Dec 02 7 – 7 7.10 FALSE

 

Predictions for Round 10

Here are the predictions for Round 10. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Worcester Warriors vs. Northampton Saints Dec 21 Worcester Warriors 0.70
2 Exeter Chiefs vs. Saracens Dec 22 Exeter Chiefs 4.20
3 Leicester Tigers vs. Harlequins Dec 22 Leicester Tigers 5.50
4 Newcastle Falcons vs. Gloucester Rugby Dec 22 Gloucester Rugby -0.80
5 Sale Sharks vs. Bristol Dec 22 Sale Sharks 9.70
6 Wasps vs. Bath Rugby Dec 23 Wasps 8.90

 

December 4, 2018

Briefly

December 3, 2018

Margin of error

From Scoop.co.nz, the latest Colmar Brunton poll results

National 46 percent up three points 
Labour 43 percent down two points
Greens 5 percent down two points
NZ First 4 percent down one point
Maori Party unchanged on 4 one percent
ACT up one point to one percent

Question: which of these changes are greater than the ‘margin of error’ for polls of this size?

The maximum margin of error in these polls  is 3% (the maximum margin of error is for parties polling not too far from 50%). That makes the maximum margin of error for changes between two polls about 4.5% — there are two polls involved, and that multiplies the likely error by the square root of two.  For any given party, about one poll in six should show a change of three or more points if the underlying support is stable.  That’s in a perfect mathematical world — in the real world, the likely sample errors are larger because the polls aren’t an ideal random sample.

If the 3-point increase in National’s support were real, it would be interesting. But a single poll is a very blunt instrument and the grounds for calling this a “surge” are very weak.

The maximum margin of error doesn’t apply to the Greens, though.  When you get to smaller parties, the likely sampling error is smaller in absolute terms, though larger as a proportion of their support.  I’ve posted before on this topic, and you can look up the table there to find that the margin of error at 5% is a bit under 1.5 percentage points, so a change of two points is borderline interesting — depending on whether it was rounded up to two or down to two.

You might also wonder if the same applied to ACT. There we’re completely at the mercy of the rounding — it will be possible to tell more when Colmar Brunton releases their detailed report, which (going from past versions) gives an extra decimal place for parties below 5%

November 28, 2018

Rugby Premiership Predictions for Round 9

Team Ratings for Round 9

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Saracens 12.17 11.19 1.00
Exeter Chiefs 12.00 11.13 0.90
Wasps 4.90 8.30 -3.40
Leicester Tigers 3.70 6.26 -2.60
Gloucester Rugby 3.35 1.23 2.10
Northampton Saints 3.06 3.42 -0.40
Harlequins 1.77 2.05 -0.30
Bath Rugby 1.34 3.11 -1.80
Sale Sharks -0.30 -0.81 0.50
Worcester Warriors -1.92 -5.18 3.30
Newcastle Falcons -3.07 -3.51 0.40
Bristol -5.43 -5.60 0.20

 

Performance So Far

So far there have been 48 matches played, 38 of which were correctly predicted, a success rate of 79.2%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Newcastle Falcons vs. Bath Rugby Nov 23 16 – 8 0.30 TRUE
2 Worcester Warriors vs. Harlequins Nov 23 20 – 13 0.70 TRUE
3 Exeter Chiefs vs. Gloucester Rugby Nov 24 23 – 6 13.50 TRUE
4 Sale Sharks vs. Northampton Saints Nov 24 18 – 13 1.50 TRUE
5 Wasps vs. Bristol Nov 24 32 – 28 17.10 TRUE
6 Leicester Tigers vs. Saracens Nov 25 22 – 27 -2.50 TRUE

 

Predictions for Round 9

Here are the predictions for Round 9. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Harlequins vs. Exeter Chiefs Nov 30 Exeter Chiefs -4.70
2 Bristol vs. Leicester Tigers Dec 01 Leicester Tigers -3.60
3 Gloucester Rugby vs. Worcester Warriors Dec 01 Gloucester Rugby 10.80
4 Northampton Saints vs. Newcastle Falcons Dec 01 Northampton Saints 11.60
5 Saracens vs. Wasps Dec 01 Saracens 12.80
6 Bath Rugby vs. Sale Sharks Dec 02 Bath Rugby 7.10

 

Pro14 Predictions for Round 10

Team Ratings for Round 10

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Leinster 12.53 9.80 2.70
Glasgow Warriors 9.87 8.55 1.30
Munster 9.34 8.08 1.30
Scarlets 5.36 6.39 -1.00
Connacht 1.88 0.01 1.90
Cardiff Blues 0.37 0.24 0.10
Ulster 0.31 2.07 -1.80
Ospreys -0.50 -0.86 0.40
Edinburgh -1.29 -0.64 -0.70
Cheetahs -2.61 -0.83 -1.80
Treviso -4.36 -5.19 0.80
Dragons -9.31 -8.59 -0.70
Southern Kings -9.44 -7.91 -1.50
Zebre -11.61 -10.57 -1.00

 

Performance So Far

So far there have been 63 matches played, 52 of which were correctly predicted, a success rate of 82.5%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Scarlets vs. Ulster Nov 24 29 – 12 8.10 TRUE
2 Leinster vs. Ospreys Nov 24 52 – 7 15.50 TRUE
3 Glasgow Warriors vs. Cardiff Blues Nov 24 40 – 15 13.00 TRUE
4 Cheetahs vs. Treviso Nov 24 31 – 25 6.30 TRUE
5 Southern Kings vs. Connacht Nov 26 14 – 31 -5.80 TRUE
6 Zebre vs. Munster Nov 26 7 – 32 -15.60 TRUE
7 Dragons vs. Edinburgh Nov 26 18 – 12 -4.40 FALSE

 

Predictions for Round 10

Here are the predictions for Round 10. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Munster vs. Edinburgh Dec 01 Munster 15.10
2 Ospreys vs. Zebre Dec 01 Ospreys 15.60
3 Cheetahs vs. Connacht Dec 02 Cheetahs 0.00
4 Ulster vs. Cardiff Blues Dec 02 Ulster 4.40
5 Dragons vs. Leinster Dec 02 Leinster -17.30
6 Glasgow Warriors vs. Scarlets Dec 02 Glasgow Warriors 9.00
7 Southern Kings vs. Treviso Dec 02 Treviso -0.60

 

November 27, 2018

NZ Census updates

Setting the record straight?

Brian Wansink, a prominent food researcher from Cornell, was forced to retire earlier this year. Andrew Gelman has some good perspectives. Wansink’s research was on contextual effects on eating — eg, the impact of plate size — and a bunch of these papers have now been retracted.

This week, Wired has a Thanksgiving-themed article about his research. It includes this quote from an email he sent to colleagues ahead of his retirement

“We may believe that our papers have been unfairly retracted. But what they can’t retract is the impact these have had on people’s lives and the impact they will continue to have.”

That’s a perfect summary of the problems with studies that over-promise and are over-publicised.  Whether the research was done well or not, it’s going to stick.  Subsequent developments — whether modifications, replication failures, or retractions — never get the impact of the original claim.

November 26, 2018

Briefly

  • “Rather than assume algorithms will produce better outcomes and hope they don’t accelerate discrimination, we should assume they will be discriminatory and inequitable unless designed specifically to redress these issues.” Lucy Bernholz
  • ” Introduction of [a predictive risk screening tool] resulted in a statistically significant increase in emergency hospital admissions and use of other [National Health] services without evidence of benefits to patients or the [National Health Service].” In the academic journal BMJ, so a bit more technical
  • Why the NY Times map of the US election results is so good: a Twitter thread
  • Stacey Kirk in the Sunday Star-Times on the campaign to get Pharmac to pay for one of the most expensive drugs in the world.
  • Interesting interactive in the Herald about quality-of-life and work in NZ cities.  It’s very economist in style.  That’s true on the good sense that it appreciates high house prices are a signal that lots of people want to live somewhere and low house prices are a signal that lots of people don’t.  It’s also true in the bad sense that there some places where not many people want to live, but the people who do live there really like it — and this sort of analysis suppresses that variation in preferences.
  • Interesting book on data science and data use: “Data Feminism”

Privacy and mathwashing

The Herald has a story from the Washington Post on an “AI” screening tool for babysitters, that allegedly uses both computer vision and text processing to screen social media for risk factors.   Here are three quotes from it:

1. A company co-founder says

Parents, he said, should see the ratings as a companion that “may or may not reflect the sitter’s actual attributes.”

But the danger of hiring a problematic or violent babysitter, he added, makes the AI a necessary tool for any parent hoping to keep his or her child safe.

The first thing to note about this is that you could make the same claims about astrology or handwriting analysis or a tarot reading. There’s no quantitative information about accuracy given, and it’s hard to see how the company could even know much how accurate its ratings were, or how biased. It’s not even for sure that the risk rating is positively correlated with risk to kids; the company seems careful not to make even a claim this weak.

 

2.

Parents could, presumably, look at their sitters’ public social media accounts themselves. But the computer-generated reports promise an in-depth inspection of years of online activity, boiled down to a single digit: an intoxicatingly simple solution to an impractical task.

If the algorithms actually predicted risk better and were less biased than typical employers there might be an advantage to this: your social media would be shared with a faceless US company rather than your potential employer, so there might be less actual privacy invasion — after all, some faceless US companies already have your social media. It might be less embarrassing than your boss knowing what your favourite member of the appropriate sex calls you. The computer could also be set up to ignore irrelevant information like whether you talk about your sexual orientation online.   With the setup as it is, that’s not the case, and one of the biggest risks is the completely unfounded appearance of both accuracy and objectivity — “mathwashing” as the jargon puts it.

 

3.

Where she lives, “100 per cent of the parents are going to want to use this,” she added. “We all want the perfect babysitter.”

One of the significant risks of automating human judgement is that it can go viral. There’s a limit to how well ordinary human prejudices can scale — you’ve got some chance of finding someone who has different biases. The prejudices of one computer checklist, though, can keep someone completely out of an employment sector.

November 20, 2018

Rugby Premiership Predictions for Round 8

Team Ratings for Round 8

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Saracens 11.95 11.19 0.80
Exeter Chiefs 11.68 11.13 0.50
Wasps 5.52 8.30 -2.80
Leicester Tigers 3.92 6.26 -2.30
Gloucester Rugby 3.67 1.23 2.40
Northampton Saints 3.38 3.42 -0.00
Harlequins 2.34 2.05 0.30
Bath Rugby 1.76 3.11 -1.40
Sale Sharks -0.61 -0.81 0.20
Worcester Warriors -2.49 -5.18 2.70
Newcastle Falcons -3.49 -3.51 0.00
Bristol -6.04 -5.60 -0.40

 

Performance So Far

So far there have been 42 matches played, 32 of which were correctly predicted, a success rate of 76.2%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Gloucester Rugby vs. Leicester Tigers Nov 16 36 – 13 3.60 TRUE
2 Harlequins vs. Newcastle Falcons Nov 16 20 – 7 11.00 TRUE
3 Bath Rugby vs. Worcester Warriors Nov 17 30 – 13 8.90 TRUE
4 Northampton Saints vs. Wasps Nov 17 36 – 17 1.80 TRUE
5 Saracens vs. Sale Sharks Nov 17 31 – 25 19.30 TRUE
6 Bristol vs. Exeter Chiefs Nov 18 29 – 31 -13.30 TRUE

 

Predictions for Round 8

Here are the predictions for Round 8. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Newcastle Falcons vs. Bath Rugby Nov 23 Newcastle Falcons 0.30
2 Worcester Warriors vs. Harlequins Nov 23 Worcester Warriors 0.70
3 Exeter Chiefs vs. Gloucester Rugby Nov 24 Exeter Chiefs 13.50
4 Sale Sharks vs. Northampton Saints Nov 24 Sale Sharks 1.50
5 Wasps vs. Bristol Nov 24 Wasps 17.10
6 Leicester Tigers vs. Saracens Nov 25 Saracens -2.50