Stats Chat

December 5, 2018

Rugby Premiership Predictions for Round 10

Team Ratings for Round 10

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

	Current Rating	Rating at Season Start	Difference
Saracens	12.69	11.19	1.50
Exeter Chiefs	11.39	11.13	0.30
Wasps	4.39	8.30	-3.90
Gloucester Rugby	3.83	1.23	2.60
Northampton Saints	2.42	3.42	-1.00
Leicester Tigers	2.41	6.26	-3.80
Harlequins	2.38	2.05	0.30
Bath Rugby	0.95	3.11	-2.20
Sale Sharks	0.10	-0.81	0.90
Worcester Warriors	-2.39	-5.18	2.80
Newcastle Falcons	-2.43	-3.51	1.10
Bristol	-4.14	-5.60	1.50

Performance So Far

So far there have been 54 matches played, 40 of which were correctly predicted, a success rate of 74.1%.
Here are the predictions for last week’s games.

	Game	Date	Score	Prediction	Correct
1	Harlequins vs. Exeter Chiefs	Nov 30	28 – 26	-4.70	FALSE
2	Bristol vs. Leicester Tigers	Dec 01	41 – 10	-3.60	FALSE
3	Gloucester Rugby vs. Worcester Warriors	Dec 01	36 – 16	10.80	TRUE
4	Northampton Saints vs. Newcastle Falcons	Dec 01	14 – 16	11.60	FALSE
5	Saracens vs. Wasps	Dec 01	29 – 6	12.80	TRUE
6	Bath Rugby vs. Sale Sharks	Dec 02	7 – 7	7.10	FALSE

Predictions for Round 10

Here are the predictions for Round 10. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

	Game	Date	Winner	Prediction
1	Worcester Warriors vs. Northampton Saints	Dec 21	Worcester Warriors	0.70
2	Exeter Chiefs vs. Saracens	Dec 22	Exeter Chiefs	4.20
3	Leicester Tigers vs. Harlequins	Dec 22	Leicester Tigers	5.50
4	Newcastle Falcons vs. Gloucester Rugby	Dec 22	Gloucester Rugby	-0.80
5	Sale Sharks vs. Bristol	Dec 22	Sale Sharks	9.70
6	Wasps vs. Bath Rugby	Dec 23	Wasps	8.90

December 4, 2018

Briefly

By Thomas Lumley

On the relationship between infographics and scientific illustration
A non-peer-reviewed lab-bench research study being used to market some ‘natural’ product isn’t new on StatsChat. All the problems being pointed out in the news story is, though.
If you like running gratuitously long distances, you might like runningstats.nz. It includes comparisons of all the NZ marathon, half-marathon, and 10k events that are based on comparing the same runners on different routes and so aren’t biased by different participation in different events.
Chris Knox has a nice interactive at oneroof for looking at real estate price changes across NZ.
Lessons From Posting A Fake Map

View comments (1)

December 3, 2018

Margin of error

By Thomas Lumley

From Scoop.co.nz, the latest Colmar Brunton poll results

National 46 percent up three points
Labour 43 percent down two points
Greens 5 percent down two points
NZ First 4 percent down one point
Maori Party unchanged on 4 one percent
ACT up one point to one percent

Question: which of these changes are greater than the ‘margin of error’ for polls of this size?

The maximum margin of error in these polls is 3% (the maximum margin of error is for parties polling not too far from 50%). That makes the maximum margin of error for changes between two polls about 4.5% — there are two polls involved, and that multiplies the likely error by the square root of two. For any given party, about one poll in six should show a change of three or more points if the underlying support is stable. That’s in a perfect mathematical world — in the real world, the likely sample errors are larger because the polls aren’t an ideal random sample.

If the 3-point increase in National’s support were real, it would be interesting. But a single poll is a very blunt instrument and the grounds for calling this a “surge” are very weak.

The maximum margin of error doesn’t apply to the Greens, though. When you get to smaller parties, the likely sampling error is smaller in absolute terms, though larger as a proportion of their support. I’ve posted before on this topic, and you can look up the table there to find that the margin of error at 5% is a bit under 1.5 percentage points, so a change of two points is borderline interesting — depending on whether it was rounded up to two or down to two.

You might also wonder if the same applied to ACT. There we’re completely at the mercy of the rounding — it will be possible to tell more when Colmar Brunton releases their detailed report, which (going from past versions) gives an extra decimal place for parties below 5%

View comments (2)

November 28, 2018

Rugby Premiership Predictions for Round 9

By David Scott

Team Ratings for Round 9

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

	Current Rating	Rating at Season Start	Difference
Saracens	12.17	11.19	1.00
Exeter Chiefs	12.00	11.13	0.90
Wasps	4.90	8.30	-3.40
Leicester Tigers	3.70	6.26	-2.60
Gloucester Rugby	3.35	1.23	2.10
Northampton Saints	3.06	3.42	-0.40
Harlequins	1.77	2.05	-0.30
Bath Rugby	1.34	3.11	-1.80
Sale Sharks	-0.30	-0.81	0.50
Worcester Warriors	-1.92	-5.18	3.30
Newcastle Falcons	-3.07	-3.51	0.40
Bristol	-5.43	-5.60	0.20

Performance So Far

So far there have been 48 matches played, 38 of which were correctly predicted, a success rate of 79.2%.
Here are the predictions for last week’s games.

	Game	Date	Score	Prediction	Correct
1	Newcastle Falcons vs. Bath Rugby	Nov 23	16 – 8	0.30	TRUE
2	Worcester Warriors vs. Harlequins	Nov 23	20 – 13	0.70	TRUE
3	Exeter Chiefs vs. Gloucester Rugby	Nov 24	23 – 6	13.50	TRUE
4	Sale Sharks vs. Northampton Saints	Nov 24	18 – 13	1.50	TRUE
5	Wasps vs. Bristol	Nov 24	32 – 28	17.10	TRUE
6	Leicester Tigers vs. Saracens	Nov 25	22 – 27	-2.50	TRUE

Predictions for Round 9

Here are the predictions for Round 9. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

	Game	Date	Winner	Prediction
1	Harlequins vs. Exeter Chiefs	Nov 30	Exeter Chiefs	-4.70
2	Bristol vs. Leicester Tigers	Dec 01	Leicester Tigers	-3.60
3	Gloucester Rugby vs. Worcester Warriors	Dec 01	Gloucester Rugby	10.80
4	Northampton Saints vs. Newcastle Falcons	Dec 01	Northampton Saints	11.60
5	Saracens vs. Wasps	Dec 01	Saracens	12.80
6	Bath Rugby vs. Sale Sharks	Dec 02	Bath Rugby	7.10

Pro14 Predictions for Round 10

By David Scott

Team Ratings for Round 10

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

	Current Rating	Rating at Season Start	Difference
Leinster	12.53	9.80	2.70
Glasgow Warriors	9.87	8.55	1.30
Munster	9.34	8.08	1.30
Scarlets	5.36	6.39	-1.00
Connacht	1.88	0.01	1.90
Cardiff Blues	0.37	0.24	0.10
Ulster	0.31	2.07	-1.80
Ospreys	-0.50	-0.86	0.40
Edinburgh	-1.29	-0.64	-0.70
Cheetahs	-2.61	-0.83	-1.80
Treviso	-4.36	-5.19	0.80
Dragons	-9.31	-8.59	-0.70
Southern Kings	-9.44	-7.91	-1.50
Zebre	-11.61	-10.57	-1.00

Performance So Far

So far there have been 63 matches played, 52 of which were correctly predicted, a success rate of 82.5%.
Here are the predictions for last week’s games.

	Game	Date	Score	Prediction	Correct
1	Scarlets vs. Ulster	Nov 24	29 – 12	8.10	TRUE
2	Leinster vs. Ospreys	Nov 24	52 – 7	15.50	TRUE
3	Glasgow Warriors vs. Cardiff Blues	Nov 24	40 – 15	13.00	TRUE
4	Cheetahs vs. Treviso	Nov 24	31 – 25	6.30	TRUE
5	Southern Kings vs. Connacht	Nov 26	14 – 31	-5.80	TRUE
6	Zebre vs. Munster	Nov 26	7 – 32	-15.60	TRUE
7	Dragons vs. Edinburgh	Nov 26	18 – 12	-4.40	FALSE

Predictions for Round 10

Here are the predictions for Round 10. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

	Game	Date	Winner	Prediction
1	Munster vs. Edinburgh	Dec 01	Munster	15.10
2	Ospreys vs. Zebre	Dec 01	Ospreys	15.60
3	Cheetahs vs. Connacht	Dec 02	Cheetahs	0.00
4	Ulster vs. Cardiff Blues	Dec 02	Ulster	4.40
5	Dragons vs. Leinster	Dec 02	Leinster	-17.30
6	Glasgow Warriors vs. Scarlets	Dec 02	Glasgow Warriors	9.00
7	Southern Kings vs. Treviso	Dec 02	Treviso	-0.60

November 27, 2018

NZ Census updates

By Thomas Lumley

Two press releases from Stats NZ today

2018 Census data release delayed

Why the 2018 Census data release is delayed

Setting the record straight?

By Thomas Lumley

Brian Wansink, a prominent food researcher from Cornell, was forced to retire earlier this year. Andrew Gelman has some good perspectives. Wansink’s research was on contextual effects on eating — eg, the impact of plate size — and a bunch of these papers have now been retracted.

This week, Wired has a Thanksgiving-themed article about his research. It includes this quote from an email he sent to colleagues ahead of his retirement

“We may believe that our papers have been unfairly retracted. But what they can’t retract is the impact these have had on people’s lives and the impact they will continue to have.”

That’s a perfect summary of the problems with studies that over-promise and are over-publicised. Whether the research was done well or not, it’s going to stick. Subsequent developments — whether modifications, replication failures, or retractions — never get the impact of the original claim.

November 26, 2018

Briefly

By Thomas Lumley

“Rather than assume algorithms will produce better outcomes and hope they don’t accelerate discrimination, we should assume they will be discriminatory and inequitable unless designed specifically to redress these issues.” Lucy Bernholz
” Introduction of [a predictive risk screening tool] resulted in a statistically significant increase in emergency hospital admissions and use of other [National Health] services without evidence of benefits to patients or the [National Health Service].” In the academic journal BMJ, so a bit more technical
Why the NY Times map of the US election results is so good: a Twitter thread
Stacey Kirk in the Sunday Star-Times on the campaign to get Pharmac to pay for one of the most expensive drugs in the world.
Interesting interactive in the Herald about quality-of-life and work in NZ cities. It’s very economist in style. That’s true on the good sense that it appreciates high house prices are a signal that lots of people want to live somewhere and low house prices are a signal that lots of people don’t. It’s also true in the bad sense that there some places where not many people want to live, but the people who do live there really like it — and this sort of analysis suppresses that variation in preferences.
Interesting book on data science and data use: “Data Feminism”

View comments (5)

Privacy and mathwashing

By Thomas Lumley

The Herald has a story from the Washington Post on an “AI” screening tool for babysitters, that allegedly uses both computer vision and text processing to screen social media for risk factors. Here are three quotes from it:

1. A company co-founder says

Parents, he said, should see the ratings as a companion that “may or may not reflect the sitter’s actual attributes.”

But the danger of hiring a problematic or violent babysitter, he added, makes the AI a necessary tool for any parent hoping to keep his or her child safe.

The first thing to note about this is that you could make the same claims about astrology or handwriting analysis or a tarot reading. There’s no quantitative information about accuracy given, and it’s hard to see how the company could even know much how accurate its ratings were, or how biased. It’s not even for sure that the risk rating is positively correlated with risk to kids; the company seems careful not to make even a claim this weak.

2.

Parents could, presumably, look at their sitters’ public social media accounts themselves. But the computer-generated reports promise an in-depth inspection of years of online activity, boiled down to a single digit: an intoxicatingly simple solution to an impractical task.

If the algorithms actually predicted risk better and were less biased than typical employers there might be an advantage to this: your social media would be shared with a faceless US company rather than your potential employer, so there might be less actual privacy invasion — after all, some faceless US companies already have your social media. It might be less embarrassing than your boss knowing what your favourite member of the appropriate sex calls you. The computer could also be set up to ignore irrelevant information like whether you talk about your sexual orientation online. With the setup as it is, that’s not the case, and one of the biggest risks is the completely unfounded appearance of both accuracy and objectivity — “mathwashing” as the jargon puts it.

3.

Where she lives, “100 per cent of the parents are going to want to use this,” she added. “We all want the perfect babysitter.”

One of the significant risks of automating human judgement is that it can go viral. There’s a limit to how well ordinary human prejudices can scale — you’ve got some chance of finding someone who has different biases. The prejudices of one computer checklist, though, can keep someone completely out of an employment sector.

View comments (1)

November 20, 2018

Rugby Premiership Predictions for Round 8

By David Scott

Team Ratings for Round 8

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

	Current Rating	Rating at Season Start	Difference
Saracens	11.95	11.19	0.80
Exeter Chiefs	11.68	11.13	0.50
Wasps	5.52	8.30	-2.80
Leicester Tigers	3.92	6.26	-2.30
Gloucester Rugby	3.67	1.23	2.40
Northampton Saints	3.38	3.42	-0.00
Harlequins	2.34	2.05	0.30
Bath Rugby	1.76	3.11	-1.40
Sale Sharks	-0.61	-0.81	0.20
Worcester Warriors	-2.49	-5.18	2.70
Newcastle Falcons	-3.49	-3.51	0.00
Bristol	-6.04	-5.60	-0.40

Performance So Far

So far there have been 42 matches played, 32 of which were correctly predicted, a success rate of 76.2%.
Here are the predictions for last week’s games.

	Game	Date	Score	Prediction	Correct
1	Gloucester Rugby vs. Leicester Tigers	Nov 16	36 – 13	3.60	TRUE
2	Harlequins vs. Newcastle Falcons	Nov 16	20 – 7	11.00	TRUE
3	Bath Rugby vs. Worcester Warriors	Nov 17	30 – 13	8.90	TRUE
4	Northampton Saints vs. Wasps	Nov 17	36 – 17	1.80	TRUE
5	Saracens vs. Sale Sharks	Nov 17	31 – 25	19.30	TRUE
6	Bristol vs. Exeter Chiefs	Nov 18	29 – 31	-13.30	TRUE

Predictions for Round 8

Here are the predictions for Round 8. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

	Game	Date	Winner	Prediction
1	Newcastle Falcons vs. Bath Rugby	Nov 23	Newcastle Falcons	0.30
2	Worcester Warriors vs. Harlequins	Nov 23	Worcester Warriors	0.70
3	Exeter Chiefs vs. Gloucester Rugby	Nov 24	Exeter Chiefs	13.50
4	Sale Sharks vs. Northampton Saints	Nov 24	Sale Sharks	1.50
5	Wasps vs. Bristol	Nov 24	Wasps	17.10
6	Leicester Tigers vs. Saracens	Nov 25	Saracens	-2.50

Stats Chat

Rugby Premiership Predictions for Round 10

Team Ratings for Round 10

Performance So Far

Predictions for Round 10

Briefly

Margin of error

Rugby Premiership Predictions for Round 9

Team Ratings for Round 9

Performance So Far

Predictions for Round 9

Pro14 Predictions for Round 10

Team Ratings for Round 10

Performance So Far

Predictions for Round 10

NZ Census updates

Setting the record straight?

Briefly

Privacy and mathwashing

Rugby Premiership Predictions for Round 8

Team Ratings for Round 8

Performance So Far

Predictions for Round 8

Recent comments

Popular posts

Latest posts

All topics

Recommended sites

Subscribe:

Receive our posts via email:

Team Ratings for Round 10

Performance So Far

Predictions for Round 10

Team Ratings for Round 9

Performance So Far

Predictions for Round 9

Team Ratings for Round 10

Performance So Far

Predictions for Round 10

Team Ratings for Round 8

Performance So Far

Predictions for Round 8

Recent comments

Popular posts

Latest posts

All topics

Recommended sites