Posts from June 2025 (28)

June 24, 2025

Top 14 Predictions for the Final

Team Ratings for the Final

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Stade Toulousain 13.56 8.76 4.80
Bordeaux Begles 4.71 3.96 0.80
Toulon 4.67 5.32 -0.60
Clermont 2.59 0.41 2.20
Stade Rochelais 2.19 4.85 -2.70
Racing 92 1.31 2.75 -1.40
Section Paloise 0.92 1.38 -0.50
Bayonne 0.90 -1.69 2.60
Castres Olympique 0.57 -0.09 0.70
Montpellier 0.48 -0.96 1.40
Lyon -1.07 -0.18 -0.90
Stade Francais -2.53 1.86 -4.40
USA Perpignan -3.91 -0.66 -3.30
Vannes -8.66 -10.00 1.30

 

Performance So Far

So far there have been 186 matches played, 143 of which were correctly predicted, a success rate of 76.9%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Stade Toulousain vs. Bayonne Jun 21 32 – 25 19.20 TRUE
2 Bordeaux Begles vs. Toulon Jun 22 39 – 24 6.80 TRUE

 

Predictions for the Final

Here are the predictions for the Final. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Stade Toulousain vs. Bordeaux Begles Jun 29 Stade Toulousain 8.00

 

NRL Predictions for Round 17

Team Ratings for Round 17

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Storm 9.98 9.29 0.70
Roosters 5.79 7.44 -1.70
Panthers 5.43 8.50 -3.10
Sharks 3.09 5.10 -2.00
Dolphins 3.08 -1.96 5.00
Bulldogs 2.62 0.07 2.50
Sea Eagles 1.88 2.97 -1.10
Raiders 1.41 -3.61 5.00
Warriors 0.49 -1.68 2.20
Broncos -0.39 -1.82 1.40
Cowboys -2.06 4.11 -6.20
Knights -2.54 -0.05 -2.50
Eels -4.05 -3.02 -1.00
Rabbitohs -4.25 -4.35 0.10
Dragons -4.92 -4.55 -0.40
Titans -7.41 -5.50 -1.90
Wests Tigers -8.17 -10.97 2.80

 

Performance So Far

So far there have been 120 matches played, 68 of which were correctly predicted, a success rate of 56.7%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Wests Tigers vs. Raiders Jun 20 12 – 16 -7.10 TRUE
2 Warriors vs. Panthers Jun 21 18 – 28 -0.50 TRUE
3 Dolphins vs. Knights Jun 21 20 – 26 10.10 FALSE
4 Rabbitohs vs. Storm Jun 21 24 – 25 -12.30 TRUE
5 Broncos vs. Sharks Jun 22 34 – 28 -1.30 FALSE
6 Roosters vs. Cowboys Jun 22 42 – 8 8.70 TRUE
7 Eels vs. Titans Jun 22 36 – 20 5.30 TRUE

 

Predictions for Round 17

Here are the predictions for Round 17. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Panthers vs. Bulldogs Jun 26 Panthers 5.80
2 Sea Eagles vs. Wests Tigers Jun 27 Sea Eagles 13.10
3 Knights vs. Raiders Jun 27 Raiders -0.90
4 Broncos vs. Warriors Jun 28 Broncos 2.60
5 Dragons vs. Eels Jun 28 Dragons 2.10
6 Dolphins vs. Rabbitohs Jun 28 Dolphins 10.30
7 Storm vs. Sharks Jun 29 Storm 9.90
8 Titans vs. Cowboys Jun 29 Cowboys -2.30

 

AFL Predictions for Week 17

Team Ratings for Week 17

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Western Bulldogs 24.66 18.20 6.50
Geelong Cats 21.49 15.04 6.50
Collingwood 20.66 5.39 15.30
Adelaide Crows 19.16 2.69 16.50
Brisbane Lions 15.77 22.65 -6.90
Hawthorn Hawks 12.68 21.95 -9.30
Fremantle Dockers 6.49 5.99 0.50
Gold Coast Suns 4.88 -6.41 11.30
GWS Giants 4.30 9.08 -4.80
Sydney Swans 1.59 12.60 -11.00
Carlton Blues -1.24 5.01 -6.30
Melbourne Demons -3.10 -0.21 -2.90
Port Adelaide Power -5.17 7.63 -12.80
St Kilda Saints -6.93 0.89 -7.80
Essendon Bombers -17.61 -10.15 -7.50
North Melbourne -21.12 -37.08 16.00
Richmond Tigers -33.83 -31.00 -2.80
West Coast Eagles -35.08 -34.67 -0.40

 

Performance So Far

So far there have been 127 matches played, 81 of which were correctly predicted, a success rate of 63.8%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Fremantle Dockers vs. Essendon Bombers Jun 19 104 – 63 33.20 TRUE
2 Geelong Cats vs. Brisbane Lions Jun 20 51 – 92 17.30 FALSE
3 Carlton Blues vs. North Melbourne Jun 21 73 – 84 29.50 FALSE
4 Port Adelaide Power vs. Sydney Swans Jun 21 52 – 71 8.20 FALSE
5 Collingwood vs. St Kilda Saints Jun 21 108 – 74 30.80 TRUE
6 GWS Giants vs. Gold Coast Suns Jun 22 106 – 99 11.50 TRUE
7 Western Bulldogs vs. Richmond Tigers Jun 22 135 – 56 59.50 TRUE

 

Predictions for Week 17

Here are the predictions for Week 17. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Port Adelaide Power vs. Carlton Blues Jun 26 Port Adelaide Power 7.10
2 Sydney Swans vs. Western Bulldogs Jun 27 Western Bulldogs -12.10
3 Gold Coast Suns vs. Melbourne Demons Jun 28 Gold Coast Suns 19.00
4 Hawthorn Hawks vs. North Melbourne Jun 28 Hawthorn Hawks 37.80
5 Collingwood vs. West Coast Eagles Jun 28 Collingwood 66.70
6 Richmond Tigers vs. Adelaide Crows Jun 29 Adelaide Crows -42.00
7 Fremantle Dockers vs. St Kilda Saints Jun 29 Fremantle Dockers 24.40

 

June 23, 2025

Briefly

  • ‘Kids in sport stay out of court’ – Sport NZ to help curb youth offending from RNZ.  This is another one of these cause and effect ones. Is it that being pushed into sport makes kids less likely to offend? Is it that kids with the qualities — self-control, work ethic, parents who drive you to games — to engage with sport are less likely to commit dumb crimes? Or (as anonymous law commenter @StrictlyObiter suggests) is it that “promising young sportsmen” are more likely to get a discharge without conviction?  Or, more likely, all of the above in some complicated mixture
  • From the Bennett Institute for Applied Data Science at Oxford, another example of counting being hard. They wanted to find out how much of each medication was used across the National Health Service
  • On pizza as a leading indicator of US military activity
  • It’s twenty years since XKCD did a big colour survey, showing people coloured patches and asking for colour names.  Nicola Rennie made this poster of the top (ie, most agreed on) colour names — click to embiggen.

Evidence of things not seen

A couple of studies out recently look at coffee and health.  One from Harvard(and reported by CNBC)  says coffee (but not decaf or tea) increases the chance of healthy aging in women. Another, from Tufts, (and reported by Newsweek and The Independent) says that unsweetened black coffee, or coffee with very small amounts of sugar or normal coffee amounts of milk, reduces death rates slightly, but not coffee with more sugar or milk (as in everything from a Kiwi small flat white to American-style lattes)

There are two problems with these studies.  The first is that I can’t see them.  One is an abstract from a conference presentation; the other is a paper in an academic journal, but not one the University of Auckland gives me access to.

Compounding this, the two abstracts only give information for their preferred beverages. It’s not possible to tell whether “Decaffeinated coffee and tea intake were not significantly associated with odds of HA nor any domains” means that there’s evidence the correlations were different for tea and decaf or whether there was just a bit more uncertainty around plausibly the same correlation.  Similarly, “However, the mortality benefits were restricted to black coffee [HR (95% CI): 0.86 (0.77, 0.97)] and coffee with low added sugar and saturated fat content [HR (95% CI): 0.86 (0.75, 0.99)]” doesn’t tell us what they found for other coffee types. Nor is the information in the press releases I could find. Since the difference between ways of drinking coffee was the main news tag for these studies, that’s a bit unsatisfactory.

I’ll also note that the Tufts team published an abstract in 2020 with a slightly smaller version of the data from the same survey series, and concluded “Adding milk/cream, alone or with sugar/sweetener, did not significantly change the results.”

A basic principle for studies like these is that conclusions about difference require evidence of difference.  This applies to conclusions in the paper, and even more so to conclusions you want the press to report.

June 21, 2025

Census roundup

Not necessarily endorsed by me, but many of these people do know what they are talking about.

I do also want to emphasize that no-one expert thinks this is a proposal to stop collecting data for the government. Administrative data already marks when you are born or die, when you enter or leave New Zealand, when you pay taxes or go to school or get health care.  This information is more reliably and rapidly collected administratively than in the Census. What we risk losing is not that, but other things.

Reeling them in

Q:  One News says fishing can improve your mental health!

A: That sounds fairly plausible, actually. Did they say how they know?

Q: “research from the UK”

A: A bit non-specific, innit?

Q:

A: I think it’s this paper. The number matches (“Almost 17% less likely”) and it’s from the UK and there doesn’t seem to be a better match

Q: And people who fished more had less mental illness?

A: People who fished more often had less history of depression, suicidal thoughts, and self-harm. People who fished longer had more suicidal thoughts.

Q: How often did people have to fish to be in the “17% less likely” group

A: It’s not clearly described.  The model in the paper actually has 17% more likely, so maybe it’s a model for “not mental health problem”.  If the 17% is for a one-step difference in the survey question then it’s a surprisingly large effect of a very small difference: 5-6 times a week is a different category from 3-4 times; once every two weeks is different from once per month.

Q: Could the anglers just be healthier anyway, or richer or something? Did they collect that information?

A: They did collect it, but they didn’t use it in the analysis, at least in this paper.

Q:

A:

Q: How did they recruit the people?

A: “an online survey  that was advertised through the Instagram, Facebook, and Twitter accounts of Angling Direct and Tackling Minds. Angling Direct also sent the survey link to their mailing list, and the link was distributed via the Anglia Ruskin University Twitter account, as well as the authors’ own networks.”

Q: That … sounds like it might not be perfectly representative

A: 98% of the respondents were men, for example. And 40% were in the top 20% of household income nationally.

Q: Would I be right in guessing that Angling Direct is some sort of fishing magazine?

A: It’s actually a chain of fishing supply stores in the UK.  Claims to be the UK’s leading fishing-tackle retailer

Q: Ok, and Tackling Minds is maybe some sort of fishing education thing?

A: It’s a charity that uses fishing as a mental health intervention.

Q: Couldn’t that have some impact on the correlations between fishing and mental health in the sample?

A: Indeed it could

June 19, 2025

Compared to what?

Via Bluesky from Instagram, and attributed to Chris Hipkins

When StatsNZ produces the data here, it was purely descriptive: number go sideways, number go down, number go up. The use on @nzlabour’s Instagram and with a Chris Hipkins electoral authorisation obviously intends a comparison, even without the annotations.  A simple comparison to the past — butter is more expensive now — is true, but it’s not what’s implied. We can tell it isn’t, because it would have no political implications and so wouldn’t be worth marketing.

The implied comparison here is to a scenario where the price of butter stays const (or keeps decreasing?) in 2024. The comparison is clearly bogus (which is why the graph is such an effective way to present it).  You might approve or disapprove of NZ butter prices following global trends, and of the NZ supermarket duopoly having substantial pricing power, but these are ongoing issues and neither one is the fault of the current government. A Labour government that committed to not increasing taxes  isn’t going to introduce price caps or government subsidies for butter!

The graph has the opposite problem to a lot of Covid comparisons.  Here, the problem is comparing to a hypothetical world that is unrealistically different. For Covid, it’s comparing to a hypothetical world that’s unrealistically similar: talking as if we could have skipped lockdowns and just had a normal economy, when the real alternative is lots of illness and death and a much worse economic problem.  The usefulness of counterfactual comparisons relies on making realistic choices about what would have been the same or different.

June 18, 2025

Tatau tātou, eh?

According to the Herald, the government has decided to stop doing the Census after the next (2028) round and switch to yearly administrative data from 2030.  The press release is here, and StatsNZ’s page is here.  There’s no commitment so far to get the necessary legislative changes passed before the election, but that may come.

This was inevitable at some point.  Door-to-door enumeration is getting less effective and administrative data are getting more complete: eventually the two lines will cross. There are quite a few countries that have more detailed and thorough government data collection than us and don’t bother with censuses. They get on fine. I’m not sure we’re there yet, but maybe we will be in 2030?

At the crude level of “how many people are there and roughly where do they live and what work do they do?”, administrative data is great.  The use of administrative data in the 2018 and 2023 Censuses improved the counts of people by region, and especially improved the counts for Māori.  There are some important weaknesses, though.

First, the `administrative’ data used to augment the 2018 and 2023 Censuses included past Census data, not just routinely-collected government data.  In 2018, the first-priority source for additional data was the 2013 Census, and it was often important. For example, when creating the “Māori descent for electoral purposes” variable, StatsNZ found 15% of the “Yes” values and 7.7% of the “No” values in 2013 Census data. [Table 4.2, Initial report of Census data quality panel].  If we stop doing Censuses, the existing Census data will rapidly become less useful.

Second, administrative data is much less effective for household statistics than for individual statistics.  Most routine government data collection is about individuals.  If Chris reported a particular Auckland address in March 2025 and Pat reported that address in December 2024 and Sandy reported it in July 2024 and Alex in June 2024, how do you work out which subset of these folks were ever living there together? And that’s before you get to situations like if you’ve just started flatting but your doctor has your Mum’s address and your boss has your Dad’s address.   In 2018, household data were a big weakness of the Census — nearly 8% of the census population didn’t have an assigned household. StatsNZ did a lot of work on this subsequently, but it’s hard.

Third, there are data that just aren’t collected routinely. Iwi affiliation, disabilities, and housing quality variables were examples from 2018. If these variables are wanted, they will have to be collected in other surveys, and there’s no clear reason to expect the other surveys to be more accurate than the Census. In particular, they may have worse non-response rates for Māori and for minority groups.

There’s also a potential social license issue.  People understand the Census and have some idea of what it’s for, and mostly approve.  The IDI is much less well understood, and I think is less popular. Replacing the Census with surveys and vacuuming up of data collected for other purposes could well have a negative effect on public willingness to give up their data and public trust in the results.

Good sources if you want to read about this include the StatsNZ page, whatever Len Cook writes, and also the reports of the 2018 Census Data Quality Panel (there’s a 2023 report, but it’s much smaller and mostly talks about minor improvements in methods).

June 17, 2025

Top 14 Predictions for the Semi-finals

Team Ratings for the Semi-finals

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Stade Toulousain 14.01 8.76 5.20
Toulon 4.97 5.32 -0.30
Bordeaux Begles 4.39 3.96 0.40
Clermont 2.59 0.41 2.20
Stade Rochelais 2.17 4.85 -2.70
Racing 92 1.31 2.75 -1.40
Section Paloise 0.95 1.38 -0.40
Castres Olympique 0.57 -0.09 0.70
Montpellier 0.50 -0.96 1.50
Bayonne 0.43 -1.69 2.10
Lyon -1.07 -0.18 -0.90
Stade Francais -2.53 1.86 -4.40
USA Perpignan -3.89 -0.66 -3.20
Vannes -8.68 -10.00 1.30

 

Performance So Far

So far there have been 184 matches played, 141 of which were correctly predicted, a success rate of 76.6%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Toulon vs. Castres Olympique Jun 15 52 – 23 11.90 TRUE
2 Bayonne vs. Clermont Jun 14 20 – 3 4.50 TRUE

 

Predictions for the Semi-finals

Here are the predictions for the Semi-finals. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Stade Toulousain vs. Bayonne Jun 21 Stade Toulousain 19.20
2 Bordeaux Begles vs. Toulon Jun 22 Bordeaux Begles 6.80