April 5, 2018

Immigration NZ and the harm model

Immigration NZ, by and large, has been good at transparency in the past– you may think some of their policies are inhumane or arbitrary, but you can easily find out what their policies are.  That’s a pleasant contrast to the other place I’ve lived as an immigrant. Even their operational manual is available online. So, when you hear in this morning’s Radio NZ story “Immigration NZ using data system to predict likely troublemakers”, you might want to give them the benefit of the doubt and assume they are just taking more steps to make their decision procedures explicit.

But then you get to the quotes

“We will model the data sets we have available to us and look at who or what’s the demographic here that we’re looking at around people who are likely to commit harm in the immigration system or to New Zealand,” he said.

“Things like who’s incurring all the hospital debt or the debt to this country in health care, they’re not entitled to free healthcare, they’re not paying for it.

“So then we might take that demographic and load that into our harm model and say even though person A is doing this is there any likelihood that someone else that is coming through the system is going to behave in the same way and then we’ll move to deport that person at the first available opportunity so they don’t have a chance to do that type of harm.

At the very least, they are saying that you can have two people with the same record of what they’ve done in New Zealand, in the same circumstances, and one of them will be deported and the other not deported based on, say, country of origin or age.  It’s true that to be deported you have to have done something that gives them a justification — but “at the first available opportunity” is fairly broad when you’re Immigration NZ. And if  they’re talking about people who are “not entitled to free health care”, then “immigrants” is the wrong term. [update: Radio NZ have now changed the first word of the story from “Immigrants” to “Overstayers”. Apart from that issue of terminology the same comments still apply]

So, how does this differ from, say, the IRD using statistical models to target people with higher probability of having committed tax fraud for auditing? There are two important differences in principle. The first is that the IRD is interested in auditing people who have already committed tax fraud, not people who might do so in the future. The second is that the consequences of being caught don’t depend on the predicted probability. Immigration NZ, on the other hand, seems to be interested in treating people differently based on things they haven’t done but might do in the future.

Now, Immigration NZ has to deport some people. It has to make decisions about who to let into the country in the first place, and who to give extensions of visas, or grant residency. That’s what it’s for. These decisions will have serious impacts on the lives of would-be immigrants — ranging from those who have an application for residency denied to those who don’t even bother applying because there’s no hope.

Since Immigration NZ does make these sorts of decisions, do we want them to do it based on a statistical model? That’s actually a serious question. It depends. There are at least three issues with the model: the ‘transparency‘ issue, the ‘audit‘ issue and the ‘allowable information‘ issue. All of these are also a problem with decisions made by humans.

The ‘allowable information‘ issue is ‘racial profiling’. As a society, we’ve decided that some information just should not be used to make certain types of decisions — regardless of whether it’s genuinely predictive. For anyone other than Immigration NZ, country of origin would be in that category. Invoking a statistical model — essentially, writing it down in a flowchart — wouldn’t be a justification. To some extent Immigration NZ is required to treat prospective immigrants differently based on their country of origin; the question is how far they can go. The Human Rights Commission is likely to have an opinion here, and it’s quite possible they’ll say Immigration NZ has gone too far.

The ‘transparency‘ issue is that the model should be public.  Voters should be able to find out their government’s policy on deportations; people trying to immigrate should know their chances. The tax office have an argument for keeping their model secret; they don’t want people to be able to tweak their accounts to escape detection. The immigration office don’t.

The ‘audit‘ issue is related but more complicated.  Immigration NZ need to know (and should have independent verification, and should tell us) how accurate the model is and what inputs it’s sensitive to, and how reliable the data are. How many of the deported people does the model say would have committed serious crimes? How much unnecessary government expenditure does it predict they will require? How well do these predictions match up to reality? Are there relevant groups of people for whom the model is importantly less accurate — people from particular countries, people with or without family in NZ, etc — so that the costs of automated decision making aren’t justified by benefits.  And to what extent do the inputs to the model suffer from self-reinforcing bias?

The classic problem of self-reinforcing bias comes from a different context, predictions of future offences by convicted criminals. We don’t have data on who commits crimes, only on who is arrested, charged, or convicted.  To the extent that people from particular demographic groups are more likely to attract the notice of the justice system, it will look as if they are more likely to commit crime, and this will lead to more targeted enforcement. And so on, round and round.

In the immigration setting, we’d be concerned about any of the criteria that can be affected by current immigration enforcement practice — if people are currently more likely to be deported or more likely to have applications refused based subjectively on country of origin, this will tend to show up in the new models.  Healthcare costs, on the other hand, aren’t directly affected by Immigration NZ decisions and so don’t have the same self-reinforcing vicious circle — though failing to pay the bills might.

Having a statistical model isn’t necessarily a bad thing, just like having a formal flowchart or points system isn’t necessarily a bad thing.  The model can have various sorts of bias, but so can actual human immigration officers.  In contrast to some of the social policy models, this model isn’t being used to make new distinctions in a setting where everyone used to be treated uniformly — the immigration system has always made individual decisions about visas and deportations.

In principle, a model  could be developed with care to include only the right sorts of inputs, to predict outputs that aren’t subject to vicious circles,  to have clear and reliably estimated costs and benefits associated with decisions, and to be open to independent audit. Such a model would be more accountable to the Minister, Parliament, and the nation than the decisions of individual immigration officers.

The fact that we, and the incoming Minister, only found out about the system this morning doesn’t suggest we’ve got that sort of model. Neither does the disappearance of data from their website, where they’ve just discovered privacy problems (without all that much effect, since the data are still up at archive.org). Nor the explicit use of country of origin. Nor the spokesperson’s complete lack of reference to safeguards in the modelling process, or the argument that they can’t be doing racial profiling because they also use gender, age and type of visa in the model.

avatar

Thomas Lumley (@tslumley) is Professor of Biostatistics at the University of Auckland. His research interests include semiparametric models, survey sampling, statistical computing, foundations of statistics, and whatever methodological problems his medical collaborators come up with. He also blogs at Biased and Inefficient See all posts by Thomas Lumley »

Comments

  • avatar
    Steve Curtis

    The first word in the story word says ‘overstayers’ so they have broken the rules about length of stay. It clarifies this in more detail. (Is this a ‘later version’ of the story?)
    ‘Immigration New Zealand has been modelling data – including age, gender and ethnicity – of overstayers….
    So it is the same isnt it as IRD who model those who have committed tax fraud.
    As all overstayers are liable for deportation they seem to have a way, clumsy it may be, of focusing on those who are ‘deplorables’.

    6 years ago

    • avatar
      Thomas Lumley

      It is a later version of the story. The first word in the original version was “Immigrants”.

      The same issues still apply. Yes, all the people they are deporting are eligible to be deported. Immigration NZ isn’t deporting people illegally — the story would be much, much bigger if they were. But, in deciding who to deport, they are treating people differently in a way that (a) depends on country of origin and other criteria that would not ordinarily be legitimate, (b) is not public, and (c) doesn’t appear to have been evaluated as to whether it delivers any of the claimed benefits.

      All of those are bad things and they don’t become good things just because it’s happening to overstayers.

      6 years ago

  • avatar
    Megan Pledger

    I’ve mixed feelings. If it’s just being used as a way to prioritise deportations, deportations that were going to occur anyway, then I guess I don’t have any problem with it.

    But using ethnicity as a variable sounds icky on many levels. On a statistical level there is the general problem of what level ethnicity is going to be aggregated/disaggregated – does a person from Tonga get deported because an age/sex peer from Samoa didn’t pay a medical bill?

    I think there will be equity issues around gender as well. What happens if the medical bills are caused by pregnancy – does all the “deportation risk” fall on the women who got pregnant and none on her sexual partner? What if the medical bills are due to assault or rape? And does the “deportation risk” change if the offender is a deportee or a non-deportee?

    6 years ago

    • avatar
      Thomas Lumley

      I agree using data to prioritise deportations could easily be better than not using data. But the country of origin and gender things– icky, as you say. I might feel the same about age, except that age discrimination is so fundamental and explicit in the immigration system.

      But also, if it’s being done based on good data there’s no reason not to make the model and its evaluations public. If the criteria are good they should still be good if people know them. We’re not trying trap anyone.

      And finally, while it is apparently just being used in overstayers, the arguments supporting it would still apply in other cases where Immigration NZ has discretion. If it’s a good idea for overstayers it would be a good idea elsewhere too.

      6 years ago

  • avatar
    Jason Eager

    Given that we’re talking about the sovereignty of different nations interacting in this context (i.e. immigration), and we’re talking about people in the country ILLEGALLY (And therefore they’ve gone from being a friendly visitor/worker to being an INVADER), it is absolutely ludicrous and irresponsible for the people responsible for immigration to NOT consider their country of origin, and to check for patterns of illegal behavior of other people from that country, to see if their culture or their government is encouraging a defacto invasion or pillaging of resources from this country. (i.e. Mexico’s interactions with the USA). Nationality should not be automatically considered a direct proxy with race, and when the two happen to be directly correlated, then the connection between the two should not stop the department of immigration from acting responsibly to protect NZ’s sovereignty.

    TL:DR: It’s perfectly legitimate for NZ Immigration to consider the nationality of illegal immigrants to reduce harm to NZ, which is their PRIMARY mission. Take your virtue signaling and shove it where the sun doesn’t shine.

    6 years ago

    • avatar
      Thomas Lumley

      As I pointed out before, the argument doesn’t rely on them being unauthorised immigrants. Immigration NZ has discretion as to how to act on deportations — and in a lot of other cases as well. As do, say, the police and the courts in many situations. Your argument doesn’t really persuade me that the rest of the country should have absolutely no input on how this discretion is exercised, and that the rules shouldn’t be public. Or that I don’t have any right an opinion on the topic.

      It’s very easy for you not to read my opinions, though, then it will be just as if I had shoved them into the Aro Valley.

      6 years ago

    • avatar
      megan pledger

      We want to protect out sovereignty but we should to do it in a fair and ethical manner.

      There are three clear statistical issues…
      1) Most models are based on averages but people are individuals,
      2) I suspect the models won’t get anywhere near the complexity of human behaviour and
      3) the approriateness of aggregating/ disagregating the independent variables.

      6 years ago

      • avatar
        Steve Curtis

        The trouble with looking at features of the Immigration ‘system’ is that its intentionally discriminatory. The Human Rights commission is barred from taking action.
        ““immigration matters inherently involve different treatment on the basis of personal characteristics” (such as health, age, national origin).”

        The Young , the qualified , the wealthy are favoured.
        Being pregnant, major health issues, criminal convictions, no money are grounds for being barred from the country even if you have a visa.
        It must be very hard for many to navigate a very bureaucratic system and hiccups in the change from one visa to another may lead to becoming an ‘overstayer’.
        There is an appeal process available but I understand thats for the person involved to apply and a Immigration officer can grant a section 61 visa , but again all those previous ‘discriminatory ‘ factors can be applied. Those of good character etc would have a better chance here
        http://communitylaw.org.nz/community-law-manual/chapter-32-immigration-and-refugees/if-youre-here-illegally-understanding-your-options-chapter-32/

        6 years ago

  • avatar
    Jim Rose

    IRD seems incompetent if it only targets people who have already committed tax fraud.

    Certain industries more likely to avoid taxes as are certain income groups rich enough to hire tax accountants.

    Compliance with tax also correlated with social capital.

    Friends from less democratic countries treat the government as the enemy, assume greater corruption.

    Entire immigration advisory business based on profiting from distrust of government and assumption brokers are needed to navigate bureaucratic process like back home.

    6 years ago

    • avatar
      Thomas Lumley

      I don’t mean people who are already known to have committed tax fraud, I mean people who have a high probability of having committed it. IRD can’t do anything to you if you were going to commit tax fraud two years in the future but haven’t done so at the time they audit you.

      6 years ago