
I thought this was an excellent treatment of the garbage-in-garbage-out aspect of this debate. Too often people get wrapped up in the "Is it ethical to use a computer to judge someone?" implications, which are non-trivial, but also seem to be born of a general ignorance of the many ways we already allow computers (or rather, computation) to make judgments about people.

Also, it's an excellent example of the power of research and tabulation, though it seems the link to the list of departments they researched is broken.

I think the use of crime modeling doesn't have to be controversial, but the first step is demanding transparency. It's scary enough when the police are the primary collectors and judges of data, but even worse when they leave it to a vendor and assume everything is hunky-dory, on the circular reasoning that proprietary algorithms must be good because they're proprietary.

> When the Fresno police briefed the city council about the Beware system, the council president asked a simple question about those color-coded threat levels: “How does a person get to red?” The officer didn’t know, because the vendor wasn’t saying. As the officer delivering the briefing explained: “We don’t know what their algorithm is exactly… We don’t have any kind of a list that will tell us, this is their criteria, this is what would make a person red, this is what would make a person yellow. It’s their own system.”



Aren't there numerous ML techniques potentially applicable to evaluating potential future crimes where even the people responsible for writing the algorithm and feeding in the massive dataset don't really understand how a particular person might "get to red"? Transparency seems to be only part of the problem.
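
To make that concrete, here's a minimal sketch of why "how does a person get to red" can be unanswerable even with full source access. Everything here is invented: synthetic data, hypothetical features, an arbitrary threshold; nothing is from Beware or any real vendor.

```python
# Minimal sketch (synthetic data, hypothetical features): an opaque
# "threat score" from an ensemble model, not from any real vendor.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 4))           # stand-ins for public-records features
y = (X @ [0.5, -1.2, 0.3, 0.9] + rng.normal(size=1000)) > 0

model = RandomForestClassifier(n_estimators=500).fit(X, y)

person = rng.normal(size=(1, 4))
score = model.predict_proba(person)[0, 1]
print(f"threat score: {score:.2f}")      # "red" above some invented threshold

# There is no single rule to point at: the score is an average over 500
# trees, each partitioning the feature space differently. Even with full
# source access, "how does a person get to red" has no short answer.
```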


The corollary to this: if an algorithm uses AI or machine learning to the extent that nobody precisely understands why it makes the decisions it does, it will be very difficult to change its behavior in specific cases, e.g. "make it stop doing that," especially when the inputs cannot be changed.
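
For what it's worth, the usual stopgap I've seen looks like the sketch below: a hand-maintained override table in front of the model. All names, scores, and thresholds are invented; the point is only that this patches one complaint without explaining or changing the underlying behavior.

```python
# Sketch of the usual stopgap when you can't retrain the model to "stop
# doing that": a hand-maintained override table in front of it.
# All names, scores, and thresholds here are invented.

def model_score(features):
    # Stand-in for an opaque model; returns a probability-like score.
    return sum(features) % 1.0

def score_to_color(p):
    return "red" if p > 0.8 else "yellow" if p > 0.4 else "green"

OVERRIDES = {"case_4471": "green"}       # model said red; review disagreed

def classify(case_id, features):
    if case_id in OVERRIDES:
        return OVERRIDES[case_id]        # brittle patch for one known case
    return score_to_color(model_score(features))

print(classify("case_4471", [0.3, 0.6]))  # green, by fiat
print(classify("case_4472", [0.3, 0.6]))  # near-identical case: still red
```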

This is going to come up at some point when a self-driving car does something that appears totally irrational and ends up causing an accident. Engineers will need to come up with some kind of explanation, and I suspect the general public will not be satisfied when they learn that the explanation may be unknowable, or may reduce only to probability instead of certainty.

I deal with some of this at work in far more trivial use cases, and non-engineers just can't seem to accept that sometimes you cannot fix the imperfections without introducing worse imperfections in other areas of the system, and that ML generally leads to output that is "good enough" instead of perfect.
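
A toy illustration of that trade-off, with invented numbers: moving a decision threshold to suppress one kind of error necessarily produces more of the other kind.

```python
# Toy illustration (invented numbers): tightening a threshold to remove
# one kind of error necessarily creates more of the other kind.
import numpy as np

rng = np.random.default_rng(1)
truth = rng.random(10_000) < 0.1         # 10% true positives
scores = np.clip(truth * 0.3 + rng.random(10_000), 0, 1)

for thresh in (0.5, 0.8):
    flagged = scores > thresh
    false_pos = (flagged & ~truth).sum()
    false_neg = (~flagged & truth).sum()
    print(f"threshold {thresh}: {false_pos} false alarms, {false_neg} misses")

# Raising the threshold "fixes" false alarms but misses more real cases.
# No setting removes both; "good enough" is a choice of which errors to
# live with.
```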


Maybe. But at least we can demand to know the system's input data and what measure(s) the training process is designed to optimize. For example, in the case of Beware, which public records are being used, and what is the benchmark for red/yellow/green? Also, are there any systems in place to try to reduce things like racial bias or the self-reinforcement effect described in the report? Why or why not?
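
As one concrete example of the kind of audit that disclosure would enable, here's a sketch of a crude disparate-impact check. The data is synthetic and the "80% rule" impact ratio is just one common benchmark, not necessarily what any vendor uses.

```python
# Sketch of one audit that disclosure would enable: compare flag rates
# across groups. The "80% rule" impact ratio used here is one common,
# admittedly crude, benchmark. All data is synthetic.
import numpy as np

rng = np.random.default_rng(2)
group = rng.choice(["A", "B"], size=5_000)
flagged_red = rng.random(5_000) < np.where(group == "A", 0.05, 0.12)

rates = {g: flagged_red[group == g].mean() for g in ("A", "B")}
ratio = min(rates.values()) / max(rates.values())
print(rates, f"impact ratio: {ratio:.2f}")  # well under 0.8 here

# None of this can be checked from outside if the vendor won't disclose
# inputs, thresholds, or whether such checks were ever run.
```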


Maybe, but I'm not aware of any that have produced predictions on a broad in vivo population that are better than existing methods. It's one thing to do a heat map of a region at a certain time of day, and express that in terms of overall statistical risk. It's quite another to try and read the future intentions of a human being based on what amounts to a wealth of noise and a paucity of signal.
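
For contrast, here's a sketch of the defensible aggregate version: bin past incidents into a spatial grid and report per-cell rates. Coordinates are synthetic.

```python
# Aggregate spatial risk: bin past incidents into a grid and report
# per-cell rates. Coordinates here are synthetic.
import numpy as np

rng = np.random.default_rng(3)
incidents = rng.normal(loc=[0.6, 0.4], scale=0.1, size=(500, 2))

heatmap, _, _ = np.histogram2d(
    incidents[:, 0], incidents[:, 1],
    bins=10, range=[[0, 1], [0, 1]],
)
rate = heatmap / heatmap.sum()           # share of incidents per cell
print(f"hottest cell: {rate.max():.1%} of incidents")

# This claims only "more incidents tend to occur here", which the base
# rates can support. It says nothing about what any one person in that
# cell intends to do.
```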


Just curious as the name is familiar - do you go to Georgia Tech?


I don't, sorry.


Government transparency, especially financial transparency, is absolutely necessary, but it's sorely lacking.


> I think the use of crime modeling doesn't have to be controversial

Outside of a '50s-style sci-fi utopia, I simply can't see how it avoids controversy. What we consider a crime worth pursuing is, alone, a hugely difficult conversation.



