Not only are they using AI before they've properly assessed them, they also end up using Copilot which must be one of the worse AIs currently available, probably because of existing Microsoft relations. And on top of all that, they hope to be able to rely on "Please review the outputs" which obviously isn't an actual solution here, of course people will get complacent and throw stuff over the wall whenever they can.
This is honestly the fundamental problem of AI as I see it
When we offload our work to a different person we can calibrate our expectations to our past experiences with that person. With AI the experience is not very consistent. To use AI effectively you basically should treat it as a low trust, brand new coworker every single time you use it
That doesn't really scale, so people have two choices: be constantly hyper vigilant for mistakes the AI makes, or become complacent and trust it more than they should
People rightly point out that humans make mistakes too, not just AI. But humans have a pretty manageable cap on the amount of output they can produce. One human can pretty thoroughly review the outputs of a small team of other humans
One human can't possibly thoroughly review the volume of output that an LLM they are prompting can produce
Are we thinking about how we’re using it, or???
It seems like; there’s two kinds of data that might go into this, boilerplate and subjective information. Subjective information should be input by the police, because I would assert the specific wording matters. It matters that the words used to describe what the policeman saw comes out of the policeman’s brain. If it’s boilerplate, I’d AI really more reliable then copy-paste?
Everyone I talk to (including outside of tech) is going through this phase at their companies. It’s not working.
Checking the output seems like a simple request, but the question becomes: Check against what? If the police are making a document that sources from another report that another officer used AI to produce from their notes which were also run through AI and on and on, an inconsistency that leaks in at a previous step will check out when someone reviews the output against the inputs.
We’re all also discovering that many people’s idea of reviewing the output is to skim it and verify that it looks convincing enough. Checking facts is hard and takes time. These people are using AI because they want to work less, not to give themselves extra work.
It’s not typing that’s the bottleneck, at least not often, so this is essentially assuming that you can do all the needed work without actually doing it, which is obviously wishful thinking.
echelon_musk•48m ago
lloydatkinson•44m ago
cjs_ac•39m ago
phyalow•36m ago
layer8•33m ago
zipy124•29m ago
Aurornis•13m ago