If a ton of these mistakes are genuinely simple calculation errors, it seems like giving the models access to a calculator tool would help a fair bit.
I’m surprised they haven’t tried this, I’m running my own in parallel against my accountant in this way.
ofrzeta•3h ago
Unsurprisingly. Sometimes I feel like I am in a madhouse. Or in an alchemist's laboratory.