sqlite-utils 4.0rc2, mostly written by Claude Fable (for about $149.25)

https://simonwillison.net/2026/Jul/5/sqlite-utils-fable/

40•ognyankulev•2h ago

Comments

Tiberium•1h ago

The title cost is only if this was raw API usage, but it was included in a subscription, so it's a small subset of the $200 plan:

> I upgraded to the Claude Max $200/month plan (I was previously on $100/month) to increase my Fable allowance for the remaining time until the July 7th Fablepocalypse, when even Claude Max subscribers will have to pay full API cost for the model.

I really wonder if Anthropic will stick with their decision to keep Fable on extra usage credits until they "get more compute", especially in the light of GPT 5.6 very likely coming out next week (it's confirmed to have the exact same pricing as GPT 5.5)

andy_ppp•1h ago

This is to prevent Chinese labs distilling Claude again right? And free advertising again?

embedding-shape•58m ago

> especially in the light of GPT 5.6 very likely coming out next week

Finally have an explanation why GPT 5.5 xhigh felt dumber and dumber these last few weeks, always the same thing when a new model release is about to come out...

toxik•33m ago

Opus has been extremely stupid recently, reckon that's because Fable needs to look appealing?

user43928•6m ago

I have never noticed a degradation in either Claude or OpenAI models, and the benchmarks people set up have never shown a statistically significant deviation either: https://marginlab.ai/trackers/claude-code

Yet the same claim is being posted every single day, including new claims that the Fable 5 model has degraded compared to the initial release, guardrails aside.

embedding-shape•4m ago

Almost slipping into conspiracy territory, but without insights into what the labs actually do internally, hard not to:

Anyways, heard about A/B testing before? ML people tend to like it a lot, hard to imagine neither OpenAI or Anthropic are already deep into categorizing people into buckets and running an wild amount of A/B testing all over the place, especially in the weeks leading up to new model releases, in various ways.

dreadnip•1h ago

The problem I have with this workflow is that the models are still too eager to please. If I ask it to scan a release and note possible issues, it absolutely will find issues. If I keep running the same prompt, it will keep finding issues. I’ve spammed GitHub PR reviews and it just keep finding (or inventing?) new issues. There is never a “Nothing found, good to go!”. I have to keep reminding myself that the model will always give me what I ask for, regardless of the reality/truth.

threatripper•1h ago

You get the same result if you pay humans a good sum of money to find issues.

nvme0n1p1•1h ago

Definitely not. I've never seen a human trapped in that kind of infinite loop. Humans know that if they don't stop at the end of the day, they don't get to go home to their wife, and if they don't finalize their list of issues, they never get their contract paid out.

embedding-shape•59m ago

Pay people per hour of work and even if there is no actual work, people will definitively find a way of spending hours doing things. If you've worked with contractors/outsourced roles before this will happen from time to time.

Tiberium•1h ago

I think this was true with older models, but at least with GPT 5.5 it can genuinely tell you "no issues found" after a few passes of finding real issues.

hnbad•1h ago

Fun fact: because AI written works don't have copyright (in the EU at least) and the level of prompting many people engage in doesn't suffice to create a copyrightable "work" and software licenses require you to actually be able to grant a license using rights you hold on a work, not only are many AI generated "works" not actually protected by copyright but by selling licenses you're actually in breach of contract law and may end up owing the licensee software you don't have.

vasco•1h ago

And nothing happened and zero people got in trouble over it.

- Narrator

Muromec•11m ago

...So far

keizo•29m ago

Glad to see others dual wielding: “I used to think that the idea of having one model review the work of another was somewhat absurd—it felt weirdly superstitious. The problem is it really does work”

5701652400•19m ago

just a note. in most parts of the world 149.25 USD can cover utilities, water, and food for a month for 1 adult person or even a family.

xyzzy123•13m ago

In Sydney Australia its < 2 days of median rent.

mirekrusin•12m ago

In others it's pizza night for family or half a bill for sushi dinner, so what?

Muromec•12m ago

That's my electricity bill for a year, okay

There is a point of diminishing returns though; the issues suggested will get speculative, or point out comment unclarity, or "defense in depth". But I agree it’s somewhat annoying to rarely get clear pushback in terms of "no, this looks good enough to me, release it"

baq•1h ago

You didn’t do it enough. They stop finding bugs eventually. Also, different models can find different bugs (though they do find the same ones, too, which is good and expected). For best results you want to run multi model reviews in loops.

If you had multiple people look at your PRs multiple times on different days results would be very similar.

MallocVoidstar•1h ago

No, depending on the complexity of the issue models can be into loops, where they go "this is definitely an issue and must be fixed", and then the resulting fixed code gets "this is definitely an issue and must be fixed", and then the resulting fixed code has the original 'issue'.

memoriyato3•19m ago

yeah, happened to me: "A is very wrong, you should do B", and on the next fresh review loop "B is very wrong, you should do A"

typically this means there is some ambiguity in the specification, and the model flips between alternative interpretations

bfjvibybd6cuvu6•19m ago

That's a different kind of loop.

For a normal review loops you can ask the model to return with nothing found if nothing is found and not invent things and it will do a better job of exiting without anything found.

PunchyHamster•20m ago

I've had it find bug, I asked it to make test to trigger the bug, and then it figured out it's not a bug. It will absolutely do wish fulfilment

left-struck•15m ago

Yeah when these models find a bug i like to ask it to write a test that will fail if the bug is real and pass when the bug is solved.

It’s not perfect but usually it works pretty well, and I’ve had the model come back to me with oh actually the test passed, the bug doesn’t work exist

As a bonus, you’ve now got a test that can detect that bug if it comes up again.

embedding-shape•1h ago

> There is never a “Nothing found, good to go!”.

Like when you do recursive programming, have you tried providing more/better stop conditions? If you literally just say "Continue until there are no more issues" then it'll do just that, but if you scope it better, like "Only mention issues related to X, Y or that leads to Z" and so on, you'll get less noise and more focus on issues that actually matter (to you).

memoriyato3•15m ago

also helps adding negative conditions like "do not nitpick", or specific bad attractors that you see "do not investigate/report anything related to symlinks, they are not a concern"

onion2k•57m ago

If I keep running the same prompt, it will keep finding issues.

I've had the same experience, but whenever I've reviewed what it finds it's basically right. It's pedantic, and a lot of the problems aren't things I really care about, but they definitely are real problems.

I'm not sure you can blame the AI for always finding problems if a) you asked it to, and b) there are problems to find.

starquake•33m ago

I use Claude Code and one of the steps in my workflow is do a review loop until no issues are found and it never loops. So my experience is entirely different. Even if I say: fix all issues. So not only the critical issues.

Myzel394•25m ago

That's just plain wrong. The new models do not hallucinate as much as they used to (in my personal experience)

TripolitianFish•19m ago

> plain wrong > (in my experience) What are you even saying.

girvo•2m ago

That their vibes are more real than your vibes

imhoguy•18m ago

You need to create review skill and there define what "issue" or "good" are for you to limit sensitiviness. Otherwise you depend on model's random threshold or non of such then you get perfection chasing.

Anyway it will never match your judgemend completely unless you upload your brain dump into model.

gib444•10m ago

It's not eagerness to please (that's anthropomorphising), rather it's a desire to bill you more money/use more tokens

(The fixed prices are just temporary discounts)

Shadcn/UI now defaults to Base UI instead of Radix

sqlite-utils 4.0rc2, mostly written by Claude Fable (for about $149.25)

Command and Conquer Generals natively ported to macOS, iPhone, iPad using Fable

If you're a button, you have one job

Pandoc Lua Filters

GPT-5.5 Codex reasoning-token clustering may be leading to degraded performance

Megawatts by Microwave

Apocketlypse

Jellyfish can heal wounds in minutes. Scientists want their secrets

Google Books (or similar) all book scans – $200k bounty (2025)

Leaking YouTube creators' private videos

Moby Dick Workout

Artful Cats: Feline-Inspired Art and Artifacts

Web-based cryptography is always snake oil

Atomic Force Microscope high-speed video, stainless etching, bacteria, and more

Is The Economist Always Wrong?

Return of the Nigerian Prince Redux: Beware Book Club and Book Review Scams (2025)

Better Models: Worse Tools

The Log Is the Agent

EV Batteries Are Defying Expectations After Miles

Meta's Un-Stable Signature

Potential session/cache leakage between workspace instances or consumer accounts

My ASN Journey series (2024)

Zig: All Package Management Functionality Moved from Compiler to Build System

"Beyond the limit": Satellites and mirrors in space pose threat to the night sky

Reducing Assumptions, Exploding Your Code

Record-breaking solo rower Kelsey Pfendler arrives in Hawaii

What ORMs have taught me: just learn SQL (2014)

A Summer of Solar Cooking (2023)

The Preemptive Draw and Preemptive Grip in the Cash-in-Transit Sector

Shadcn/UI now defaults to Base UI instead of Radix

sqlite-utils 4.0rc2, mostly written by Claude Fable (for about $149.25)

Command and Conquer Generals natively ported to macOS, iPhone, iPad using Fable

If you're a button, you have one job

Pandoc Lua Filters

GPT-5.5 Codex reasoning-token clustering may be leading to degraded performance

Megawatts by Microwave

Apocketlypse

Jellyfish can heal wounds in minutes. Scientists want their secrets

Google Books (or similar) all book scans – $200k bounty (2025)

Leaking YouTube creators' private videos

Moby Dick Workout

Artful Cats: Feline-Inspired Art and Artifacts

Web-based cryptography is always snake oil

Atomic Force Microscope high-speed video, stainless etching, bacteria, and more

Is The Economist Always Wrong?

Return of the Nigerian Prince Redux: Beware Book Club and Book Review Scams (2025)

Better Models: Worse Tools

The Log Is the Agent

EV Batteries Are Defying Expectations After Miles

Meta's Un-Stable Signature

Potential session/cache leakage between workspace instances or consumer accounts

My ASN Journey series (2024)

Zig: All Package Management Functionality Moved from Compiler to Build System

"Beyond the limit": Satellites and mirrors in space pose threat to the night sky

Reducing Assumptions, Exploding Your Code

Record-breaking solo rower Kelsey Pfendler arrives in Hawaii

What ORMs have taught me: just learn SQL (2014)

A Summer of Solar Cooking (2023)

The Preemptive Draw and Preemptive Grip in the Cash-in-Transit Sector

sqlite-utils 4.0rc2, mostly written by Claude Fable (for about $149.25)

Comments