Three kinds of AI products work

https://www.seangoedecke.com/ai-products/

67•emschwartz•2h ago

Comments

8organicbits•2h ago

> It’s easy to verify changes by running tests or checking if the code compiles

This is actually a low bar, when the agent wrote those tests.

baxtr•1h ago

From the article:

> Summary

By my count, there are three successful types of language model product:

- Chatbots like ChatGPT, which are used by hundreds of millions of people for a huge variety of tasks

- Completions coding products like Copilot or Cursor Tab, which are very niche but easy to get immediate value from

- Agentic products like Claude Code, Codex, Cursor, and Copilot Agent mode, which have only really started working in the last six months

On top of that, there are two kinds of LLM-based product that don’t work yet but may soon:

- LLM-generated feeds

- Video games that are based on AI-generated content

Shebanator•1h ago

Author forgot about image, video, and music creation. These have all been quite successfully commercially, though maybe not as much artistically.

carsoon•26m ago

Recent articles seem only to mean LLMs when they reference AI. There are tons of commercial usecases for other models. Image Classification models, Image Generation models (traditionally difusion models, although some do use llm for image now), TTS models, Speach Transcription, translation models, AI driving models(autopilot), AI risk assessment for fraud, 3D structural engineering enhancement models.

With many of the good usecases of AI the end user doesn't know that ai exists and so it doesn't feel like there is AI present.

wongarsu•1h ago

This seems to be biased heavily towards products that look like an LLM. And yes, only a small number of those work. But that's because if your product is a thing I chat with, it immediately is in competition with ChatGPT/Claude/Grok/etc, leading to everything the article expressed. But those are hardly the only use cases for LLMs, let alone AI (whatever people nowadays mean by AI)

To name some of the obvious counter-examples, Grammarly and Deepl are both AI (and now partially LLM-based) products that don't fit any of the categories in the post, but seem pretty successful to me. Lots of successful applications of Vison-LLMs in document scanning too, whether you are deciphering handwritten text or just trying to get structured data out of pdfs.

themanmaran•30m ago

Perhaps I'm biased since we're in a document heavy industry, but I think the original post misses a lot of the non-tech company use cases. An insane percentage of human time is spent copy pasting things from documents.

dbreunig•12m ago

Agree. I bucket things into three piles:

1. Batch/Pipeline: Processing a ton of things, with no oversight. Document parsing, content moderation, etc.

2. AI Features: An app calls out to an AI-powered function. Grammarly might pass out a document for a summary, a CMS might want to generate tags for a post, etc.

3. Agents: AI manages the control flow.

So much of discussion online is heavily focused towards agents so that skews the macro view, but these patterns are pretty distinct.

bix6•1h ago

On agents it’s interesting but not surprising coding has seen so much initial success.

Personally I’m waiting for better O365 and SharePoint agents. I think there’s a lot of automation and helper potential there.

airstrike•1h ago

I'm building an opinionated take on this. It's shaping up nicely.

If you're a Rust developer reading this, interested in AI + GUI + Enterprise SaaS, and wants to talk, I'm building a team as we speak. E-mail in profile.

esseph•1h ago

At this point MS should probably sunset SharePoint and try again.

torlok•1h ago

So the only AI products that work is a chat bot you can talk to, or a chat bot that can perform tasks for you. Next thing you'll tell me is that the only businesses that work are ones where you can ask somebody to do something for you in exchange for money.

owenpalmer•1h ago

> Next thing you'll tell me is that the only businesses that work are ones where you can ask somebody to do something for you in exchange for money.

What other type of business is there?

hobs•1h ago

That is the joke.

gordonhart•46m ago

The best kind of businesses are the ones I don’t have to ask; they’ve already built a better product than what I would have asked for. That’s kinda the point the OP is making about chat vs a [good] dedicated interface.

ohyoutravel•38m ago

Realistically there are only four types of businesses writ large: tourism, food service, railroads, and sales. People building AI-based products should focus on those verticals.

lelandbatey•16m ago

Really only two kinds:

- Energy generation and

- Expending energy to convince the folks generating energy to give you money for activating their neurons (food service, entertainment, tourism, transportation, sales).

Any other fun ways to compartmentalize an economy?

tehjoker•10m ago

Not shown: any activity involved in production, science, or healthcare just off the top of my head

alickz•29m ago

The only GUI products that work are GUIs that you can interface with, or that perform tasks for you

Maybe the real value of AI, particularly LLMs, is in the interface it provides for other things, and not in the AI itself

What if AI isn't the _thing_? What if it's the thing that gets us _to_ the thing?

theptip•1h ago

I think this is kind of like saying “Only three kinds of internet products work, SaaS, webpages, and mobile apps”

At the level of granularity selected, maybe true. But too coarse to make any interesting distinctions or predictions.

Aldipower•1h ago

In my current project the agent (GPT-5) isn't helpful at all. Damn thing, lying all the time to me.

kevin_thibedeau•1h ago

They're idiot savants. Use them for their strengths. Know their weaknesses.

Aldipower•51m ago

So, what are their strengths then? I've fed it with a detailed, very well documented and typed API description. Asking to construct me some not too hard code snippets based on that. GPT-5 then pretend to do the right thing, but actually is creating meaningless nonsense out of it. Even after I tried to reiterate and refine my tasks. Every junior dev is waaay better.

ohyoutravel•35m ago

Parsing a thousand line stack trace and telling me what the problem was. Writing regexes. Spitting out ffmpeg commands.

skerit•1h ago

The article claims Claude Sonnet 3.5 was released less than 9 months ago, but this is wrong.

Claude 3.5 was released in june 2024.

Maybe he has been writing this article for a while, maybe he meant Claude Code or Claude 4.0

simonw•1h ago

He meant Sonnet 3.7 which was released on the same day as Claude Code, Feb 24th 2025: https://www.anthropic.com/news/claude-3-7-sonnet

With hindsight, given that Claude Code turned into a billion dollar precut category, it was a bit of a miss bundling those two announcements together like that!

SirensOfTitan•1h ago

I’ve been working on a learning / incremental reading tool for a while, and I’ve found LLM and LLM adjacent tech useful, but as ways of resolving ambiguity within a product that doesn’t otherwise show any use of LLM. It’s like LLM-as-parser.

owenpalmer•1h ago

Is there somewhere I can try the tool out? I'm interested in that kind of thing.

zkmon•1h ago

>> in five years time most internet users will spend a big part of their day scrolling an AI-generated feed.

Yep. Looking forward to the future where you can eat plastic pop-corn while watching the AI-generated video feeds.

pixl97•52m ago

Why 5 years, I'm pretty sure we're there today.

vorticalbox•36m ago

By Ai generated feeds do you mean a feed that is just full of AI posts or an AI generating a feed to one can scroll?

koliber•1h ago

A few more seem to work as well, because I've used them and found them valuable

- human language translation

- summarization

- basic content generation

- spoken language transcription

loloquwowndueo•1h ago

> basic content generation

Dunno, man, I can spot ai-generated content a mile away, it tends to be incredibly useless so once I spot it, I’ll run in the opposite direction.

HelloUsername•1h ago

> once I spot it

Exactly; pretty sure you've seen media or read text that you thought was human created..

carsoon•36m ago

You spot bad ai content. Since there is no button that will tell you if something was Ai generated you never know if what you read was/wasn't.

koliber•21m ago

I hate what LLM spit out and would never accept the whole output verbatim.

I love how they occasionally come up with a turn of phrase, a thought path, or surprising perspective. I work with them iteratively to brainstorm, transform, and crate compose content that I incorporate into my own work.

Regarding spotting AI-generate content, I was once accused of posting AI-generated content where I bona-fide typed every single letter myself without as much as glancing at an LLM. People's confidence in spotting AI content will vary and err on fake-positives and fake-negatives too. My kids now think all CG movies are AI generated, even the ones that pre-date image and video gen. They're pretty sure it's AI though.

thewebguyd•1h ago

I've also found LLMs helpful for breaking down user requests into a technical spec or even just clarifying requests.

I make a lot of business reporting where I work and dashboards for various things. When I get user requests for data, it's rarely clear or well thought out. They struggle with articulating their actual requirements and usually leads to a lot of back and forth emails or meetings and just delays things further.

I now paste their initial request emails into an LLM and tell it "This is what I think they are trying to accomplish, interpret their request into defined business metrics" or something similar and it does a pretty good job and saves a ton of the back and forth. I can usually then feed it a sample json response or our database schema and have it also make something quick with streamlit.

It's saved me (and the users) a ton of time and headaches of me trying to coerce more and more information from them, the LLMs have been decent enough at interpreting what they're actually asking for.

I'd love to see a day where I can hook them up with RO access to a data warehouse or something and make a self-service tool that users can prompt and it spits out a streamlit site or something similar for them.

notatoad•1h ago

> summarization

can you point me to a useful example of this? i see websites including ai-generated summaries all the time, but i've yet to see one that is actually useful and it seems like the product being sold here is simply "ai", not the summary itself - that is, companies and product managers are under pressure to implement some sort of AI, and sticking summaries in places is a way for them to fill that requirement and be able to say "yes, we have AI in our product"

koliber•26m ago

I sometimes get contracts, NDAs, or terms and conditions which normally I would automatically accept because they are low stakes and I don't have time to read them. At best I would skim them.

Now I pass them through an LLM and ask them to point out interesting, unconventional, or surprising things, and to summarize the document in a few bullet points. They're quite good at this, and I am can use what I discover later in my relationship with the counterparty in various ways.

I also use it to "summarize" a large log output and point out the interesting bits that are relevant to my inquiry.

Another use case is meeting notes. I use fireflies.ai for some of my meetings and the summaries are decent.

I guess summarization might not be the right word for all the cases, but it deals with going through the hay stack to find the needle.

gregates•8m ago

Do you go through the haystack yourself first, find the needle, and then use that to validate your hypothesis that the AI is good at accomplishing that task (because it usually finds the same needle)? If not, how do you know they're good at the task?

My own experience using LLMs is that we frequently disagree about which points are crucial and which can be omitted from a summary.

notatoad•5m ago

for your first one, if you're just feeding docs into a chatbot prompt and asking for a summary, i think that matches what the article would call a "chatbot product" rather than a summarization product.

fireflies.ai is interesting though, that's more what i was looking for. i've used the meeting summary tool in google meet before and it was hilariously bad, it's good to hear that there are some companies out there having success with this product type.

renewiltord•1h ago

The classic problem that online commenters face is that they only know products that are on Hacker News and Reddit. And I get why. Not being plugged into anything the only way to get information is social media so you only know social media.

E.g. https://www.thomsonreuters.com/en/press-releases/2025/septem...

B2B AI company, 2 years in sold for hundreds of millions, not an agent, chatbot, or completion. Do you know it exists? No. You only read Hacker News. How could you know?

Dilettante_•40m ago

  Additive’s GenAI-native platform streamlines the repetitive, time-consuming task of ingesting and parsing pass-through entity documents

From TFA:

  There’s another kind of agent that isn’t about coding: the research agent. LLMs are particularly good at tasks like “skim through ten pages of search results” or “keyword search this giant dataset for any information on a particular topic”.

PopAlongKid•38m ago

>The company’s [Additive] technology automates complex tasks such as extracting footnotes from K-1s, K-3s, and related forms, so every staff member can become a reviewer and complete work that used to take weeks in a matter of hours.

Any tax professional who takes weeks to enter footnote info from a K-1 form into their professional tax prep software is probably just as bad at other job-related tasks and either needs more training or to find another job.

larodi•1h ago

More than three kinds are then actually listed in the article

shermantanktop•1h ago

Formatting did not help. Three kinds, but then subheadings in the same size font, and then here come two more kinds, plus a side journey into various topics.

adammarples•1h ago

>I think there are serious ethical problems with this kind of product.

Unless there are serious ethical problems with people generating arbitrary text ie. Writing - then no there isn't

bob1029•1h ago

Chatbot is the only one I agree with (human in the loop).

Agents are essentially the chatbot, but without the human in the loop. Chatbot without human in the loop is a slop factory. Things like "multi-agent systems" are a clever ploy to get you to burn tokens and ideally justify all this madness.

Copilot/completion does not work in business terms for me. It looks like it works and it might feel like it's working in some localized technical sense, but it does not actually work on strategic timescales with complex domains in such a way that a customer would eventually be willing to pay you money for the results. The hypothesis that work/jobs will be created due to sloppy AI is proving itself out very quickly. I think "completion" tools like classic IntelliSense are still at the peak of efficiency.

mrweasel•1h ago

Chatbot in many environment simply doesn't work, because we won't let them and if we did, they'd be agents. Here I'm mostly thinking in terms of things like customer service chats. A chatbot that can't reach into other systems are essentially only useful for role playing.

The copilot/completion thing also doesn't work for me. I have no doubt that a lot of developers are having a lot of benefits from the coding LLMs, but I can't make them work.

I think one glaring obvious missing kind of AI is medical image recognition, which is already deployed and working in many scenarios.

happyopossum•1h ago

Very myopic view here - agents are turning out useful output in many fields outside of coding..

Xiol•1h ago

Such as?

carsoon•32m ago

Legal seems to be a big usecase for AI. I think more for simplification and classification versus generation though.

ZeroConcerns•1h ago

Well, the elephant in the room here is that the generic AI product that is being promised, i.e. "you get into your car in the morning, and on your drive to the office dictate your requirements for one of the apps that is going to guarantee your retirement, in order to find it completely done, rolled out to all the app stores and making money already once you arrive" isn't happening anytime soon, if ever, yet everyone pretty much is acting like it's already there.

Can "AI" in its current form deliver value? Sure, and it absolutely does but it's more in the form of "several hours saved per FTE per week" than "several FTEs saved per week".

The way I currently frame it: I have a Claude 1/2-way-to-the-Max subscription that costs me 90 Euros a month. And it's absolutely worth it! Just today, it helped me debug and enhance my iSCSI target in new and novel ways. But is it worth double the price? Not sure yet...

madeofpalk•57m ago

The other part to this is that LLMs as a technology definitely has some value as a foundation to build features/products on other than chat bots. But unclear to be whether that value can sustain current valuations.

Is a better de-noisier algorithm in Adobe Lightroom worth $500 billion?

ansgri•38m ago

A bit off-topic, but denoise in LR is like 3 years behind the purpose-built products like Topaz, so a bad example. They've added any ML-based denoise to it when, like a year ago?

ZeroConcerns•27m ago

> Is a better de-noisier algorithm in Adobe Lightroom worth $500 billion?

No.

But: a tool that allows me to de-noise some images, just by uploading a few samples and describing what I want to change, just might be? Even more so, possibly, if I can also upload a desired result and let the "AI" work on things until it matches that?

But also: cool, that saves me several hours per week! Not: oh, wow, that means I can get rid of this entire department...

pixl97•54m ago

Skeptics always like to toss in 'if ever' as some form of enlightenment they they are aware of some fundamental limitation of the universe only they are privy to.

mzajc•29m ago

Of the universe, perhaps, but humans certainly are a limiting factor here. Assuming we get this technology someday, why would one buy your software when the mere description of its functionality allows one to recreate it effortlessly?

adastra22•48m ago

Agentic tools is already delivering an increase in productivity equivalent to many FTEs. I say this as someone in the position of having to hire coders and needing far fewer than we otherwise would have.

ZeroConcerns•38m ago

Well, yeah, as they say on Wikipedia: {{Citation Needed}}

Can AI-as-it-currently-is save FTEs? Sure: but, again, there's a template for that: {{How Many}} -- 1% of your org chart? 10%? In my case it's around 0.5% right now.

Or, to reframe it a bit: can AI pay Sam A's salary? Sure! His stock options? Doubtful. His future plans? Heck nah!

vorticalbox•44m ago

I use mongo at work and LLM helped me find index issues.

Feeding it the explain, query and current indexes it can quickly tell what it was doing and why it was slow.

I saved a bunch time as I didn’t have to read large amounts of json from explain to see what is going on.

websap•1h ago

> Users simply do not want to type out “hey, can you increase the font size for me” when they could simply hit “ctrl-plus” or click a single button3.

I would def challenge this. “Turn off private relay”, “send this photo to X”, “Add a pit stop at a coffee shop along the way” are all voice commands I would love to use

chrisweekly•39m ago

Yes, this! esp the last one. Finding coffee shop / restaurant options ALONG THE WAY seems like it should've been solved years ago. Scenario: while driving, "want to eat in about an hour, must have vegetarian options, don't add more than 10m extra drive time" and get a shortlist to pick from.

Mikhail_Edoshin•22m ago

Old Apple Newton had a feature, I don't remember how they called it, but on any screen you could write "please", and then describe what to do, e. g. using one of their examples: "please fax this to Bob". And it worked. Internally it was a rather simple keyword match plus access to data, such as the system address book. New applications could register their own names for actions and relevant dictionaries.

levocardia•50m ago

Very obviously missing the mundane agentic work. I think the following things are basically already solved, and are just waiting for the right harness:

- Call this government service center, wait on hold for 45 minutes, then when they finally answer, tell them to reactivate my insurance marketplace account that got wrongly deleted.

- Find a good dentist within 2mi from my house, call them to make sure they take my insurance, and book an appointment sometime in the next two weeks no earlier than 11am

- Figure out how I'm going to get from Baltimore to Boston next Thursday, here's $100 and if you need more, ask me.

- I want to apply a posterizing filter in photoshop, take control of my mouse for the next 10sec and show me where it is in the menu

- Call that gym I never go to and cancel my membership

input_sh•22m ago

Basically already solved = you've never used it for any of those purposes and have no idea if or how well would they work?

irq-1•6m ago

> - Find a good dentist within 2mi from my house, call them to make sure they take my insurance, and book an appointment sometime in the next two weeks no earlier than 11am

The web caused dentists to make websites, but they don't post their appointment calendar; they don't have to.

Will AI looking for appointments cause businesses to post live, structured data (like calendars)? The complexity of scheduling and multiple calendars is perfect for an AI solution. What other AI uses and interactive systems will come soon?

- Accounting: generate balance sheets, audit in real-time, and have human accountants double check it (rather than doing)

- Correspondence: create and send notifications of all sorts, and consume them

- Purchase selection: shifting the lack of knowledge about products in the customers favor

- Forms: doing taxes or applying for a visa

Mikhail_Edoshin•32m ago

AI would make a very good librarian. It doesn't understand, only comprehends, but in this case it is enough.

Thing is, there is no library for it to work in.

theonething•21m ago

seem like data analysis would be a good one. Company ingests massive amounts of disparate business data. Ask AI to clean and normalize it, visualize it and give recommendations.

kken•19m ago

Well, considering that the long term idea is to have AGI, general intelligence, it seems that the goal as also to only have a single product in the end.

There may be different ways to access it, but the product is always the same.

EagnaIonat•4m ago

> This doesn’t work well because savvy users can manipulate the chatbot into calling tools. So you can never give a support chatbot real support powers like “refund this customer”, ...

I would disagree with this.

Part of how security is handled in current agentic systems is to not let the LLM have any access to how the underlying tools work. At best it's like hitting "inspect" in your browser and changing the web page.

Of course, that assumes that the agentic chatbot has been built correctly.

The fate of "small" open source

Pennies Are Trash Now

iPhone users can now add US passport info to their digital wallets

Git AI Assistant-Generate PR descriptions using your GitHub Copilot subscription

Made an AI assistant that lives in your Messages app

Chromium Hardening Guide

CUDA-Q Back Ends: Quantum Hardware (QPU)

Britain to announce 'most significant' change to asylum rules in years

Toyota promises 40-year solid-state EV batteries by 2028

Near-Perfect Broadband Quantum Memory Enabled by Spin-Wave Compaction

History's epic theories of what causes Northern Lights

Don't Post Passive-Aggressive Webpages

China's CO2 emissions flat or falling for past 18 months, analysis finds

FreeTube – The Private YouTube Client

The Advent of Compiler Optimisations [video]

Which Class Is Better?

Show HN: Artifactguesser

Bubble or Nothing

Brian Moore Homepage

Community as Minimum Viable World Building

Trump buys at least $82M in bonds since late August, financial disclosures show

What if you don't need MCP at all?

MongoDB brings Hibernate ORM support to developers

Everything You Always Wanted to Know About Mathematics [pdf]

CUDA Ontology

AI is killing privacy. We can't let that happen

Tool2agent – build guardrails for LLM agents

Ancient rock circles in California park defy explanation

How to Scale Distributed Product Teams from 10 to 100

Decoding Leibniz Notation (2024)