Three kinds of AI products work

https://www.seangoedecke.com/ai-products/
67•emschwartz•2h ago

Comments

organicbits•2h ago
> It’s easy to verify changes by running tests or checking if the code compiles

This is actually a low bar, when the agent wrote those tests.

baxtr•1h ago
From the article:

> Summary

By my count, there are three successful types of language model product:

- Chatbots like ChatGPT, which are used by hundreds of millions of people for a huge variety of tasks

- Completions coding products like Copilot or Cursor Tab, which are very niche but easy to get immediate value from

- Agentic products like Claude Code, Codex, Cursor, and Copilot Agent mode, which have only really started working in the last six months

On top of that, there are two kinds of LLM-based product that don’t work yet but may soon:

- LLM-generated feeds

- Video games that are based on AI-generated content

Shebanator•1h ago
Author forgot about image, video, and music creation. These have all been quite successful commercially, though maybe not as much artistically.
carsoon•26m ago
Recent articles seem to mean only LLMs when they reference AI. There are tons of commercial use cases for other models: image classification models, image generation models (traditionally diffusion models, although some now use LLMs for images), TTS models, speech transcription, translation models, AI driving models (autopilot), AI risk assessment for fraud, and 3D structural engineering enhancement models.

With many of the good use cases of AI, the end user doesn't know that AI is involved, so it doesn't feel like there is AI present.

wongarsu•1h ago
This seems to be biased heavily towards products that look like an LLM. And yes, only a small number of those work. But that's because if your product is a thing I chat with, it immediately is in competition with ChatGPT/Claude/Grok/etc, leading to everything the article expressed. But those are hardly the only use cases for LLMs, let alone AI (whatever people nowadays mean by AI)

To name some of the obvious counter-examples, Grammarly and DeepL are both AI (and now partially LLM-based) products that don't fit any of the categories in the post, but seem pretty successful to me. Lots of successful applications of Vision-LLMs in document scanning too, whether you are deciphering handwritten text or just trying to get structured data out of PDFs.

themanmaran•30m ago
Perhaps I'm biased since we're in a document-heavy industry, but I think the original post misses a lot of the non-tech company use cases. An insane percentage of human time is spent copy-pasting things from documents.
dbreunig•12m ago
Agree. I bucket things into three piles:

1. Batch/Pipeline: Processing a ton of things, with no oversight. Document parsing, content moderation, etc.

2. AI Features: An app calls out to an AI-powered function. Grammarly might send a document out for a summary, a CMS might want to generate tags for a post, etc.

3. Agents: AI manages the control flow.

So much of the discussion online is focused on agents, which skews the macro view, but these patterns are pretty distinct.
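A rough sketch of how the three piles differ, mostly in who owns the control flow. Everything here is made up for illustration: `fake_model` is a stand-in for a real model call, and `batch`, `suggest_tags`, and `agent` are hypothetical names, not anyone's actual API.

```python
from typing import Callable

# Hypothetical stand-in for a model call; a real system would call an LLM here.
def fake_model(prompt: str) -> str:
    if prompt.startswith("tags:"):
        return "ai,products"
    if prompt.startswith("next:"):
        # Pretend the model asks for one tool call, then declares it is done.
        return "done" if "lookup ->" in prompt else "lookup"
    return prompt.upper()

# 1. Batch/pipeline: the program loops over inputs with no oversight.
def batch(docs: list[str], model: Callable[[str], str]) -> list[str]:
    return [model(doc) for doc in docs]

# 2. AI feature: ordinary app code calls the model for one narrow job.
def suggest_tags(post: str, model: Callable[[str], str]) -> list[str]:
    return model("tags:" + post).split(",")

# 3. Agent: the model's output decides which step runs next.
def agent(goal: str, tools: dict[str, Callable[[], str]],
          model: Callable[[str], str], max_steps: int = 5) -> list[str]:
    history: list[str] = []
    for _ in range(max_steps):
        choice = model("next:" + goal + " " + " ".join(history))
        if choice == "done" or choice not in tools:
            break
        history.append(f"{choice} -> {tools[choice]()}")
    return history
```

In the first two, the surrounding code fixes the sequence of steps and the model only fills in a value. In the third, the loop just executes whatever the model picks, capped by `max_steps`.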

bix6•1h ago
On agents, it’s interesting but not surprising that coding has seen so much initial success.

Personally I’m waiting for better O365 and SharePoint agents. I think there’s a lot of automation and helper potential there.

airstrike•1h ago
I'm building an opinionated take on this. It's shaping up nicely.

If you're a Rust developer reading this, interested in AI + GUI + Enterprise SaaS, and want to talk, I'm building a team as we speak. E-mail in profile.

esseph•1h ago
At this point MS should probably sunset SharePoint and try again.
torlok•1h ago
So the only AI products that work are a chat bot you can talk to, or a chat bot that can perform tasks for you. Next thing you'll tell me is that the only businesses that work are ones where you can ask somebody to do something for you in exchange for money.
owenpalmer•1h ago
> Next thing you'll tell me is that the only businesses that work are ones where you can ask somebody to do something for you in exchange for money.

What other type of business is there?

hobs•1h ago
That is the joke.
gordonhart•46m ago
The best kind of businesses are the ones I don’t have to ask; they’ve already built a better product than what I would have asked for. That’s kinda the point the OP is making about chat vs a [good] dedicated interface.
ohyoutravel•38m ago
Realistically there are only four types of businesses writ large: tourism, food service, railroads, and sales. People building AI-based products should focus on those verticals.
lelandbatey•16m ago
Really only two kinds:

- Energy generation and

- Expending energy to convince the folks generating energy to give you money for activating their neurons (food service, entertainment, tourism, transportation, sales).

Any other fun ways to compartmentalize an economy?

tehjoker•10m ago
Not shown: any activity involved in production, science, or healthcare just off the top of my head
alickz•29m ago
The only GUI products that work are GUIs that you can interface with, or that perform tasks for you

Maybe the real value of AI, particularly LLMs, is in the interface it provides for other things, and not in the AI itself

What if AI isn't the _thing_? What if it's the thing that gets us _to_ the thing?

theptip•1h ago
I think this is kind of like saying “Only three kinds of internet products work, SaaS, webpages, and mobile apps”

At the level of granularity selected, maybe true. But too coarse to make any interesting distinctions or predictions.

Aldipower•1h ago
In my current project the agent (GPT-5) isn't helpful at all. Damn thing, lying all the time to me.
kevin_thibedeau•1h ago
They're idiot savants. Use them for their strengths. Know their weaknesses.
Aldipower•51m ago
So, what are their strengths then? I've fed it a detailed, very well documented and typed API description and asked it to construct some not-too-hard code snippets based on that. GPT-5 then pretends to do the right thing, but is actually creating meaningless nonsense out of it, even after I tried to reiterate and refine my tasks. Every junior dev is waaay better.
ohyoutravel•35m ago
Parsing a thousand line stack trace and telling me what the problem was. Writing regexes. Spitting out ffmpeg commands.
skerit•1h ago
The article claims Claude Sonnet 3.5 was released less than 9 months ago, but this is wrong.

Claude 3.5 was released in June 2024.

Maybe he has been writing this article for a while, maybe he meant Claude Code or Claude 4.0

simonw•1h ago
He meant Sonnet 3.7 which was released on the same day as Claude Code, Feb 24th 2025: https://www.anthropic.com/news/claude-3-7-sonnet

With hindsight, given that Claude Code turned into a billion dollar product category, it was a bit of a miss bundling those two announcements together like that!

SirensOfTitan•1h ago
I’ve been working on a learning / incremental reading tool for a while, and I’ve found LLM and LLM adjacent tech useful, but as ways of resolving ambiguity within a product that doesn’t otherwise show any use of LLM. It’s like LLM-as-parser.
owenpalmer•1h ago
Is there somewhere I can try the tool out? I'm interested in that kind of thing.
zkmon•1h ago
>> in five years time most internet users will spend a big part of their day scrolling an AI-generated feed.

Yep. Looking forward to the future where you can eat plastic pop-corn while watching the AI-generated video feeds.

pixl97•52m ago
Why 5 years, I'm pretty sure we're there today.
vorticalbox•36m ago
By AI-generated feeds, do you mean a feed that is just full of AI posts, or an AI generating a feed that one can scroll?
koliber•1h ago
A few more seem to work as well, because I've used them and found them valuable:

- human language translation

- summarization

- basic content generation

- spoken language transcription

loloquwowndueo•1h ago
> basic content generation

Dunno, man, I can spot ai-generated content a mile away, it tends to be incredibly useless so once I spot it, I’ll run in the opposite direction.

HelloUsername•1h ago
> once I spot it

Exactly; pretty sure you've seen media or read text that you thought was human created..

carsoon•36m ago
You spot bad AI content. Since there is no button that will tell you if something was AI generated, you never know if what you read was or wasn't.
koliber•21m ago
I hate what LLMs spit out and would never accept the whole output verbatim.

I love how they occasionally come up with a turn of phrase, a thought path, or surprising perspective. I work with them iteratively to brainstorm, transform, and compose content that I incorporate into my own work.

Regarding spotting AI-generated content, I was once accused of posting AI-generated content when I had bona fide typed every single letter myself without so much as glancing at an LLM. People's confidence in spotting AI content will vary, and they will err on false positives and false negatives too. My kids now think all CG movies are AI generated, even the ones that pre-date image and video gen. They're pretty sure it's AI though.

thewebguyd•1h ago
I've also found LLMs helpful for breaking down user requests into a technical spec or even just clarifying requests.

I do a lot of business reporting where I work, plus dashboards for various things. When I get user requests for data, they're rarely clear or well thought out. Users struggle to articulate their actual requirements, which usually leads to a lot of back-and-forth emails or meetings and just delays things further.

I now paste their initial request emails into an LLM and tell it "This is what I think they are trying to accomplish, interpret their request into defined business metrics" or something similar, and it does a pretty good job and saves a ton of the back and forth. I can usually then feed it a sample JSON response or our database schema and have it also make something quick with Streamlit.

It's saved me (and the users) a ton of time and the headache of trying to coerce more and more information from them. The LLMs have been decent enough at interpreting what they're actually asking for.

I'd love to see a day where I can hook them up with RO access to a data warehouse or something and make a self-service tool that users can prompt and it spits out a streamlit site or something similar for them.

notatoad•1h ago
> summarization

can you point me to a useful example of this? i see websites including ai-generated summaries all the time, but i've yet to see one that is actually useful and it seems like the product being sold here is simply "ai", not the summary itself - that is, companies and product managers are under pressure to implement some sort of AI, and sticking summaries in places is a way for them to fill that requirement and be able to say "yes, we have AI in our product"

koliber•26m ago
I sometimes get contracts, NDAs, or terms and conditions which normally I would automatically accept because they are low stakes and I don't have time to read them. At best I would skim them.

Now I pass them through an LLM and ask it to point out interesting, unconventional, or surprising things, and to summarize the document in a few bullet points. LLMs are quite good at this, and I can use what I discover later in my relationship with the counterparty in various ways.

I also use it to "summarize" a large log output and point out the interesting bits that are relevant to my inquiry.

Another use case is meeting notes. I use fireflies.ai for some of my meetings and the summaries are decent.

I guess summarization might not be the right word for all the cases, but it deals with going through the haystack to find the needle.

gregates•8m ago
Do you go through the haystack yourself first, find the needle, and then use that to validate your hypothesis that the AI is good at accomplishing that task (because it usually finds the same needle)? If not, how do you know they're good at the task?

My own experience using LLMs is that we frequently disagree about which points are crucial and which can be omitted from a summary.

notatoad•5m ago
for your first one, if you're just feeding docs into a chatbot prompt and asking for a summary, i think that matches what the article would call a "chatbot product" rather than a summarization product.

fireflies.ai is interesting though, that's more what i was looking for. i've used the meeting summary tool in google meet before and it was hilariously bad, it's good to hear that there are some companies out there having success with this product type.

renewiltord•1h ago
The classic problem that online commenters face is that they only know products that are on Hacker News and Reddit. And I get why. If you're not plugged into anything, the only way to get information is social media, so you only know social media.

E.g. https://www.thomsonreuters.com/en/press-releases/2025/septem...

B2B AI company, 2 years in, sold for hundreds of millions, not an agent, chatbot, or completion. Do you know it exists? No. You only read Hacker News. How could you know?

Dilettante_•40m ago

  Additive’s GenAI-native platform streamlines the repetitive, time-consuming task of ingesting and parsing pass-through entity documents
From TFA:

  There’s another kind of agent that isn’t about coding: the research agent. LLMs are particularly good at tasks like “skim through ten pages of search results” or “keyword search this giant dataset for any information on a particular topic”.
PopAlongKid•38m ago
>The company’s [Additive] technology automates complex tasks such as extracting footnotes from K-1s, K-3s, and related forms, so every staff member can become a reviewer and complete work that used to take weeks in a matter of hours.

Any tax professional who takes weeks to enter footnote info from a K-1 form into their professional tax prep software is probably just as bad at other job-related tasks and either needs more training or to find another job.

larodi•1h ago
More than three kinds are then actually listed in the article
shermantanktop•1h ago
Formatting did not help. Three kinds, but then subheadings in the same size font, and then here come two more kinds, plus a side journey into various topics.
adammarples•1h ago
>I think there are serious ethical problems with this kind of product.

Unless there are serious ethical problems with people generating arbitrary text, i.e. writing, then no, there aren't.

bob1029•1h ago
Chatbot is the only one I agree with (human in the loop).

Agents are essentially the chatbot, but without the human in the loop. Chatbot without human in the loop is a slop factory. Things like "multi-agent systems" are a clever ploy to get you to burn tokens and ideally justify all this madness.

Copilot/completion does not work in business terms for me. It looks like it works and it might feel like it's working in some localized technical sense, but it does not actually work on strategic timescales with complex domains in such a way that a customer would eventually be willing to pay you money for the results. The hypothesis that work/jobs will be created due to sloppy AI is proving itself out very quickly. I think "completion" tools like classic IntelliSense are still at the peak of efficiency.

mrweasel•1h ago
Chatbots in many environments simply don't work, because we won't let them, and if we did, they'd be agents. Here I'm mostly thinking in terms of things like customer service chats. A chatbot that can't reach into other systems is essentially only useful for role playing.

The copilot/completion thing also doesn't work for me. I have no doubt that a lot of developers are having a lot of benefits from the coding LLMs, but I can't make them work.

I think one glaringly obvious missing kind of AI is medical image recognition, which is already deployed and working in many scenarios.

happyopossum•1h ago
Very myopic view here - agents are turning out useful output in many fields outside of coding..
Xiol•1h ago
Such as?
carsoon•32m ago
Legal seems to be a big use case for AI. I think more for simplification and classification versus generation though.
ZeroConcerns•1h ago
Well, the elephant in the room here is that the generic AI product that is being promised, i.e. "you get into your car in the morning, and on your drive to the office dictate your requirements for one of the apps that is going to guarantee your retirement, in order to find it completely done, rolled out to all the app stores and making money already once you arrive" isn't happening anytime soon, if ever, yet everyone pretty much is acting like it's already there.

Can "AI" in its current form deliver value? Sure, and it absolutely does but it's more in the form of "several hours saved per FTE per week" than "several FTEs saved per week".

The way I currently frame it: I have a Claude 1/2-way-to-the-Max subscription that costs me 90 Euros a month. And it's absolutely worth it! Just today, it helped me debug and enhance my iSCSI target in new and novel ways. But is it worth double the price? Not sure yet...

madeofpalk•57m ago
The other part to this is that LLMs as a technology definitely have some value as a foundation to build features/products on other than chat bots. But it's unclear to me whether that value can sustain current valuations.

Is a better de-noisier algorithm in Adobe Lightroom worth $500 billion?

ansgri•38m ago
A bit off-topic, but denoise in LR is like 3 years behind the purpose-built products like Topaz, so a bad example. They've added any ML-based denoise to it when, like a year ago?
ZeroConcerns•27m ago
> Is a better de-noisier algorithm in Adobe Lightroom worth $500 billion?

No.

But: a tool that allows me to de-noise some images, just by uploading a few samples and describing what I want to change, just might be? Even more so, possibly, if I can also upload a desired result and let the "AI" work on things until it matches that?

But also: cool, that saves me several hours per week! Not: oh, wow, that means I can get rid of this entire department...

pixl97•54m ago
Skeptics always like to toss in 'if ever' as some form of enlightenment, as if they are aware of some fundamental limitation of the universe only they are privy to.
mzajc•29m ago
Of the universe, perhaps, but humans certainly are a limiting factor here. Assuming we get this technology someday, why would one buy your software when the mere description of its functionality allows one to recreate it effortlessly?
adastra22•48m ago
Agentic tools are already delivering an increase in productivity equivalent to many FTEs. I say this as someone in the position of having to hire coders and needing far fewer than we otherwise would have.
ZeroConcerns•38m ago
Well, yeah, as they say on Wikipedia: {{Citation Needed}}

Can AI-as-it-currently-is save FTEs? Sure: but, again, there's a template for that: {{How Many}} -- 1% of your org chart? 10%? In my case it's around 0.5% right now.

Or, to reframe it a bit: can AI pay Sam A's salary? Sure! His stock options? Doubtful. His future plans? Heck nah!

vorticalbox•44m ago
I use Mongo at work and an LLM helped me find index issues.

Fed the explain output, the query, and the current indexes, it can quickly tell what the query was doing and why it was slow.

I saved a bunch of time as I didn’t have to read large amounts of JSON from explain to see what is going on.
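For a sense of what that JSON-reading involves, here is a minimal sketch that walks an explain document and flags the usual suspects. It assumes the classic `queryPlanner.winningPlan` shape with nested `inputStage` / `inputStages` fields; the helper names and the sample plan are made up for illustration, and real output has many more fields.

```python
def plan_stages(plan: dict) -> list[str]:
    """Collect stage names from a nested explain plan, outermost first."""
    stages = [plan.get("stage", "UNKNOWN")]
    children = list(plan.get("inputStages", []))
    if "inputStage" in plan:
        children.append(plan["inputStage"])
    for child in children:
        stages.extend(plan_stages(child))
    return stages

def diagnose(explain: dict) -> str:
    """Return a one-line guess at why a query might be slow."""
    stages = plan_stages(explain["queryPlanner"]["winningPlan"])
    if "COLLSCAN" in stages:
        return "full collection scan: no usable index for this filter"
    if "SORT" in stages:
        return "in-memory sort: the index does not cover the sort order"
    return "ok: " + " > ".join(stages)

# Hand-written sample, trimmed to the fields used above.
sample = {"queryPlanner": {"winningPlan": {
    "stage": "SORT",
    "inputStage": {"stage": "COLLSCAN"},
}}}
print(diagnose(sample))
```

This only covers the mechanical part. The LLM's value in the comment above is in relating the plan back to the query and the existing indexes, which a stage walker like this does not attempt.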

websap•1h ago
> Users simply do not want to type out “hey, can you increase the font size for me” when they could simply hit “ctrl-plus” or click a single button.

I would def challenge this. “Turn off private relay”, “send this photo to X”, “Add a pit stop at a coffee shop along the way” are all voice commands I would love to use

chrisweekly•39m ago
Yes, this! esp the last one. Finding coffee shop / restaurant options ALONG THE WAY seems like it should've been solved years ago. Scenario: while driving, "want to eat in about an hour, must have vegetarian options, don't add more than 10m extra drive time" and get a shortlist to pick from.
Mikhail_Edoshin•22m ago
Old Apple Newton had a feature, I don't remember what they called it, but on any screen you could write "please" and then describe what to do, e.g. using one of their examples: "please fax this to Bob". And it worked. Internally it was a rather simple keyword match plus access to data, such as the system address book. New applications could register their own names for actions and relevant dictionaries.
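A minimal sketch of that idea, not the Newton's actual implementation: a keyword match over the request, a lookup in an address book, and a registry that "applications" can add verbs to. All names and data here (`ADDRESS_BOOK`, `register`, `assist`, the phone number) are invented for illustration.

```python
from typing import Callable, Optional

ADDRESS_BOOK = {"bob": "555-0100"}              # made-up contact data
ACTIONS: dict[str, Callable[[str], str]] = {}   # verb -> handler

def register(verb: str):
    """Let an 'application' register a handler for its own verb."""
    def wrap(handler: Callable[[str], str]) -> Callable[[str], str]:
        ACTIONS[verb] = handler
        return handler
    return wrap

@register("fax")
def fax(number: str) -> str:
    return f"faxing current document to {number}"

@register("call")
def call(number: str) -> str:
    return f"dialing {number}"

def assist(text: str) -> Optional[str]:
    words = text.lower().rstrip(".!").split()
    if not words or words[0] != "please":
        return None  # only react to requests that start with "please"
    # Plain keyword match: first known verb and first known contact name.
    verb = next((w for w in words if w in ACTIONS), None)
    name = next((w for w in words if w in ADDRESS_BOOK), None)
    if verb is None or name is None:
        return None
    return ACTIONS[verb](ADDRESS_BOOK[name])

print(assist("please fax this to Bob"))
```

There is no language understanding here at all. The filler words ("this", "to") are simply ignored, which is why a small vocabulary plus good data access can go a long way.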
levocardia•50m ago
Very obviously missing the mundane agentic work. I think the following things are basically already solved, and are just waiting for the right harness:

- Call this government service center, wait on hold for 45 minutes, then when they finally answer, tell them to reactivate my insurance marketplace account that got wrongly deleted.

- Find a good dentist within 2mi from my house, call them to make sure they take my insurance, and book an appointment sometime in the next two weeks no earlier than 11am

- Figure out how I'm going to get from Baltimore to Boston next Thursday, here's $100 and if you need more, ask me.

- I want to apply a posterizing filter in photoshop, take control of my mouse for the next 10sec and show me where it is in the menu

- Call that gym I never go to and cancel my membership

input_sh•22m ago
Basically already solved = you've never used it for any of those purposes and have no idea if or how well they would work?
irq-1•6m ago
> - Find a good dentist within 2mi from my house, call them to make sure they take my insurance, and book an appointment sometime in the next two weeks no earlier than 11am

The web caused dentists to make websites, but they don't post their appointment calendar; they don't have to.

Will AI looking for appointments cause businesses to post live, structured data (like calendars)? The complexity of scheduling and multiple calendars is perfect for an AI solution. What other AI uses and interactive systems will come soon?

- Accounting: generate balance sheets, audit in real-time, and have human accountants double check it (rather than doing)

- Correspondence: create and send notifications of all sorts, and consume them

- Purchase selection: shifting the lack of knowledge about products in the customer's favor

- Forms: doing taxes or applying for a visa

Mikhail_Edoshin•32m ago
AI would make a very good librarian. It doesn't understand, only comprehends, but in this case it is enough.

Thing is, there is no library for it to work in.

theonething•21m ago
Seems like data analysis would be a good one. Company ingests massive amounts of disparate business data. Ask AI to clean and normalize it, visualize it and give recommendations.
kken•19m ago
Well, considering that the long term idea is to have AGI, general intelligence, it seems that the goal is also to have only a single product in the end.

There may be different ways to access it, but the product is always the same.

EagnaIonat•4m ago
> This doesn’t work well because savvy users can manipulate the chatbot into calling tools. So you can never give a support chatbot real support powers like “refund this customer”, ...

I would disagree with this.

Part of how security is handled in current agentic systems is to not let the LLM have any access to how the underlying tools work. At best it's like hitting "inspect" in your browser and changing the web page.

Of course, that assumes that the agentic chatbot has been built correctly.
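One possible reading of "built correctly", sketched under assumptions: the model can only propose a tool call, and a server-side check decides whether it runs, using state tied to the authenticated session rather than anything in the chat text. The names (`Session`, `ToolCall`, `authorize`, `REFUND_LIMIT_CENTS`) and the policy itself are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Session:
    customer_id: str
    refundable_orders: frozenset  # order ids the backend says are refundable

@dataclass
class ToolCall:
    name: str
    args: dict  # arguments proposed by the model, treated as untrusted input

REFUND_LIMIT_CENTS = 5_000  # hypothetical policy ceiling

def authorize(call: ToolCall, session: Session) -> bool:
    """Decide from server-side state only; the chat text is never consulted."""
    if call.name == "lookup_order":
        return True
    if call.name == "refund":
        order_id = call.args.get("order_id")
        amount = call.args.get("amount_cents")
        return (isinstance(order_id, str)
                and order_id in session.refundable_orders
                and isinstance(amount, int)
                and 0 < amount <= REFUND_LIMIT_CENTS)
    return False  # unknown tools are denied by default

def execute(call: ToolCall, session: Session) -> str:
    if not authorize(call, session):
        return "denied"
    return f"ran {call.name}"  # a real system would invoke the tool here
```

With this shape, talking the chatbot into emitting a `refund` call gets a user nothing the backend would not already allow for their own account. Whether real support bots are built this way is a separate question.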
