Ask HN: Has anyone deployed LLMs to production?

12•saaspirant•1d ago
I have been trying to tune Gemini Flash to do some classification for me and it's not performing well at all. I had to change a lot of prompts, and it still didn't seem to "learn" anything from the training set. The classification embarrassingly lacks common sense.

Has anyone used AI for anything useful? Apart from programming of course.

Comments

muzani•1d ago
They're great at first-level customer service. Lots of questions are repetitive and they go through these better than humans. It was the biggest boost to customer satisfaction rating.

On the other end, I actually canceled a $100/month subscription once through email (it was a company email that I no longer had access to). Gave evidence. It canceled the subscription within 20 mins.

Also, Gemini Flash is unreliable. The best cost efficiency today seems to be GPT-4.1. The cheaper models seem to be okay for summarization mostly. Gemini Flash was much better a year ago, still unreliable, but at least it followed instructions.

mooreds•1d ago
We use it heavily for doc search. We bought Kapa.ai a few years ago and leverage their solution, not an in-house build.

byoung2•23h ago
I was having trouble getting GPT-4o to extract data like address, email, phone, and tracking number from random emails in an inbox. Sometimes it would do it perfectly and other times it would fail miserably on a similar email. Then I tried asking it to first mark up the email with schema.org metadata. Then I asked it to extract the data from the schema.org markup. That worked nearly every time.

Maybe there is an extra step you can work into your prompt that would help it get to the proper classification.
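A rough sketch of that two-step flow, in case it helps. This is not the commenter's actual code: the prompt wording, the `two_step_extract` name, the output keys, and the `call_llm` callable (whatever function sends a prompt to your model and returns its text reply) are all assumptions made for illustration.

```python
import json
from typing import Callable, Dict

# Hypothetical prompt wording; tune it for your own model and data.
MARKUP_PROMPT = (
    "Annotate the email below with schema.org metadata as JSON-LD. "
    "Return only the JSON-LD.\n\nEmail:\n{email}"
)
EXTRACT_PROMPT = (
    "From the schema.org JSON-LD below, return a JSON object with the keys "
    "address, email, phone, tracking_number. Use null for missing values.\n\n"
    "JSON-LD:\n{markup}"
)

# Hypothetical output keys, taken from the fields named in the comment above.
FIELDS = ("address", "email", "phone", "tracking_number")


def two_step_extract(email_text: str, call_llm: Callable[[str], str]) -> Dict[str, object]:
    """Run the markup step first, then extract fields from the markup only."""
    # Step 1: have the model restate the email as schema.org markup.
    markup = call_llm(MARKUP_PROMPT.format(email=email_text))
    # Step 2: extract from the structured markup, not from the raw email.
    raw = call_llm(EXTRACT_PROMPT.format(markup=markup))
    data = json.loads(raw)  # raises ValueError if the reply is not valid JSON
    return {field: data.get(field) for field in FIELDS}
```

Passing `call_llm` in as a parameter keeps the sketch independent of any particular provider SDK and makes it easy to test with a stub. The same shape could apply to classification: an intermediate step that restates the input in a structured form, followed by a second call that classifies from that form.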

nkristoffersen•21h ago
We are using over 50 billion LLM tokens for NLP/classification purposes per month. A mix of self-hosted and cloud-hosted models. But I have not attempted any fine-tuning. Just prompt and (perhaps more importantly) context “engineering”.

incomingpain•16h ago
I have Microsoft's Phi-4 deployed onto https://mapleintel.ca for the AI side. Currently over 44,000 IPs in that list.

I tried 'reasoning plus' but it was so much slower.
