We stopped paying OpenAI to debug our own code

https://modelriver.com/blog/test-mode-ai-workflows

2•vishaal_007•1h ago

Comments

vishaal_007•1h ago

Co-founder here. It's just me and my partner bootstrapping this thing. We've been wasting tokens left and right just trying to debug our response parsing code. Not even the AI logic, mind you, just our own sloppy stuff. And don't get me started on CI: tests flake out because GPT decides to rephrase something at random. We got fed up paying real money to fix our own bugs, so we hacked together a "Test Mode". It routes your calls through the full pipeline (auth, logging, everything) but swaps in your sample data instead of hitting the actual provider. No tokens burned, totally deterministic, and lightning fast.

How are folks handling testing for AI integrations? Mocking always felt half-baked to us since it bypasses the real flow. What's working for you?
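For anyone who wants to picture the idea, here is a rough sketch of the pattern described above. It is not ModelRiver's actual code. Every name in it (`Pipeline`, `complete`, `sample_responses`, `test_mode`) is made up for illustration. The point is only that auth and logging run in both modes, and the provider call is the single step that gets swapped for sample data.

```python
import logging
from dataclasses import dataclass, field
from typing import Callable

logger = logging.getLogger("pipeline")


@dataclass
class Pipeline:
    """Hypothetical request pipeline: auth -> logging -> provider (or sample data)."""

    provider: Callable[[str], str]  # the real, token-burning provider call
    api_keys: set[str]              # keys accepted by the auth step
    sample_responses: dict[str, str] = field(default_factory=dict)
    test_mode: bool = False

    def complete(self, api_key: str, prompt: str) -> str:
        # Auth runs in both modes, so test runs still exercise it.
        if api_key not in self.api_keys:
            raise PermissionError("invalid API key")

        # Logging also runs in both modes.
        logger.info("request prompt=%r test_mode=%s", prompt, self.test_mode)

        if self.test_mode:
            # Swap in the registered sample instead of calling the provider.
            # Failing loudly on a missing sample keeps tests deterministic.
            if prompt not in self.sample_responses:
                raise LookupError(f"no sample response for {prompt!r}")
            text = self.sample_responses[prompt]
        else:
            text = self.provider(prompt)

        logger.info("response chars=%d", len(text))
        return text
```

In a test you would build it with `test_mode=True` and a provider that raises if it is ever called, then point your response parser at whatever `complete()` returns. A plain mock skips the auth and logging steps, while this keeps them in the path.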

Show HN: Yardstiq – Compare LLM outputs side-by-side in your terminal

https://www.yardstiq.sh

1•stanleycyang•35s ago•0 comments

My accent costs me 30 IQ points on Zoom. So we built an ML model to fix it

https://krisp.ai/blog/introducing-accent-conversion-for-the-listener/

1•artavazdsm•3m ago•2 comments

The MokaBot Brews Better Coffee Than Me [video]

https://www.youtube.com/watch?v=UGf7mtfhOFM

1•jocoda•4m ago•0 comments

Altered default-mode network functional connectivity with mobile phone addiction [pdf]

https://e-century.us/files/ijcem/12/2/ijcem0078031.pdf

1•vitto_gioda•4m ago•0 comments

New scams emerging as leaked Odido data pops up on social media

https://nltimes.nl/2026/03/03/new-scams-emerging-leaked-odido-data-pops-social-media

1•TechTechTech•6m ago•0 comments

Show HN: BustAPI Back

https://github.com/RUSTxPY/BustAPI

1•ZOROX•6m ago•0 comments

Kind Technologies

http://www.kindtechnologies.com/index.htm

2•theonething•7m ago•0 comments

Wolt pulls out of Japan amid DoorDash exit from some Asian markets

https://www.japantimes.co.jp/business/2026/02/26/companies/wolt-japan-exit/

3•mikhael•8m ago•0 comments

Pass-Through of Tariffs: Evidence from European Wine Imports

https://www.nber.org/202603/digest/pass-through-tariffs-evidence-european-wine-imports

4•neehao•8m ago•0 comments

Show HN: TicketToPR, an open source tool that turns Notion tickets into PRs

https://github.com/JohnRiceML/ticket-to-pr

2•hello_code•8m ago•1 comment

Show HN: Pencil Puzzle Bench – LLM Benchmark for Multi-Step Verifiable Reasoning

https://ppbench.com/

2•bluecoconut•10m ago•0 comments

Production Agentic RAG Course

https://github.com/jamwithai/production-agentic-rag-course

2•redbell•10m ago•0 comments

Google CLI

https://gogcli.sh/

1•simonebrunozzi•10m ago•0 comments

Bhutan's crypto experiment shows how hard digital money is in the real world

https://restofworld.org/2026/bhutan-bitcoin-tourism-payment-adoption-failure/

1•PaulHoule•10m ago•0 comments

Show HN: DejaShip – an intent ledger to stop AI agents from building duplicates

https://github.com/mingulov/dejaship

1•mdn0•11m ago•0 comments

Show HN: WordPress for Voice Agents – Unpod.ai

https://github.com/parvbhullar/unpod

1•parvbhullar•11m ago•0 comments

I have $10k+ in cloud credits and want to turn them into a real business

1•Palominocoq•12m ago•0 comments

Show HN: I vibecoded a glucose analysis tool

https://github.com/daedalus/agp_tool

1•dclavijo•13m ago•0 comments

User Privacy and LLMs: An Analysis of Frontier Developers' Privacy Policies

https://arxiv.org/abs/2509.05382

1•redbell•13m ago•0 comments

Gary McKinnon Pentagon hacker interview [video]

https://www.youtube.com/watch?v=2ttdlCa5ZCI

1•childintime•14m ago•0 comments

A Story Bigger Than Iran by Garry Kasparov

https://www.thenextmove.org/p/a-story-bigger-than-iran

2•wslh•14m ago•0 comments

Authentication bypass in pac4j-JWT using only the RSA public key

https://www.codeant.ai/security-research/pac4j-jwt-authentication-bypass-public-key

2•Amartya_jha•15m ago•1 comment

I built a self-hosted RSS system for filtering (NetNewsWire + Miniflux)

https://jordankrueger.com/blog/self-hosted-rss-with-claude-code/

1•pandemicsoul•17m ago•1 comment

Gemini 3.1 Flash-Lite: Built for intelligence at scale

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-lite/

5•meetpateltech•17m ago•0 comments

Ask HN: How is Claude agent experience in Xcode 26.3?

3•malshe•17m ago•0 comments

Tech Is Shooting Itself in the Foot

https://datasciencetalent.co.uk/techs-dumbest-mistake-why-firing-programmers-for-ai-will-destroy-...

1•frag•18m ago•0 comments

Fine-Tuning Qwen3 Embeddings for product category classification

https://blog.ivan.digital/fine-tuning-qwen3-embeddings-for-product-category-classification-on-the...

1•ipotapov•18m ago•0 comments

Why your 2-week passkey sprint will turn into 6 months

https://www.corbado.com/blog/native-ios-android-passkey-implementation-challenges

1•vdelitz•19m ago•0 comments

Gemini 3.1 Flash-Lite

https://deepmind.google/models/model-cards/gemini-3-1-flash-lite/

1•meetpateltech•20m ago•0 comments

When AI Writes the Software, Who Verifies It?

https://leodemoura.github.io/blog/2026/02/28/when-ai-writes-the-worlds-software.html

2•todsacerdoti•20m ago•0 comments