How do you handle production webhook delivery reliability in your apps?

6•Tanjim•2d ago
Hey everyone,

I’ve been thinking a lot about webhook delivery reliability lately. In many projects I’ve worked on, building robust webhook infra turned out to be deceptively complex:

- Retry logic (exponential backoff, timeouts)
- Handling non-2xx responses
- Delivery monitoring and alerting
- Back-pressure or queueing to avoid overwhelming receivers
- Secure signing and validation flows
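To make the first two bullets concrete, here is a minimal Python sketch of a capped exponential backoff schedule with full jitter, plus a retry decision based on the response status. The function names, the default limits, and the rule of treating 429 and 5xx as retryable are my own assumptions for illustration, not a standard.

```python
import random


def backoff_delays(max_attempts=5, base=1.0, cap=60.0, rng=random.random):
    """Yield the delay (seconds) to wait before each retry.

    Uses "full jitter": a random point between 0 and the capped
    exponential ceiling, so many senders do not retry in lockstep.
    """
    for attempt in range(max_attempts):
        ceiling = min(cap, base * 2 ** attempt)
        yield rng() * ceiling


def should_retry(status):
    """Decide whether a delivery attempt is worth retrying.

    `status` is the HTTP status code, or None for a timeout or
    connection error (an assumption of this sketch).
    """
    if status is None:
        return True   # timeout or network failure: try again
    if status == 429:
        return True   # receiver asked us to slow down
    return 500 <= status < 600  # other 4xx are treated as permanent
```

Passing `rng=lambda: 1.0` removes the randomness and exposes the raw ceilings, which is handy for tests.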

In one project, a failed webhook caused a payment processing delay for hours because the retry logic was buggy. Another time, burst traffic took down the receiver endpoint with no DLQ strategy in place.
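For reference, the DLQ idea can be sketched in a few lines of Python: cap the number of attempts, and park the event in a dead-letter queue instead of dropping it or retrying forever. Everything here is hypothetical (the `deliver` name, the `send` callable that returns an HTTP status, the in-memory `deque` standing in for a durable queue).

```python
import time
from collections import deque


def deliver(event, send, dead_letters, max_attempts=3, sleep=time.sleep):
    """Try to deliver `event`; return True on success.

    `send(event)` is a caller-supplied function that returns an HTTP
    status code or raises on network errors. After `max_attempts`
    failures the event is appended to `dead_letters` for later
    inspection or replay.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            status = send(event)
        except Exception:
            status = None  # treat network errors like a failed attempt
        if status is not None and 200 <= status < 300:
            return True
        if attempt < max_attempts:
            sleep(2 ** (attempt - 1))  # 1s, 2s, 4s, ... between attempts
    dead_letters.append(event)
    return False


# Usage: a receiver that is down for the whole retry window.
dlq = deque()
ok = deliver({"id": 1}, lambda e: 503, dlq, sleep=lambda s: None)
```

In a real system the dead-letter queue would be durable storage with alerting on its depth, so a stuck payment event gets noticed in minutes rather than hours.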

I’ve been researching different approaches teams here use:

- Do you build your own custom webhook delivery queue and monitoring system?
- Use cloud solutions like AWS EventBridge or Step Functions to orchestrate?
- Or integrate third-party tools that handle delivery, retries, and observability for you?

I’m curious about how you ensure production-grade reliability at scale without burning dev hours on plumbing. Recently, I’ve been working on a tool in this space to handle these issues automatically, but would love to hear:

- What architecture have you found most reliable?
- What are the edge cases you’ve encountered (e.g. signature mismatches, downstream outages)?
- Any horror stories or lessons learned from webhook failures in production?
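As an example of the signature-mismatch case: a common cause is the receiver re-serializing the JSON body before verifying, so the bytes no longer match what was signed. Below is a minimal Python sketch of HMAC-SHA256 signing and verification over the raw body. The `timestamp.body` message layout, the function names, and the 300-second tolerance are assumptions for illustration, not any particular provider's scheme.

```python
import hashlib
import hmac


def sign(secret: bytes, timestamp: int, body: bytes) -> str:
    """Return a hex HMAC-SHA256 over "<timestamp>.<raw body>"."""
    message = str(timestamp).encode() + b"." + body
    return hmac.new(secret, message, hashlib.sha256).hexdigest()


def verify(secret: bytes, timestamp: int, body: bytes,
           signature: str, now: int, tolerance: int = 300) -> bool:
    """Check the signature and reject stale timestamps (replay guard)."""
    if abs(now - timestamp) > tolerance:
        return False
    expected = sign(secret, timestamp, body)
    # Constant-time comparison avoids leaking how many characters matched.
    return hmac.compare_digest(expected, signature)


secret = b"whsec_example"          # hypothetical shared secret
raw = b'{"amount":100}'            # bytes exactly as sent on the wire
sig = sign(secret, 1_700_000_000, raw)
```

Verifying against the raw request bytes succeeds, while verifying against a re-serialized body such as `b'{"amount": 100}'` fails even though the JSON is equivalent.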

Looking forward to learning from your experiences and best practices around webhook infra!

Comments

tasn•2d ago
Very biased, but I think you should just use Svix[1].

Though if you're interested, I recorded a video about webhook architecture at some point you may find useful: https://m.youtube.com/watch?v=4jvV75OD620

1: https://www.svix.com

kasey_junk•2d ago
I’m not affiliated with svix but a happy customer.

It has just worked for us in production for years. We’ve never had an issue.

Now our use case is pretty simple but for us it’s a piece of infrastructure we never worry about.

leakycap•2d ago
I think your questions beg another: where can we just take out this layer of complexity, and how?

Sometimes rather than chasing edge cases, I find another way to do the same thing using a routine or library that already has all the edge cases ironed out.

If you're a small team or one person, you can't expect to stay on top of something that starts broken.

ezekg•2d ago
Totally agree. For me, with a vanilla Rails app, I leaned on Sidekiq to handle webhook queueing, processing, and retries: https://keygen.sh/blog/how-to-build-a-webhook-system-in-rail...

It's scaled quite well. Billions of webhooks. I barely ever think about it.
