frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Security breaks during partial failures – design notes from distributed systems

5•sandhyavinjam•2h ago
TL;DR: Many security mechanisms fail not during attacks, but during partial outages. This post documents early design notes for a failure-aware security framework for distributed systems.

The problem

In production distributed systems, security often breaks when things are half working:

auth services degrade → retries explode

fallback paths widen access

recovery logic becomes the attack surface

Nothing is “exploited”, yet the system becomes unsafe.

Most security models assume stable components and clean failures. Real systems don’t behave that way.

Design assumptions

We assume:

correlated failures

retries are adversarial

timeouts are unsafe defaults

recovery paths matter as much as steady-state logic

We don’t assume:

global consistency

perfect identity

reliable clocks

centralized enforcement

Framework ideas (high level)

This work explores four ideas:

1. Failure-aware trust

Trust degrades under failure, not just compromise

Access narrows automatically during partial outages

2. Security invariants at runtime

Invariants are continuously enforced

Violations trigger containment, not alerts

3. Retry-safe security primitives

Idempotent, monotonic, side-effect bounded

Retries can’t escalate privilege

4. Security as observable state

Trust level, degradation, and containment are visible

If you can’t observe it, you can’t secure it

What this is not

Not zero trust marketing

Not compliance

Not a finished system

It’s an attempt to treat failure as the normal case, not an exception.

Why publish this early?

Because many real failures:

don’t fit clean research papers

happen during incidents, not attacks

are invisible outside production systems

We’re sharing design notes to get feedback before formalizing or evaluating further.

Feedback welcome

If you’ve seen security regressions during outages or retries causing unsafe behavior, I’d like to hear about it.

This is ongoing work. No claims of novelty or completeness.

Comments

1970-01-01•1h ago
Check out https://news.ycombinator.com/item?id=31627925

Security breaks during partial failures – design notes from distributed systems

5•sandhyavinjam•2h ago•1 comments

Ask HN: Building a tool to ensure things get done on time

3•Vishal19111999•1h ago•0 comments

Ask HN: When do we expose "Humans as Tools" so LLM agents can call us on demand?

31•vedmakk•9h ago•21 comments

Tell HN: Happy New Year

430•schappim•1d ago•199 comments

Ask HN: Which AI productivity tools are you using in 2026?

3•Vishal19111999•1h ago•0 comments

Ask HN: Why is Apple's voice transcription hilariously bad?

6•keepamovin•6h ago•4 comments

Ask HN: How did you learn to code?

23•chistev•20h ago•71 comments

Ask HN: How Are You Handling Auth in 2026?

5•joshcsimmons•11h ago•13 comments

Ask HN: What did you read in 2025?

334•kwar13•6d ago•443 comments

I built a public skill registry and MCP server so Codex can install new skills

2•iluxu•13h ago•0 comments

Ask HN: Loneliness at 19, how to cope?

60•yresting•4d ago•105 comments

Ask HN: What is the best microVMs for AI agents?

8•zfoong•1d ago•7 comments

Semantica – Open-source semantic layer and GraphRAG framework

7•kaifahmad1•20h ago•0 comments

Ask HN: Any example of successful vibe-coded product?

78•sirnicolaz•2d ago•117 comments

Ask HN: Does reading HN make you happy?

47•yakattak•2d ago•37 comments

Tell HN: Happy New Year!

4•realberkeaslan•1d ago•2 comments

Ask HN: How to do a Personal Cybersecurity audit

24•preciousoo•3d ago•12 comments

Tell HN: Stripe Dashboard Is Slow

2•_RPM•14h ago•2 comments

Ask HN: How long before the first civilian cargo flights are AI piloted?

2•givemeethekeys•1d ago•13 comments

Happy New Year HN!

11•thunderbong•1d ago•4 comments

Ask HN: How did you make yourself more marketable?

11•ronbenton•2d ago•13 comments

A curated directory of open-source AI projects

12•doanbactam•2d ago•2 comments

Ask HN: How to go back to listening to MP3s?

9•muratsu•5d ago•24 comments

TP-Link only works with a permanent internet connection

8•roscas•3d ago•7 comments

Ask HN: How are you sandboxing coding agents?

44•m-hodges•5d ago•31 comments

Tell HN: I am afraid AI will take my job at some point

24•funnyfoobar•6d ago•39 comments

Ask HN: How do you manage kids' accounts?

12•xfax•3d ago•7 comments

Ask HN: How do you get visibility if you're suuuuper bad at marketing?

13•ClipNoteBook•4d ago•22 comments

Ask HN: What do you use to manage your coding projects?

5•SunshineTheCat•2d ago•11 comments

Users decide which online platforms to trust in 2025

5•taka-dev•2d ago•3 comments