frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Architecture+cost drivers for a deterministic rule/metric engine 1,200metrics

2•Trackdiver•3w ago
I’m designing a large-scale deterministic analytics engine and would appreciate architectural + cost/effort advice from people who’ve built similar systems.

The core challenge: • ~1,200 domain-specific metrics • All rule-based (no ML), fully deterministic • Metrics share common primitives but differ in configuration • Metrics combine into composite indices • Outputs must be auditable and reproducible (same inputs → same outputs) • I want metrics definable declaratively (not hard-coded one by one)

The system ingests structured event data, computes per-entity metrics, and produces ranked outputs with full breakdowns.

I’m specifically looking for guidance on: • Architectures for large configurable rule/metric engines • How to represent metric definitions (DSL vs JSON/YAML vs expression trees) • Managing performance without sacrificing transparency • Avoiding “1,200 custom functions” antipatterns • What you’d do differently if starting this today

Cost / effort sanity check (important): If you were scoping this as a solo engineer or small team, what are the biggest cost drivers and realistic milestones? • What should “Phase 1” include to validate the engine (e.g., primitives + declarative metric format + compute pipeline + 100–200 metrics)? • What’s a realistic engineering effort range for Phase 1 vs “all 1,200” (weeks/months, 1–2 devs vs 3–5 devs)? • Any common traps that make cost explode (data modeling mistakes, premature UI, overengineering the DSL, etc.)?

I’m not looking to hire here — just trying to sanity-check design decisions and expected effort before implementation.

Thanks in advance for any insight.

Comments

crosslayer•3w ago
A pattern I’ve seen bite systems like this isn’t compute or storage first… it’s semantic drift in metric definitions over time.

When you have ~1,200 deterministic metrics sharing primitives, the real cost driver becomes definition coupling, not execution. If metrics are “configurable” but allowed to encode control flow, branching semantics, or hidden normalization rules, you end up with 1,200 soft-coded functions anyway… just harder to reason about.

One approach that’s worked well for me is to explicitly separate:

• Primitive signals (pure, immutable, versioned) • Metric transforms (strictly functional, no side effects, no cross-metric reads) • Aggregation/composition layers (where ranking and composite indices live)

The key constraint… metric definitions must be referentially transparent and evaluable in isolation. If a metric can’t be recomputed offline from recorded inputs and its definition hash, it’s already too powerful.

On representation… I’ve had better outcomes with a constrained expression tree (or typed DSL) than raw JSON/YAML. The goal isn’t flexibility… it’s preventing the system from becoming a general purpose programming environment.

For Phase 1, I’d strongly cap scope at:

• A small, fixed primitive vocabulary • 100–200 metrics max • Explicit versioning + replay tooling • Hard limits on metric execution cost

The biggest cost explosions I’ve seen come from

• Allowing metrics to depend on other metrics implicitly • Letting “configuration” evolve without versioned invariants • Optimizing performance before semantic boundaries are locked

Curious whether you’re thinking about definition immutability and replayability as first class constraints, or treating them as implementation details later.

Nintendo Wii Themed Portfolio

https://akiraux.vercel.app/
1•s4074433•2m ago•1 comments

"There must be something like the opposite of suicide "

https://post.substack.com/p/there-must-be-something-like-the
1•rbanffy•4m ago•0 comments

Ask HN: Why doesn't Netflix add a “Theater Mode” that recreates the worst parts?

2•amichail•5m ago•0 comments

Show HN: Engineering Perception with Combinatorial Memetics

1•alan_sass•11m ago•1 comments

Show HN: Steam Daily – A Wordle-like daily puzzle game for Steam fans

https://steamdaily.xyz
1•itshellboy•13m ago•0 comments

The Anthropic Hive Mind

https://steve-yegge.medium.com/the-anthropic-hive-mind-d01f768f3d7b
1•spenvo•13m ago•0 comments

Just Started Using AmpCode

https://intelligenttools.co/blog/ampcode-multi-agent-production
1•BojanTomic•15m ago•0 comments

LLM as an Engineer vs. a Founder?

1•dm03514•15m ago•0 comments

Crosstalk inside cells helps pathogens evade drugs, study finds

https://phys.org/news/2026-01-crosstalk-cells-pathogens-evade-drugs.html
2•PaulHoule•16m ago•0 comments

Show HN: Design system generator (mood to CSS in <1 second)

https://huesly.app
1•egeuysall•17m ago•1 comments

Show HN: 26/02/26 – 5 songs in a day

https://playingwith.variousbits.net/saturday
1•dmje•17m ago•0 comments

Toroidal Logit Bias – Reduce LLM hallucinations 40% with no fine-tuning

https://github.com/Paraxiom/topological-coherence
1•slye514•20m ago•1 comments

Top AI models fail at >96% of tasks

https://www.zdnet.com/article/ai-failed-test-on-remote-freelance-jobs/
4•codexon•20m ago•2 comments

The Science of the Perfect Second (2023)

https://harpers.org/archive/2023/04/the-science-of-the-perfect-second/
1•NaOH•21m ago•0 comments

Bob Beck (OpenBSD) on why vi should stay vi (2006)

https://marc.info/?l=openbsd-misc&m=115820462402673&w=2
2•birdculture•24m ago•0 comments

Show HN: a glimpse into the future of eye tracking for multi-agent use

https://github.com/dchrty/glimpsh
1•dochrty•25m ago•0 comments

The Optima-l Situation: A deep dive into the classic humanist sans-serif

https://micahblachman.beehiiv.com/p/the-optima-l-situation
2•subdomain•26m ago•1 comments

Barn Owls Know When to Wait

https://blog.typeobject.com/posts/2026-barn-owls-know-when-to-wait/
1•fintler•26m ago•0 comments

Implementing TCP Echo Server in Rust [video]

https://www.youtube.com/watch?v=qjOBZ_Xzuio
1•sheerluck•26m ago•0 comments

LicGen – Offline License Generator (CLI and Web UI)

1•tejavvo•29m ago•0 comments

Service Degradation in West US Region

https://azure.status.microsoft/en-gb/status?gsid=5616bb85-f380-4a04-85ed-95674eec3d87&utm_source=...
2•_____k•30m ago•0 comments

The Janitor on Mars

https://www.newyorker.com/magazine/1998/10/26/the-janitor-on-mars
1•evo_9•31m ago•0 comments

Bringing Polars to .NET

https://github.com/ErrorLSC/Polars.NET
3•CurtHagenlocher•33m ago•0 comments

Adventures in Guix Packaging

https://nemin.hu/guix-packaging.html
1•todsacerdoti•34m ago•0 comments

Show HN: We had 20 Claude terminals open, so we built Orcha

1•buildingwdavid•35m ago•0 comments

Your Best Thinking Is Wasted on the Wrong Decisions

https://www.iankduncan.com/engineering/2026-02-07-your-best-thinking-is-wasted-on-the-wrong-decis...
1•iand675•35m ago•0 comments

Warcraftcn/UI – UI component library inspired by classic Warcraft III aesthetics

https://www.warcraftcn.com/
2•vyrotek•36m ago•0 comments

Trump Vodka Becomes Available for Pre-Orders

https://www.forbes.com/sites/kirkogunrinde/2025/12/01/trump-vodka-becomes-available-for-pre-order...
1•stopbulying•37m ago•0 comments

Velocity of Money

https://en.wikipedia.org/wiki/Velocity_of_money
1•gurjeet•40m ago•0 comments

Stop building automations. Start running your business

https://www.fluxtopus.com/automate-your-business
1•valboa•44m ago•1 comments