
Show HN: Coelanox – auditable inference runtime in Rust (BERT runs today)

https://www.coelanox.com/

1•Shark1n4Suit•1h ago

PyTorch and ONNX Runtime tell you what came out. They can't tell you what actually ran to get there — which ops executed, in what order, on what inputs.

A model gets packaged into a sealed .cnox container. SHA-256 is verified before a single op executes. Inference walks a fixed plan over a minimal opset. Every run can emit a per-op audit log: op type, output tensor hash, output sample — cryptographically linked to the exact container and input that produced it. If something goes wrong in production, you have a trail.
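A minimal sketch of the flow described above: verify the container hash, walk a fixed plan, and emit one audit entry per op. This is not Coelanox's code. The function names, the little-endian f32 serialization used for hashing, and the plan representation are all assumptions made for illustration; only the log field names and the 262144-element threshold are taken from the example log in this post.

```python
import hashlib
import struct

# Assumed threshold, copied from the "audit_elided" message in the example log.
MAX_HASH_LEN = 262144


def sha256_hex(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def tensor_bytes(values):
    # Assumption: tensors are hashed as little-endian f32. The real format is not stated.
    return struct.pack(f"<{len(values)}f", *values)


def verify_container(container_bytes: bytes, expected_sha256_hex: str) -> str:
    # Refuse to run anything if the container does not match its expected hash.
    actual = sha256_hex(container_bytes)
    if actual != expected_sha256_hex:
        raise ValueError(f"container hash mismatch: {actual}")
    return actual


def audit_op(op_index, op_type, output, sample=4):
    entry = {
        "op_index": op_index,
        "op_type": op_type,
        "out_len": len(output),
        "out_sample_head": list(output[:sample]),
    }
    if len(output) > MAX_HASH_LEN:
        # Large outputs keep the sample but skip the hash, as in the example log.
        entry["audit_elided"] = f"hash_skipped: len {len(output)} > max {MAX_HASH_LEN}"
    else:
        entry["out_sha256_hex"] = sha256_hex(tensor_bytes(output))
    return entry


def run_plan(container_bytes, expected_sha256_hex, plan, x):
    # plan is a hypothetical list of (op_type, function) pairs executed in order.
    container_hash = verify_container(container_bytes, expected_sha256_hex)
    log = {
        "run": {"container_sha256_hex": container_hash, "backend": "scalar"},
        "input": {
            "len": len(x),
            "sha256_hex": sha256_hex(tensor_bytes(x)),
            "sample_head": list(x[:4]),
        },
        "ops": [],
    }
    for i, (op_type, fn) in enumerate(plan):
        x = fn(x)
        log["ops"].append(audit_op(i, op_type, x))
    return x, log
```

Because the log records both the container hash and the input hash, each op entry is tied to the exact pair that produced it. A tampered container fails in `verify_container` before the plan loop starts.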

Scalar backend today: the reference implementation and the permanent fallback when hardware acceleration isn't available. Audit and verification are identical across all backends. SIMD next, GPU after that.

Input below is synthetic (all-ones) — pipeline is identical with real inputs.

github.com/Coelanox/CLF

Audit example:

  {
    "schema": 2,
    "run": {
      "run_id": "59144ede-5a27-4dff-bc25-94abade5b215",
      "started_at_unix_ms": 1776535116721,
      "container_path": "/home/shark/cnox/models/output/bert_base_uncased.cnox",
      "container_sha256_hex": "184c291595536e3ef69b9a6a324ad5ee4d0cef21cc95188e4cfdedb7f1f82740",
      "backend": "scalar"
    },
    "input": {
      "len": 98304,
      "sha256_hex": "54ac99d2a36ac55b4619119ee26c36ec2868552933d27d519e0f9fd128b7319f",
      "sample_head": [1.0, 1.0, 1.0, 1.0]
    },
    "ops": [
      {
        "op_index": 0,
        "op_type": "Add",
        "out_len": 98304,
        "out_sample_head": [0.12242669, -4.970478, 2.8673656, 5.450008],
        "out_sha256_hex": "19f8aa0a618e5513aed4603a7aae2a333c3287368050e76d4aca0f83fb220e78"
      },
      {
        "op_index": 1,
        "op_type": "Add",
        "out_len": 98304,
        "out_sample_head": [0.9650015, 0.23414998, 1.539839, 0.30231553],
        "out_sha256_hex": "7ae2f025c8acf67b8232e694dd43caf3b479eb078366787e4fdc16d651450ad4"
      },
      {
        "op_index": 2,
        "op_type": "MatMul",
        "out_len": 98304,
        "out_sample_head": [1.0307425, 0.19207191, 1.5278282, 0.3000223],
        "out_sha256_hex": "44c28e64441987b8f0516d77f45ad892750b3e5b3916770d3baa5f2289e41bdd"
      },
      {
        "op_index": 3,
        "op_type": "Gelu",
        "out_len": 393216,
        "out_sample_head": [0.68828076, -0.0033473556, 1.591219, -0.16837223],
        "audit_elided": "hash_skipped: len 393216 > max 262144"
      }
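One way a log like this could be used when something goes wrong: compare the audit log of a known-good run against a suspect run and find the first op where they part ways. The sketch below is an illustration, not part of Coelanox. `first_divergence` is a hypothetical helper, and it assumes both logs are already parsed into dictionaries with the field names shown in the example above.

```python
def first_divergence(log_a, log_b):
    """Describe the first point where two audit logs disagree, or return None."""
    # Different containers or inputs make op-level comparison meaningless.
    if log_a["run"]["container_sha256_hex"] != log_b["run"]["container_sha256_hex"]:
        return "container hash differs"
    if log_a["input"]["sha256_hex"] != log_b["input"]["sha256_hex"]:
        return "input hash differs"

    for a, b in zip(log_a["ops"], log_b["ops"]):
        idx = a["op_index"]
        if a["op_type"] != b["op_type"] or a["out_len"] != b["out_len"]:
            return f"op {idx}: plan mismatch"
        hash_a = a.get("out_sha256_hex")
        hash_b = b.get("out_sha256_hex")
        if hash_a is None or hash_b is None:
            # Hash was elided for a large tensor; the sample is the only evidence left.
            if a["out_sample_head"] != b["out_sample_head"]:
                return f"op {idx}: sample differs (hash elided)"
            continue
        if hash_a != hash_b:
            return f"op {idx}: output hash differs"

    if len(log_a["ops"]) != len(log_b["ops"]):
        return "op count differs"
    return None
```

Exact hash equality is only a reasonable expectation for two runs of the same backend on the same container and input. Whether different backends produce bit-identical outputs is not stated in the post, so a cross-backend comparison would likely need a tolerance on the samples instead.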
