
Show HN: Hekate – A Zero-Copy ZK Engine Overcoming the Memory Wall

4•y00zzeek•5h ago
Most ZK proving systems are optimized for server-grade hardware with massive RAM. When scaling to industrial-sized traces (2^20+ rows), they often hit a "Memory Wall" where allocation and data movement become a larger bottleneck than the actual computation.

I have been developing Hekate, a ZK engine written in Rust that utilizes a Zero-Copy streaming model and a hybrid tiled evaluator. To test its limits, I ran a head-to-head benchmark against Binius64 on an Apple M3 Max laptop using Keccak-256.

The results highlight a significant architectural divergence:

At 2^15 rows: Binius64 is faster (147ms vs 202ms), but Hekate is already 10x more memory efficient (44MB vs ~400MB).

At 2^20 rows: Binius64 hits 72GB of RAM usage, entering swap hell on a laptop. Hekate processes the same workload in 4.74s using just 1.4GB of RAM.

At 2^24 rows (16.7M steps): Hekate finishes in 88s with a peak RAM of 21.5GB. Binius64 is unable to complete the task due to OOM/Swap on this hardware.
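One way to read the three data points above is as peak memory per trace row. The sketch below only divides the quoted peaks by the quoted trace heights; it assumes decimal units (1 GB = 10^9 bytes), and the names `bytes_per_row` and `REPORTED_PEAKS` are illustrative rather than anything from Hekate or Binius64. The two engines may define a "row" differently, so the per-row figures are best compared within one engine, not across them.

```rust
/// Peak bytes divided by trace height (2^log2_rows), truncated to whole bytes.
fn bytes_per_row(peak_bytes: u64, log2_rows: u32) -> u64 {
    peak_bytes >> log2_rows
}

/// (engine, log2 of row count, peak bytes) as quoted in the post.
/// Decimal units are an assumption; "~400MB" is entered as exactly 400 MB.
const REPORTED_PEAKS: [(&str, u32, u64); 5] = [
    ("Hekate", 15, 44_000_000),
    ("Hekate", 20, 1_400_000_000),
    ("Hekate", 24, 21_500_000_000),
    ("Binius64", 15, 400_000_000),
    ("Binius64", 20, 72_000_000_000),
];

/// Returns (engine, log2 rows, bytes per row) for every reported data point.
fn per_row_table() -> Vec<(&'static str, u32, u64)> {
    REPORTED_PEAKS
        .iter()
        .map(|&(engine, log2_rows, peak)| (engine, log2_rows, bytes_per_row(peak, log2_rows)))
        .collect()
}
```

Taken at face value, the Hekate figures work out to roughly 1.3 KB per row at all three sizes, while the Binius64 figures rise from about 12 KB per row at 2^15 to about 69 KB per row at 2^20.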

The core difference is "Materialization vs. Streaming". While many engines materialize and copy massive polynomials in RAM during Sumcheck and PCS operations, Hekate streams them through the CPU cache in tiles. This shifts the unit economics of ZK proving from $2.00/hour high-memory cloud instances to $0.10/hour commodity hardware or local edge devices.
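The materialization-versus-streaming distinction can be illustrated with a toy reduction. This is a minimal sketch, not Hekate's evaluator: `trace_row` is a hypothetical stand-in for generating one trace value, wrapping `u64` arithmetic stands in for field arithmetic, and the "sum of squares" is an arbitrary placeholder for a real Sumcheck or PCS step. The point is only that the streamed version reuses one fixed-size tile, so its working set does not grow with the number of rows.

```rust
/// Hypothetical row generator: stands in for producing one trace value.
fn trace_row(i: u64) -> u64 {
    i.wrapping_mul(0x9E37_79B9_7F4A_7C15).rotate_left(17)
}

/// Materializing approach: build the whole column in memory, then reduce it.
/// Peak memory grows linearly with `rows`.
fn sum_materialized(rows: u64) -> u64 {
    let column: Vec<u64> = (0..rows).map(trace_row).collect();
    column
        .iter()
        .fold(0u64, |acc, &x| acc.wrapping_add(x.wrapping_mul(x)))
}

/// Streaming approach: refill one fixed-size tile and reduce it before moving on.
/// The buffer holds `tile_len` elements regardless of `rows`.
fn sum_streamed(rows: u64, tile_len: usize) -> u64 {
    assert!(tile_len > 0, "tile length must be non-zero");
    let mut tile = vec![0u64; tile_len];
    let mut acc = 0u64;
    let mut start = 0u64;
    while start < rows {
        // The last tile may be shorter than `tile_len`.
        let len = (rows - start).min(tile_len as u64) as usize;
        for (j, slot) in tile[..len].iter_mut().enumerate() {
            *slot = trace_row(start + j as u64);
        }
        for &x in &tile[..len] {
            acc = acc.wrapping_add(x.wrapping_mul(x));
        }
        start += len as u64;
    }
    acc
}
```

Both functions return the same value for any row count, because wrapping addition does not depend on how the rows are grouped. A tile length chosen so that the buffer fits in L1 or L2 cache is the usual motivation for this layout; the right size is hardware-dependent and is not specified in the post.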

I am looking for feedback from the community, especially those working on binary fields, GKR, and memory-constrained SNARK/STARK implementations.

Comments

SERSI-S•5h ago
Interesting work. This seems highly relevant for ZK systems that need to generate large proofs on commodity hardware. Streaming-first proving could be a key enabler for permissionless ZK infrastructure.
y00zzeek•4h ago
Exactly. If we can't prove 2^24 rows on a laptop, ZK will stay centralized forever. Hekate is my answer to the memory wall that forces teams into $2+/hour AWS instances. Proving should be a commodity, not a luxury.
SERSI-S•4h ago
Agreed. The scary part is that memory requirements quietly define who is allowed to be a prover. If ZK infra assumes 64–128GB RAM by default, decentralization is already lost, regardless of the cryptography. Streaming-first designs feel like a prerequisite for permissionless proving, not just an optimization.
y00zzeek•4h ago
My motivation for building Hekate is simple: I am done watching well-funded teams with 50+ people and a busload of PhDs produce engineering trash.

There is a massive, widening gap between academic brilliance and silicon-level implementation. You can write the most elegant paper in the world, but if your prover requires 100GB of RAM to execute a basic trace, you haven't built a protocol, you've built a research project that collapses under its own weight.

I don't have "strategic planning" committees or HR-mandated consensus. If Hekate's core doesn't meet my performance standards, I rewrite it in 48 hours. This agility is a weapon. I want to prove that a single engineer, driven by physics and zero-copy principles, can wreck the unit economics of a multi-million dollar venture-backed startup.

Disrupting inefficient financial models is more than fun—it's necessary. The current "safe" hiring meta (US-only, HR-compliant, resume-padded candidates) is a strategic failure. While industry leaders focus on compliance, state-sponsored actors like Lazarus are eating their lunch.

You don't need "safe" candidates. You need predators. You need the difficult, inconvenient outliers who don't need a visa to outcode your entire department. Hekate is a reminder that in deep-tech, capital is noise, but performance is the only signal that matters.

wmf•4h ago
You should probably write this as a blog post or readme and submit the link instead. I can't provide any technical feedback since I don't even understand what a row is in this context.

> I don't have "strategic planning" committees or HR-mandated consensus...

Look, if your code is better just say it's better. But this kind of LinkedIn slop conspiracist virtue signaling isn't a good look. It's fine to believe that but you should never say it out loud.

y00zzeek•4h ago
Fair point on the tone. I'll trade the rhetoric for physics.

A "row" in this context is a single step of the Keccak-f[1600] permutation within the AIR (Algebraic Intermediate Representation) table. Most engines materialize this entire table in RAM before proving. At 2^24 rows, that’s where you hit the "Memory Wall" and your cloud bill goes parabolic.

Hekate is "better" because it uses a Tiled Evaluator to stream these rows through the CPU cache (L1/L2) instead of saturating the memory bus. While Binius64 hits 72GB RAM on 2^20 rows, Hekate stays at 21.5GB for 16x the workload (2^24).

The "committees" comment refers to the gap between academic theory and hardware-aware implementation. One prioritizes papers; the other prioritizes cache-locality. Most well-funded teams choose the easy path (more RAM, more AWS credits) over the hard path (cache-aware engineering).

If you want to talk shop, tell me how you'd handle GPA Keys computation at 2^24 scale without a zero-copy model. I’m genuinely curious.

dwb•34m ago
lol, post the prompt that generated this
y00zzeek•1h ago
Since the edit window is closed, I want to clarify the AIR structure for those asking about the "row" definition.

In Hekate's Keccak AIR, the relationship is ~25 trace rows per 1 Keccak-f[1600] permutation.

2^24 Rows = The raw size of the execution trace matrix (height).

~671k Permutations = The actual cryptographic workload (equivalent to hashing ~90MB of data).

The benchmark compares the cost to prove the same cryptographic work, regardless of internal AIR row mapping.
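The row-to-workload conversion above can be checked with a few lines of arithmetic. This sketch assumes exactly 25 rows per permutation (the comment says "~25") and assumes each permutation absorbs one full Keccak-256 rate block of 136 bytes, ignoring padding; the constant and function names are illustrative.

```rust
/// Approximate trace rows per Keccak-f[1600] permutation, per the comment above.
const ROWS_PER_PERMUTATION: u64 = 25;

/// Keccak-256 rate in bytes: (1600 - 2 * 256) / 8.
const KECCAK256_RATE_BYTES: u64 = 136;

/// Number of whole permutations that fit in a trace of the given height.
fn permutations_for_rows(trace_rows: u64) -> u64 {
    trace_rows / ROWS_PER_PERMUTATION
}

/// Input bytes absorbed if every permutation consumes one full rate block.
/// Padding is ignored, so this is an upper bound on message bytes.
fn absorbed_bytes(permutations: u64) -> u64 {
    permutations * KECCAK256_RATE_BYTES
}
```

Under these assumptions, `permutations_for_rows(1 << 24)` gives 671,088 permutations and `absorbed_bytes(671_088)` gives 91,267,968 bytes, which is consistent with the "~671k permutations" and "~90MB" figures quoted above.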

THE MANIFESTO: https://github.com/oumuamua-corp/hekate
