frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Bonsplit – Tabs and splits for native macOS apps

https://bonsplit.alasdairmonk.com
169•sgottit•9h ago•24 comments

Show HN: Fence – Sandbox CLI commands with network/filesystem restrictions

https://github.com/Use-Tusk/fence
45•jy-tan•5d ago•12 comments

Show HN: TUI for managing XDG default applications

https://github.com/mitjafelicijan/xdgctl
100•mitjafelicijan•10h ago•34 comments

Show HN: Netfence – Like Envoy for eBPF Filters

https://github.com/danthegoodman1/netfence
34•dangoodmanUT•6h ago•6 comments

Show HN: Gitmore – AI-powered Git reports that write themselves

2•inferno22•8h ago•0 comments

Show HN: AI-rganize – CLI tool for organizing your files

https://github.com/adefemi171/ai-rganize
2•tha_infra_guy•1h ago•0 comments

Show HN: LLMNet – The Offline Internet, Search the web without the web

https://github.com/skorotkiewicz/llmnet
17•modinfo•7h ago•4 comments

Show HN: A Zero-Copy 1.58-bit LLM Engine hitting 117 Tokens/s on single CPU core

https://github.com/r3-engine/r3-engine
2•dhilipsiva•1h ago•0 comments

Show HN: Accurate Password Guessing with AI

https://github.com/Tzohar/PassLLM
2•Plarsy•2h ago•0 comments

Show HN: I used my book generator to generate a catalog of books it can generate

https://www.ebook-forge.com/Omni
2•lywald•2h ago•2 comments

Show HN: CertRadar – Find every certificate ever issued for your domain

https://certradar.net/
7•ops_mechanic•3h ago•3 comments

Show HN: Uv-pack – Pack a uv environment for later portable (offline) install

https://github.com/davnn/uv-pack
3•davnn•2h ago•0 comments

Show HN: AutoShorts – Local, GPU-accelerated AI video pipeline for creators

https://github.com/divyaprakash0426/autoshorts
63•divyaprakash•14h ago•32 comments

Show HN: Kreamsicle – Cmd+K command palette for Hacker News

https://sajarin.com/blog/kreamsicle/
3•Sajarin•3h ago•2 comments

Show HN: Sightline – Shodan-style search for real-world infra using OSM Data

https://github.com/ni5arga/sightline
19•ni5arga•13h ago•0 comments

Show HN: C From Scratch – Learn safety-critical C with prove-first methodology

https://github.com/SpeyTech/c-from-scratch
55•william1872•21h ago•8 comments

Show HN: Open-source Figma design to code

https://github.com/vibeflowing-inc/vibe_figma
49•alepeak•1d ago•8 comments

Show HN: Bucket – Encrypted file sharing for people who live in the terminal

https://bucketlabs.org
4•bucket_•5h ago•3 comments

Show HN: Coi – A language that compiles to WASM, beats React/Vue

215•io_eric•5d ago•68 comments

Show HN: StormWatch – Weather emergency dashboard with prep checklists

https://jeisey.github.io/stormwatch/
43•lotusxblack•1d ago•11 comments

Show HN: Generate the perfect kickoff prompt

https://vibeprompting.dev
2•relatedcode•6h ago•0 comments

Show HN: Open Computer-Animated Multivariable Calculus Course in 6 Languages

https://calculus.academa.ai/
4•sinaatalay•6h ago•2 comments

Show HN: Free PDF Editor by TechRex – client-side PDF editing, OCR, compression

https://pdffreeeditor.com/
3•Maaz-Sohail•6h ago•0 comments

Show HN: AI powered daily tracker of the US slide into authoritarianism

https://www.worstdaysofar.com/
5•locallyoptimal•6h ago•0 comments

Show HN: VM-curator – a TUI alternative to libvirt and virt-manager

https://github.com/mroboff/vm-curator
36•theYipster•18h ago•7 comments

Show HN: Timer-wheel–based TTL cache for Node.js

https://github.com/m-thenot/tick-cache
2•mtht•7h ago•0 comments

Show HN: isometric.nyc – giant isometric pixel art map of NYC

https://cannoneyed.com/isometric-nyc/
1307•cannoneyed•3d ago•240 comments

Show HN: HomeGenGuide – Calculator for home generator installation costs

https://www.home-generator-installation.com
3•vansxxx•8h ago•0 comments

Show HN: RealXV6 – a faithful Unix V6 kernel port to 8086 real mode

https://github.com/FounderSG/RealXV6
5•FounderSG•5h ago•0 comments

Show HN: Semantic search engine for Studio Ghibli movie

https://ghibli-search.anini.workers.dev/
44•aninibread•4d ago•10 comments
Open in hackernews

Show HN: A Zero-Copy 1.58-bit LLM Engine hitting 117 Tokens/s on single CPU core

https://github.com/r3-engine/r3-engine
2•dhilipsiva•1h ago
The Project: I am building R3-Engine, a from-scratch, local AI inference engine for Microsoft's bitnet-b1.58-2B-4T. It is written in 100% Safe Rust, natively cross-compiles to Wasm SIMD128, and uses Zero heap allocations in the execution loop.

The Physics: By mapping a 64-byte aligned .r3 file directly from NVMe to CPU L3 Cache (Zero-Copy) and using AVX-512 VPOPCNTDQ for branchless math, the Ryzen 9950X3D achieves 117 Tokens/Second latency.

The Problem: The AI is mute (Outputting <unk>*)* The matrix multiplication pipeline is mathematically complete, but the output is stuck at Token ID 0 (<unk>). The issue lies in the transition between the quantized weights and the float-based non-linear activations.

Where I need expert input:

    Weight Tying in BitNet: Microsoft's 2B model ties Embeddings with the LM Head. I am cloning the embedding matrix for the output projection, but I suspect a scaling factor is missing.

    RMSNorm & SiLU in 1.58-bit: How should the raw integer accumulators (from the VPOPCNTDQ loop) be scaled before entering the SiLU activation and the subsequent layer?
GitHub Repo: https://github.com/r3-engine/r3-engine

If you know the physics of LLM Logit Sampling or ternary activation math, I would love your eyes on the codebase.