frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I embedded 685M public texts in 32 minutes (on 8x A100, Rust, TensorRT)

https://github.com/Artain-AI/ignite-ms
5•ddayanov•12h ago
Quick note on how it works and how I've done my batch embedding engine IgniteMS.

The whole thing runs as one process using Rust, reading input, tokenizing, packing batches, keeping the queue full. TensorRT handles inference. Python is only as a wrapper.

I built it this way because when you use more than couple of GPUs, the GPUs stop being the problem. CPU cannot feed them fast enough. One A100 can go through batches faster than Python can tokenize and feed, so the GPU just sits there idle waiting for work. Most of my time went into optimizing this. At 8 GPUs that was basically the entire challenge.

On cost. I ran the big 2B messages job on a spot p4d instance (8x A100 40GB). After filtering and dedupping I got 685M raw texts. With my new engine the whole production run finishes in about half an hour. Previously I used on-demand for these jobs, now switched to spots. If AWS reclaims the box, I just rerun it. It's roughly $7 for half-an-hour run. And at least right now spots are easier to get than on-demand.

Open warning: it's batch only and NVIDIA only. You can use it both as a docker image and native. I used some optimizations for my production run. With default settings you can expect to see ~250K msg/sec if you run the benchmark script on your p4d box. https://github.com/Artain-AI/ignite-ms/blob/main/BENCHMARKIN...

v1.1.0 added TensorRT 11 and 60 models, 23 tested on 1x and 4x A100.

Happy to share details.

Show HN: Uruky (EU-based Kagi alternative) now has Image Search and URL Rewrites

https://uruky.com/?il=en
219•BrunoBernardino•19h ago•198 comments

Show HN: Mercek – A Desktop IDE for AWS ECS

https://www.mercek.dev/
35•utibeumanah•7h ago•8 comments

Show HN: Hitoku Draft – Context aware local assistant

https://hitoku.me/draft/
16•lostathome•10h ago•3 comments

Show HN: Formally verified polygon intersection – Opus 4.8 oneshots, prev failed

https://github.com/schildep/verified-polygon-intersection
40•permute•6h ago•6 comments

Show HN: Prela – Purely Algebraic Relation Combinators

https://github.com/remysucre/prela
63•remywang•3d ago•13 comments

Show HN: Cost.dev (YC W21) – making agents cost-aware and cheaper to call

https://cost.dev/
29•akh•16h ago•15 comments

Show HN: I embedded 685M public texts in 32 minutes (on 8x A100, Rust, TensorRT)

https://github.com/Artain-AI/ignite-ms
5•ddayanov•12h ago•0 comments

Show HN: Papernews – self-hosted daily newspaper PDF for your reMarkable

https://github.com/marcj/papernews
4•bourbonproof•4h ago•1 comments

Show HN: NoiR Code – because QR sounds similar to "noir"

https://noir-code.suncake.xyz/
6•Sunkek•1d ago•2 comments

Show HN: Boxes.dev: ditch localhost; run Claude Code and Codex in the cloud

https://boxes.dev
85•nab•13h ago•63 comments

Show HN: ControllerTest-test gamepads,stick drift and polling rate by browser

https://controllertestonline.com/
3•zylics•3h ago•0 comments

Show HN: Using Haskell to play music on 3D printer motors (2020)

https://lucasoshiro.github.io/software-en/2020-07-31-music_gcode/
8•lucasoshiro•8h ago•2 comments

Show HN: Edsger – A handwritten Clojure REPL for the reMarkable 2

https://handwritten.danieljanus.pl/2026-06-01-edsger.html
253•nathell•2d ago•34 comments

Show HN: I reverse-engineered the world maps of Test Drive III (1990 DOS game)

https://github.com/s-macke/Test-Drive-3-Maps
213•s-macke•4d ago•55 comments

Show HN: A native port of Skate 3 for Windows & Linux via static recompilation

https://github.com/mchughalex/skate3recomp
4•turingfeel•6h ago•0 comments

Show HN: Nutrepedia – Nutrition info in 29 locales built with Clojure and Htmx

https://nutrepedia.com/en-us/
129•llovan•1d ago•28 comments

Show HN: Eyeball

https://eyeball.rory.codes/
288•mrroryflint•2d ago•86 comments

Show HN: FFmpeg WebCLI – Full FFmpeg in Browser, Offline PWA, No Uploads(WASM)

https://github.com/tejaswigowda/ffmpeg-webCLI
76•tejaswigowda•7h ago•23 comments

Show HN: Mnemo – local-first AI memory layer for any LLM (Rust, SQLite,petgraph)

https://github.com/zaydmulani09/mnemo
54•zaydmulani•1d ago•25 comments

Show HN: Bot or Not – Spot AI-generated randomness

https://play-bot-or-not.vercel.app/
4•EvanZhouDev•8h ago•0 comments

Show HN: Digger Solo – Local AI File Explorer

https://solo.digger.lol
3•sean_pedersen•8h ago•0 comments

Show HN: Vlags, small vector flags. Less than 900 bytes each

https://vlags.com/
3•ryancanzo•9h ago•0 comments

Show HN: A simple bookmark manager for Mac and iOS

https://bendansby.com/apps/markwell.html
4•webwielder2•3h ago•0 comments

Show HN: Live breath detection and biofeedback from a phone microphone

https://github.com/shiihaa-app/shiihaa-breath-detection
64•felixzeller•2d ago•25 comments

Show HN: Bio Glyph – Turn Your Face into a One-Line Drawing

https://bio.bairui.dev/
19•subairui•1d ago•15 comments

Show HN: Rscrypto, pure-Rust crypto with industry leading public benches

https://github.com/loadingalias/rscrypto
32•LoadingALIAS•1d ago•14 comments

Show HN: Ideogram 4.0 – open-weight 9.3B text-to-image model

https://github.com/ideogram-oss/ideogram4
44•pigcat•1d ago•10 comments

Show HN: Anthrosevka Mono, an Iosevka Build Inspired by Anthropic Mono

https://github.com/nanxstats/anthrosevka
2•road2stat•11h ago•0 comments

Show HN: Bit-banged 10BASE-T Ethernet and Wi-Fi router in Rust on the Pico 2 W

https://github.com/mattdeeds/pico-10base-t-rs
5•mdeeds•12h ago•0 comments

Show HN: RePlaya – self-hosted browser session replay with live tailing

https://github.com/s2-streamstore/replaya
50•shikhar•2d ago•8 comments