
Show HN: (bits) of a Libc, Optimized for Wasm

https://github.com/ncruces/go-sqlite3/tree/main/sqlite3/libc
78•ncruces•10mo ago
I make a no-CGO Go SQLite driver, by compiling the amalgamation to Wasm, then loading the result with wazero (a CGO-free Wasm runtime).

To compile SQLite, I use wasi-sdk, which uses wasi-libc, which is based on musl. It's been said that musl is slow(er than glibc), which is true, to a point.

musl uses SWAR on a size_t to implement various functions in string.h. This is fine, except size_t is just 32-bit on Wasm.
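
A minimal sketch of that SWAR pattern, written against `size_t` so that each step covers `sizeof(size_t)` bytes: 8 on a typical 64-bit target, but only 4 on wasm32. The name `swar_memchr` and the `SWAR_*` macros are made up for illustration; this is not musl's actual source.

```c
#include <limits.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper macros, sized to size_t. */
#define SWAR_ONES  ((size_t)-1 / UCHAR_MAX)          /* 0x01 in every byte */
#define SWAR_HIGHS (SWAR_ONES * (UCHAR_MAX / 2 + 1)) /* 0x80 in every byte */
#define SWAR_HASZERO(x) (((x) - SWAR_ONES) & ~(x) & SWAR_HIGHS)

/* Find the first occurrence of byte c in the n bytes at s, or NULL. */
void *swar_memchr(const void *s, int c, size_t n)
{
    const unsigned char *p = s;
    const unsigned char byte = (unsigned char)c;
    const size_t pattern = SWAR_ONES * byte; /* c repeated in every byte */

    /* Word loop: memcpy keeps the load alignment- and aliasing-safe. */
    while (n >= sizeof(size_t)) {
        size_t word;
        memcpy(&word, p, sizeof word);
        /* XOR turns every matching byte into a zero byte. */
        if (SWAR_HASZERO(word ^ pattern))
            break; /* a match is somewhere in this word */
        p += sizeof word;
        n -= sizeof word;
    }
    /* Byte loop: locates the match, or scans the short tail. */
    for (; n; p++, n--)
        if (*p == byte)
            return (void *)p;
    return NULL;
}
```

Because the word loop never reads beyond `n` bytes, this variant stays within the buffer; the cost is that a 32-bit `size_t` halves the work done per iteration.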

I found that implementing a few of those functions with Wasm SIMD128 can make them go around 4x faster.

Other functions don't even use SWAR; redoing those can make them 16x faster.

Smooth sort also has trouble pulling its own weight; a Shell sort seems both simpler and faster, while similarly avoiding recursion, allocations and the addressable stack.
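
For illustration, here is a sketch of a Shell sort behind a `qsort`-style signature. It is iterative, sorts in place, and never calls an allocator. The names `shell_sort`, `swap_bytes` and `cmp_int` are hypothetical, and the gap sequence (Ciura's) is an assumption; the thread does not say which gaps the actual implementation uses.

```c
#include <stddef.h>

/* Swap two non-overlapping elements of `size` bytes, one byte at a time. */
static void swap_bytes(unsigned char *a, unsigned char *b, size_t size)
{
    while (size--) {
        unsigned char t = *a;
        *a++ = *b;
        *b++ = t;
    }
}

/* Hypothetical qsort-compatible Shell sort: no recursion, no malloc. */
void shell_sort(void *base, size_t nmemb, size_t size,
                int (*cmp)(const void *, const void *))
{
    /* Ciura's empirically derived gaps, largest first. */
    static const size_t gaps[] = {701, 301, 132, 57, 23, 10, 4, 1};
    unsigned char *a = base;

    for (size_t g = 0; g < sizeof gaps / sizeof gaps[0]; g++) {
        size_t gap = gaps[g];
        /* Gapped insertion sort: sink each element along its gap chain. */
        for (size_t i = gap; i < nmemb; i++) {
            for (size_t j = i;
                 j >= gap && cmp(a + (j - gap) * size, a + j * size) > 0;
                 j -= gap)
                swap_bytes(a + (j - gap) * size, a + j * size, size);
        }
    }
}

/* Example comparator for int elements. */
int cmp_int(const void *x, const void *y)
{
    int a = *(const int *)x, b = *(const int *)y;
    return (a > b) - (a < b);
}
```

A call looks the same as with `qsort`: `shell_sort(v, count, sizeof v[0], cmp_int);`. The final gap of 1 is a plain insertion sort, which is what guarantees a fully sorted result. For very large arrays the fixed table would need to be extended with larger gaps to stay efficient.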

I found that using SIMD intrinsics (rather than SWAR) makes it easier to avoid UB, but the code would definitely benefit from more eyeballs.

See this for some benchmarks on both x86-64 and Aarch64: https://github.com/ncruces/go-sqlite3/actions/runs/145169318...

Comments

phickey•10mo ago
This looks like a nice approach to making wasi-libc faster. Could you submit these changes upstream?
ncruces•10mo ago
I'd like to be a little more sure that I'm not totally messing things up before doing that, but yes, eventually, that would be a nice outcome.

I've also only really tested wazero. I can't know for sure that this is a straight improvement for other runtimes and architectures.

For instance, the code delays using wasm_i8x16_bitmask as much as possible, because on Aarch64 it can be slower than not using SIMD at all, whereas it's plenty fast on x86-64.

phickey•10mo ago
The maintainers of wasi-libc are some of the best people to review this, and I don’t think it would be wasting their time to ask them to look at a PR.
ncruces•10mo ago
A PR is a significant investment from me. I'd have to figure out where something like this is supposed to fit, how the build infra works, etc.

One of the nice things about Go is how much that's a solved issue out of the box, compared to almost everything else; certainly compared to C.

Pinging them in an issue: https://github.com/WebAssembly/wasi-libc/issues/580

nu11ptr•10mo ago
It is still a bit early, but I'm majorly bullish on WASM for multiple use cases:

1. Client side browser polyglot "applets" (Java applets were ahead of their time IMO)

2. Server side polyglot "servlets" (Node.js, embedded runtimes, etc.)

3. Language interop/FFI (Lang A -> WASM -> Lang B, like wasm2c)

Why is #3 so interesting? The hardest thing in language conversion is the library calls. WASI standardizes that, so all the proprietary libs will eventually compile down to WASI as a sort of POSIX/libc-like layer. In addition, WASM standardizes the calling convention. The resulting new source code may not look like much, but it will solve the FFI calling convention/marshalling/library issues nicely.

frumplestlatz•10mo ago
I’m not sure how it solves the FFI problem. Lowest common denominator calling conventions don’t make it any easier to bridge languages than it already is.

C calling conventions are already the standard for FFI in native code, and that means dropping down to what can be expressed in C if you want to cross that boundary.

ncruces•10mo ago
As far as Go is concerned, the Wasm sandbox makes the (addressable, C) stack explicit, which solves at least some of the issues CGO has to deal with.

It's not a panacea, though; it introduces other issues.

fuhsnn•10mo ago
Wasm intrinsics look neat as a higher-level fixed-size SIMD abstraction. I wonder how well compilers can do when using them for AOT targets with libraries like simd-everywhere.

string.h is missing strstr(), there's an algorithm of similar complexity you might consider: http://0x80.pl/notesen/2016-11-28-simd-strfind.html

ncruces•10mo ago
Yeah, so far I did exactly the ones (my build of) SQLite needed and not others.

If there's interest, the set of implemented functions can definitely be extended.

cedws•10mo ago
Would you consider writing some blog posts or other resources about WASM? I was experimenting recently with WIT, and ran into a mountain of issues. There's a lot of jargon that could do with some untangling.

It took me a lot longer than it should have to put together this basic module, and even then there's this shared library I had to download to build it, and I couldn't figure out why this requires a libc:

https://github.com/cedws/wasm-wit-test

ncruces•10mo ago
I'm not that great at long-form writing, to be honest; it's always a bit of a chore, and I'm never happy with the result.

To answer your question, it needs a libc because you're including stdlib.h, and exporting an allocator (even if you're not otherwise using it). You need a libc for malloc.

This is generally a good idea, if you need to send anything beyond numbers across the API (e.g. you need an allocator if you want to send strings as pointers).
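
As a rough sketch of what that looks like on the C side: the module exports an allocator so the host can reserve space in linear memory, copy a string in, and then pass it as a (pointer, length) pair. The export names `guest_alloc`, `guest_free` and `count_spaces` and the `WASM_EXPORT` macro are invented for this example. `export_name` is a clang attribute for WebAssembly targets, and the macro expands to nothing elsewhere so the code also builds natively.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical macro: names the Wasm export under clang, no-op elsewhere. */
#if defined(__wasm__)
#define WASM_EXPORT(name) __attribute__((export_name(name)))
#else
#define WASM_EXPORT(name)
#endif

/* Host calls this to reserve `len` bytes inside the module's memory. */
WASM_EXPORT("guest_alloc")
void *guest_alloc(size_t len)
{
    return malloc(len); /* this call is what pulls in a libc */
}

/* Host calls this to release a buffer obtained from guest_alloc. */
WASM_EXPORT("guest_free")
void guest_free(void *ptr)
{
    free(ptr);
}

/* Example API taking a string as (pointer, length): counts ASCII spaces. */
WASM_EXPORT("count_spaces")
uint32_t count_spaces(const char *ptr, size_t len)
{
    uint32_t n = 0;
    for (size_t i = 0; i < len; i++)
        n += (ptr[i] == ' ');
    return n;
}
```

The host side would call `guest_alloc`, write the string bytes at the returned offset, call `count_spaces` with that offset and length, and finally call `guest_free`.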

I never used WIT, so I have no idea if this is a requirement for WIT.

cedws•10mo ago
Ah ok. Thanks!
forrestthewoods•10mo ago
What is SWAR?
ncruces•10mo ago
SIMD within a register: https://en.wikipedia.org/wiki/SWAR

It's generally used for techniques that apply SIMD principles within general-purpose registers and instructions.

Assume you've loaded a 64-bit register (a uint64_t) with 8 bytes (unsigned char) of data. Can you answer the question “is any of these 8 bytes zero (the NUL terminator)?”

If you find a cheap way to do it, you can make strlen go faster by consuming 8 bytes at a time.

Et voilà:

   #define ONES ((uint64_t)-1/UCHAR_MAX)
   #define HIGHS (ONES * (UCHAR_MAX/2+1))
   #define HASZERO(x) ((x)-ONES & ~(x) & HIGHS)
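
To show how those macros might be put to work, here is a sketch of a `strlen` that checks 8 bytes per step. The function name `swar_strlen` is hypothetical, and the macros are repeated with extra parentheses so the block stands alone. One loud caveat: the word loop can read a few bytes past the terminator inside the same aligned 8-byte word. That cannot cross a page boundary, but ISO C does not strictly permit it, so treat this as an illustration of the trick rather than portable code.

```c
#include <limits.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define ONES ((uint64_t)-1/UCHAR_MAX)
#define HIGHS (ONES * (UCHAR_MAX/2+1))
#define HASZERO(x) (((x)-ONES) & ~(x) & HIGHS)

/* Hypothetical strlen that tests 8 bytes at a time once p is aligned. */
size_t swar_strlen(const char *s)
{
    const char *p = s;

    /* Byte steps until p is 8-byte aligned. */
    for (; (uintptr_t)p % sizeof(uint64_t); p++)
        if (!*p)
            return (size_t)(p - s);

    /* Word steps: stop at the first word containing a NUL byte.
       Assumption: reading the rest of an aligned word is harmless. */
    for (;;) {
        uint64_t word;
        memcpy(&word, p, sizeof word);
        if (HASZERO(word))
            break;
        p += sizeof word;
    }

    /* Byte steps again to pin down exactly where the NUL is. */
    while (*p)
        p++;
    return (size_t)(p - s);
}
```

`HASZERO` only says that some byte in the word is zero, not which one, which is why the last loop finishes the job byte by byte.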
forrestthewoods•10mo ago
TIL, thanks!
tuananh•10mo ago
very cool project.

it's kinda frustrating to compile sqlite for wasm. can be done but quite troublesome.