frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Mac history echoes in current Mac operating systems

http://tenfourfox.blogspot.com/2025/08/mac-history-echoes-in-mac-operating.html
40•classichasclass•1h ago•7 comments

Claude Code IDE integration for Emacs

https://github.com/manzaltu/claude-code-ide.el
588•kgwgk•14h ago•193 comments

Rules by Which a Great Empire May Be Reduced to a Small One (1773)

https://founders.archives.gov/documents/Franklin/01-20-02-0213
80•freediver•4h ago•25 comments

A Candidate Giant Planet Imaged in the Habitable Zone of α Cen A

https://arxiv.org/abs/2508.03814
26•pinewurst•2h ago•9 comments

Project Hyperion: Interstellar ship design competition

https://www.projecthyperion.org
162•codeulike•7h ago•136 comments

Litestar is worth a look

https://www.b-list.org/weblog/2025/aug/06/litestar/
200•todsacerdoti•8h ago•50 comments

More than two hard disks in DOS

https://www.os2museum.com/wp/more-than-two-hard-disks-in-dos/
6•userbinator•3d ago•0 comments

We'd be better off with 9-bit bytes

https://pavpanchekha.com/blog/9bit.html
103•luu•8h ago•192 comments

Jules, our asynchronous coding agent

https://blog.google/technology/google-labs/jules-now-available/
241•meetpateltech•11h ago•164 comments

Show HN: Kitten TTS – 25MB CPU-Only, Open-Source TTS Model

https://github.com/KittenML/KittenTTS
790•divamgupta•22h ago•322 comments

Writing a Rust GPU kernel driver: a brief introduction on how GPU drivers work

https://www.collabora.com/news-and-blog/blog/2025/08/06/writing-a-rust-gpu-kernel-driver-a-brief-introduction-on-how-gpu-drivers-work/
224•losgehts•11h ago•28 comments

The Day MOOCs Died: Coursera's Preview Mode Kills Free Learning

https://www.classcentral.com/report/coursera-preview-mode-paywall/
34•deepakkarki•3d ago•20 comments

Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs

https://www.baseten.co/blog/sota-performance-for-gpt-oss-120b-on-nvidia-gpus/
5•philipkiely•1h ago•0 comments

You know more Finnish than you think

https://dannybate.com/2025/08/03/you-know-more-finnish-than-you-think/
62•infinate•2d ago•28 comments

A fast, growable array with stable pointers in C

https://danielchasehooper.com/posts/segment_array/
143•ibobev•9h ago•58 comments

The Bluesky Dictionary

https://www.avibagla.com/blueskydictionary/
119•gaws•7h ago•41 comments

Apple increases US commitment to $600B, announces American Manufacturing Program

https://www.apple.com/newsroom/2025/08/apple-increases-us-commitment-to-600-billion-usd-announces-ambitious-program/
26•Zenbit_UX•4h ago•10 comments

301party.com: Intentionally open redirect

https://301party.com/
69•nahikoa•7h ago•13 comments

Multics

https://www.multicians.org/multics.html
102•unleaded•11h ago•21 comments

Out-Fibbing CPython with the Plush Interpreter

https://pointersgonewild.com/2025-08-06-out-fibbing-cpython-with-the-plush-interpreter/
23•Bogdanp•4h ago•0 comments

Show HN: HMPL – Small Template Language for Rendering UI from Server to Client

https://github.com/hmpl-language/hmpl
7•aanthonymax•17h ago•5 comments

Comptime.ts: compile-time expressions for TypeScript

https://comptime.js.org/
104•excalo•3d ago•17 comments

A Man Who Beat IBM

https://every.to/feeds/b0e329f3048258e8eeb7/the-man-who-beat-ibm
45•vinnyglennon•3d ago•15 comments

Breaking the sorting barrier for directed single-source shortest paths

https://www.quantamagazine.org/new-method-is-the-fastest-way-to-find-the-best-routes-20250806/
139•baruchel•13h ago•43 comments

The Inkhaven Blogging Residency

https://www.inkhaven.blog/
29•venkii•3h ago•28 comments

Zig Error Patterns

https://glfmn.io/posts/zig-error-patterns/
124•Bogdanp•12h ago•33 comments

Automerge 3.0

https://automerge.org/blog/automerge-3/
251•surprisetalk•3d ago•21 comments

303Gen – 303 acid loops generator

https://303-gen-06a668.netlify.app/
180•ankitg12•15h ago•62 comments

AI in Search is driving more queries and higher quality clicks

https://blog.google/products/search/ai-search-driving-more-queries-higher-quality-clicks/
46•thm•10h ago•61 comments

Rethinking DOM from first principles

https://acko.net/blog/html-is-dead-long-live-html/
192•puzzlingcaptcha•21h ago•169 comments
Open in hackernews

The Bluesky Dictionary

https://www.avibagla.com/blueskydictionary/
119•gaws•7h ago

Comments

neaden•6h ago
Is this not working or am I missing something, it just shows as seeing 0 words for me. Firefox on a PC.
SirFatty•6h ago
Same... maybe you need a Bluesky account, which I don't have.
gpm•6h ago
It doesn't... I can open it in a private browsing window.
GalaxyNova•6h ago
It's working fine for me on Firefox
accrual•6h ago
You may need to allow scripts from the domain avibagla.com, it shows 0 when the scripts are blocked.
zem•5h ago
ugh, it ought to be building the results on the server and serving up static pages.
rafram•4h ago
But it updates live...
forgotmypw17•2h ago
It could do both...
dymk•1h ago
then go build it…
Noumenon72•1h ago
All the scripts are ERR_SSL_PROTOCOL_ERROR for me in Chrome, I'm assuming because of a corporate firewall.
AgentME•5h ago
For me it took a minute to start loading data and switch from just showing 0.
GalaxyNova•6h ago
fascinating! I think it's really cool that this is possible, and at the same time kine of sad that the norm is slowly moving towards more locked-down APIs.
timeon•5h ago
> slowly moving towards

Depends what we accept as norm.

75345d4c•6h ago
I just saw it indexed "eluvium," but the post was referring to a band with that same name
Kye•6h ago
GeologySky will get to it soon enough.
k7sune•1h ago
Thanks to this I just learned about alluvium, eluvium, illuvium, and colluvium.
atlgator•5h ago
I checked out the author's other projects and this is common issue. For example, he has a "lean checker" for bluesky that claims it is right-leaning simply because of all the people saying "That's right," "He was right," etc. None of the supposed right-leaning posts were actually conservative in nature. They just used to word right to mean correct.
avibagla1•5h ago
one, thank you for checking my website. two, that is the joke, 100% - at the time people kept talking about how "left leaning" bsky was and that idea came to mind
OneDeuxTriSeiGo•2h ago
lmao that's fantastic
wantlotsofcurry•5h ago
I'm very curious as to how this works in the backend. I realize it uses Bluesky's firehose to get the posts, but I'm more curious on how it's checking whether a post contains any of the available words. Any guesses?
bangaladore•5h ago
Maybe I'm being naive, but with only ~275k words to check against, this doesn't seem like a particularly hard problem. Ingest post, split by words, check each word via some db, hashmap, etc... and update metadata.
somebehemoth•1h ago
I think the cool part is watching words go brrr.
gpm•5h ago
Probably just a big hashtable mapping word -> the number of times it's been seen, and another hashset of all the words it hasn't seen. When a post comes in you hash all the words in it and look them up in the hashtable, increment it, and if the old value was 0 remove it from the hash set.

250k words at a generous 100 bytes per word is only 25MB of memory...

f311a•5h ago
You can probably fit all words under 10-15MB of memory, but memory optimisations are not even needed for 250k words...

Trie data structures are memory-efficient for storing such dictionaries (2-4x better than hashmaps). Although not as fast as hashmaps for retrieving items. You can hash the top 1k of the most common words and check the rest using a trie.

The most CPU-intensive task here is text tokenizing, but there are a ton of optimized options developed by orgs that work on LLMs.

stwrzn•5h ago
I very much hope that the backend uses one of the bluesky jetstream endpoints. When you only subscribe to new posts, it provides a stream of around 20mbit/s last time I checked, while the firehose was ~200mbit/s.
avibagla1•5h ago
yes it does!
avibagla1•5h ago
Hey! this is my site - it's not all that complex, i'm just using a sqlite db with two tables - one for stats, the other for all the words that's just word | count | first use | last use | post.

I... did not expect this to be so popular

gumboshoes•2h ago
What is your source dictionary to compare to? Seems kind of small. Also, how are you handling inflected forms?
avibagla1•25m ago
https://github.com/words/an-array-of-english-words

using this, a combo of "covered enough" for the bit and easy to use

also, since i'm tracking every word (technically a better name for this project would be The Bluesky Corpus) all inflected forms are different words, which aligns with my thinking

blendo•59m ago
What are the table sizes?

And what ingress bandwidth do you have?

avibagla1•17m ago
DB is currently 58mb (damn lol)

Ingress is actually pretty manageable, ~900kbps

spullara•5h ago
I did this against a pretty large tweet archive and got hits on about 125k of the words in the unix dictionary.
pona-a•5h ago
For a moment I thought it would be an AT-Proto based Urban Dictionary clone.
tough•5h ago
Words We Haven't Seen

- Search unseen words

made me chuckle

crm9125•4h ago
I've found content for all of my future skeets.
refreeze654•4h ago
I've wondered how blueksy affords the bandwidth to let anyone stream the full firehose.
dgacmu•4h ago
Not an answer to your question, but I suspect most people don't -- my bot (a pi searcher bot, of course) just runs on Jetstream, which is pretty lightweight and heavily compressed.

(The website in question uses jetstream also.)

psionides•2h ago
From what they say, it is a lot, but it's generally on the order of a few hundreds of connections total at the moment
blintz•2h ago
I'm surprised at how normal some of the unseen words are. I expected them to all be archaic or niche, but many are pretty reasonable: 'congregant', 'definer', 'stereoscope'.
gkoberger•2h ago
For what it's worth, there's 1.7bn posts on Bluesky according to this: https://bsky.jazco.dev/stats

The dictionary site has only checked 4,920,000 posts, which is 0.28% of all messages.

fudged71•1h ago
I'm just surprised that there's revolt when Bluesky posts are used for LLMs, but regular NLP is fine for some reason.