To study how chips work, MIT researchers built their own operating system

https://news.mit.edu/2026/to-study-how-chips-really-work-mit-researchers-built-their-own-operatin...
61•speckx•3d ago•3 comments

How Japan's railways stayed one while splitting apart

https://arun.is/blog/jr-logo/
54•ddrmaxgt37•1d ago•39 comments

Zero-Touch OAuth for MCP

https://blog.modelcontextprotocol.io/posts/enterprise-managed-auth/
129•niyikiza•6h ago•56 comments

Datasette Apps: Host custom HTML applications inside Datasette

https://simonwillison.net/2026/Jun/18/datasette-apps/
41•lumpa•2h ago•10 comments

I found 10k GitHub repositories distributing Trojan malware

https://orchidfiles.com/github-repositories-distributing-malware/
700•theorchid•16h ago•161 comments

Building a robotics research setup that lives next to my desk

https://dfdxlabs.com/research/2026/robotics-setup/
22•mplappert•13h ago•2 comments

Cell-based architecture for resilient payment systems

https://americanexpress.io/cell-based-architecture-for-resilient-payment-systems/
100•birdculture•3d ago•38 comments

Ubiquiti: Enterprise NAS, Built on ZFS

https://blog.ui.com/article/introducing-enterprise-nas
282•ksec•13h ago•254 comments

CS 6120: Advanced Compilers: The Self-Guided Online Course (2020)

https://www.cs.cornell.edu/courses/cs6120/2025fa/self-guided/
329•ibobev•16h ago•48 comments

Zork name origin got an update on Wikipedia

https://www.dpolakovic.space/blogs/zork-part2#update
68•dpola•7h ago•10 comments

Flexport (YC W14) Is Hiring in Indonesia, India, and Thailand

https://www.flexport.com/company/careers/
1•thedogeye•2h ago

Hospitals and universities repurposing drugs at lower cost

https://www.kcl.ac.uk/news/hospitals-and-universities-repurposing-drugs-at-90-lower-cost
293•giuliomagnifico•17h ago•128 comments

I told them forced consent was unlawful. 5 years later it cost Elkjop €1.8M

https://www.thatprivacyguy.com/blog/elkjop-forced-consent-fine/
265•speckx•9h ago•117 comments

Show HN: Are You in the Weights?

https://www.intheweights.com/
231•turtlesoup•7h ago•137 comments

.gitignore Isn't the only way to ignore files in Git

https://nelson.cloud/.gitignore-isnt-the-only-way-to-ignore-files-in-git/
314•FergusArgyll•17h ago•108 comments

Launch HN: TesterArmy (YC P26) – Agents that test web and mobile apps

https://tester.army
108•okwasniewski•13h ago•47 comments

Swiss parliament lifts ban on new nuclear power plants

https://www.bluewin.ch/en/news/switzerland/parliament-lifts-ban-on-new-nuclear-power-plants-32575...
717•leonidasrup•13h ago•603 comments

The Token Compression Illusion: Why I'm Skeptical of RTK

https://mroczek.dev/articles/the-token-compression-illusion-why-im-skeptical-of-rtk/
86•lackoftactics•10h ago•90 comments

W Social, public institutions and the theater of European digital sovereignty

https://blog.elenarossini.com/w-social-public-institutions-and-the-theater-of-european-digital-so...
179•nemoniac•15h ago•120 comments

If your product is Great, it doesn't need to be Good (2010)

http://paulbuchheit.blogspot.com/2010/02/if-your-product-is-great-it-doesnt-need.html
41•skogstokig•3d ago•30 comments

Noam Shazeer Joins OpenAI

https://twitter.com/NoamShazeer/status/2067400851438932297
303•lukasgross•1d ago•293 comments

Modos Color Monitor Pushes E-Paper Displays Further

https://spectrum.ieee.org/modos-e-paper-monitor
238•Vinnl•16h ago•67 comments

How Alberta Eradicated Rats

https://worksinprogress.co/issue/albertas-war-on-rats/
146•tzury•14h ago•106 comments

Show HN: Gerrymandle - Daily puzzle game where you redraw electoral districts

https://gerrymandle.cc/
147•realmofthemad•13h ago•66 comments

Migrating from GNU Stow to Chezmoi

https://rednafi.com/misc/chezmoi/
111•speckx•10h ago•105 comments

Horizons JPL Solar System Data Demo and NASA DSN Updates: Datastar, Common Lisp

https://horizons.lambda-combine.net/
34•adityaathalye•4d ago•1 comment

Agentic Resource Discovery Specification

https://agenticresourcediscovery.org/introduction/
55•damick•1d ago•15 comments

Update on Ocean Observatories Initiative

https://www.nsf.gov/news/update-ocean-observatories-initiative
116•andsoitis•4h ago•20 comments

Emacs 31 is around the corner: The changes I'm daily driving

https://www.rahuljuliato.com/posts/emacs-31-around-the-corner
420•frou_dh•15h ago•237 comments

Show HN: Talos – Open-source WASM interpreter for Lean

https://github.com/cajal-technologies/talos
17•mfornet•14h ago•2 comments

GLM-5.2: The Most Powerful Open Model yet and the Brutal Reality of Running It

https://vettedconsumer.com/glm-5-2-the-most-powerful-open-weight-model-yet-and-the-brutal-reality-of-running-it-locally/
37•ermantrout•2h ago

Comments

walrus01•1h ago
Before people go and drop a gargantuan sum of money on a server capable of running it entirely in GPU, there's still a fair amount of used x86-64 servers capable of running it in CPU and RAM (using llama-server) for probably under $6000. For example a Dell R640 with two older Xeon 18-core CPUs and 1TB of RAM. Test it out at a slow token/sec rate and see if it fits your needs.

Same idea for Kimi.

qingcharles•1h ago
Agreed. There are some crazy good deals on these older servers. For me, the inference speed would be fine as I'd just get on with a million other tasks between each response.
sgc•1h ago
To check whether I understand how this all works: wouldn't a 4-bit quant run reasonably well (for that hardware) with far less RAM, something like 1.5x the 476 GB, or 714 GB+ of RAM?
walrus01•53m ago
Yes, but the price difference between a used x86-64 server with 512GB and one with 1024GB isn't that great. If you're already determined to buy the hardware to run a "large" model on CPU (e.g. not Qwen 3.6 35B-A3B, gemma4 or similar size), the loss of quality and sometimes suspicious nature of the output from a 4-bit quant might be undesirable vs running a Q8 quant or full precision.

You would also want a lot of RAM for context/KV cache to make it usable, so having just enough RAM to fit a Q4 model and run it (before any cache starts getting populated through active use) isn't enough.
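
The sizing rule in this subthread can be sketched in a few lines of Python. The 476 GB weight size and the 1.5x headroom factor come from the comments above; the function names, and the idea of folding KV cache and runtime overhead into a single multiplier, are illustrative assumptions rather than measured requirements.

```python
def estimate_ram_gb(weights_gb: float, headroom_factor: float = 1.5) -> float:
    """Rough RAM needed: quantized weight size times a headroom multiplier.

    headroom_factor is a rule-of-thumb assumption covering KV cache,
    context, and runtime overhead; it is not a measured value.
    """
    return weights_gb * headroom_factor


def fits_in_ram(server_ram_gb: float, weights_gb: float,
                headroom_factor: float = 1.5) -> bool:
    """True if the server has at least the estimated amount of RAM."""
    return server_ram_gb >= estimate_ram_gb(weights_gb, headroom_factor)


if __name__ == "__main__":
    q4_weights_gb = 476  # 4-bit quant size quoted in the thread
    print(estimate_ram_gb(q4_weights_gb))    # 714.0
    print(fits_in_ram(512, q4_weights_gb))   # False
    print(fits_in_ram(1024, q4_weights_gb))  # True
```

Under these assumptions a 512GB box falls short once headroom is counted, while a 1024GB box clears it, which matches the point that weights alone are not the whole budget.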

tfirst•1h ago
If model performance continues to scale with model size, I have a hard time seeing how local models will have any chance of competing with models hosted on datacenter hardware.

1. There are strong economies of scale in hosting inference (batched prompts, high uptime, shared infrastructure).

2. There are physical limits on how much memory we will be able to produce over the next few years. Demand will probably scale at least as fast as production does, so we won't be saved by falling prices.

dabinat•1h ago
Cloud models will always be ahead, but not every task needs Fable-level intelligence. The number of usable situations for local models will increase as hardware and open-weight models improve.
walrus01•1h ago
There's value for many people and organizations in running a model locally on hardware they fully own and control (or pay to colocate in a datacenter somewhere) vs running a model on something owned/controlled by a third party, for example for highly privacy-sensitive or medical applications. It's not just a question of raw efficiency in dollars per tokens per day or tokens/second.
butvacuum•1h ago
For your first point: you've just repeated "shared tenant," a scaling factor that's been used since before the turn of the millennium. Uptime is, as always, irrelevant for personal/homelab vs cloud. It shifts from uptime to pure finances (capex first, then how you account for "wasted" time).

2) The current memory crunch is more political than cyclical. The only reason we have fabs as far into construction as we do is the CHIPS Act, which predates LLMs' public existence by more than 6 months. The horrific silicon prices are a direct result of OpenAI's openly illegal dealings. Their pretense of needing it for Stargate gets sundered further with each missed or cancelled deadline.

They predicted the political and regulatory outcome superbly.

kristianp•1h ago
Irritating LLMisms:

    - "real architecture trick"
    - "the honest hardware reality of running it at home."
    - "What it is — and what Z.ai claims"
    - "The one genuinely new idea"
And many more.
LeoPanthera•1h ago
I've been using "the one genuinely adjective noun" for years as a weird English tic, and it bothers me that it's become an LLM tell.
butvacuum•1h ago
That's because most of the "tells" expose more about the "reader" than the content.
CorpOverreach•1h ago
Yep. The entire thing. Instant turn-off when reading an article.

I'm sure the content does have some value, and perhaps someone spent time putting together an original copy that they thought was going to be made better by having AI "make it better".

Actually, I take some of that back - most of the site seems to be AI written, following the formula of "ingest multiple sources" => feed to AI => write article.

KaoruAoiShiho•1h ago
Terrible, zero-value article. I am extremely surprised it is upvoted.

That being said, Artificial Analysis just came out with a brand new benchmark where it scored between opus 4.8 and gpt-5.5 and well behind fable-5, so it's definitely frontier-ish: https://x.com/ArtificialAnlys/status/2067744637155226101

CorpOverreach•1h ago
I do think it's going to get harder and harder to run bleeding-edge models; this is just the start of it.

It being hard for the average joe to run these at their fullest potential is unfortunate, but the important part is that _you can_, assuming you can acquire the resources.

I think that's going to be important for the sake of preserving privacy and freedom of information in the long run. We're seeing this play out right now with Anthropic originally playing the "safety" card for why they can't let everyone at Mythos, and subsequently getting on the US Gov't radar, with access to Fable being pulled.

The next biggest milestone will be an open-weights challenger to Mythos. There'll be consequences to that, but I feel those are less bad than someone else deciding what you can and can't use a model for.

lamida•1h ago
Pretty sure the article is fully written by an LLM without any editing at all. See all the — em dashes sprinkled all over.
blackoil•1h ago
I think people overrate the 'local' part of open models vs private. With OpenAI my choice is 1. I have to use them, even if they decide to double the cost or work with govt to blow up my country. My $5 server can't run GLM, but I have a choice of many providers based on my requirements for cost, data residency, and political alignment.
easygenes•1h ago
Article reads as though written by someone who doesn't have much experience with deployments like this. Underestimates the memory needed to run with a reasonable amount of context. Misses two other obvious targets:

  1) 4x DGX Spark (or equivalent other GB10 boxes) with a switch (MikroTik CRS504 or CRS804) and TP=4.
  2) 4x RTX PRO 6000 box. Probably the most practical for cost/perf if you want on-prem as an individual.
Both would be best running a 2-bit quant so everything can stay resident. (The article claims you could run a 4-bit quant with 4x RTX 6000 Ada, and while technically true, it would mean a lot of the weights are streaming from DRAM, so it would be slow and impractical. You would need 8x RTX PRO 6000 to run 4-bit at a good speed.)

This model quantizes unusually well: https://unsloth.ai/docs/models/glm-5.2#quantization-analysis
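
A back-of-the-envelope check of the "stay resident" argument, as a hedged Python sketch. It takes the 476 GB figure quoted upthread for the 4-bit quant and scales it linearly with bit width, which is a simplifying assumption (real 2-bit quants often keep some layers at higher precision and come out larger). The per-device memory figures (96 GB for RTX PRO 6000, 48 GB for RTX 6000 Ada, 128 GB unified for a GB10 box) are assumptions from published specs, not from the article, and KV cache is ignored entirely.

```python
# Assumed per-device memory in GB (not stated in the thread).
RTX_PRO_6000_GB = 96
RTX_6000_ADA_GB = 48
GB10_GB = 128  # unified memory on a DGX Spark class box


def scaled_weights_gb(known_gb: float, known_bits: int, target_bits: int) -> float:
    """Naive linear scaling of weight size with bits per weight."""
    return known_gb * target_bits / known_bits


def stays_resident(num_devices: int, mem_per_device_gb: float,
                   weights_gb: float) -> bool:
    """True if the weights alone fit in the pooled device memory."""
    return num_devices * mem_per_device_gb >= weights_gb


if __name__ == "__main__":
    q4_gb = 476                            # 4-bit size quoted upthread
    q2_gb = scaled_weights_gb(q4_gb, 4, 2)
    print(q2_gb)                                        # 238.0
    print(stays_resident(4, RTX_PRO_6000_GB, q2_gb))    # True
    print(stays_resident(4, GB10_GB, q2_gb))            # True
    print(stays_resident(4, RTX_6000_ADA_GB, q4_gb))    # False
    print(stays_resident(8, RTX_PRO_6000_GB, q4_gb))    # True
```

With these assumed numbers the sketch lands where the comment does: a 2-bit quant fits on either 4-device setup, 4-bit spills out of 4x RTX 6000 Ada, and 4-bit needs roughly 8x RTX PRO 6000 to stay in VRAM.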

redox99•46m ago
Can you really say you're running GLM 5.2 if it's a 2-bit quant? It might be usable but the capabilities will definitely not be the same.
ma2kx•29m ago
That's just stupid.

- Why should I run it on local hardware when there are already about a dozen US providers available?

- Comparing the token usage per task with GLM 5.1 is worthless when GLM 5.1 is unable to do the task.

- Not even z.ai itself runs the model with BF16 weights.

- I couldn't care less how good the model is at drawing a pelican on a bicycle.