frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

ContextMaestro – Curated Engineering Feed

https://www.contextmaestro.com/
1•jazzboss•3m ago•0 comments

Knowledge Catalog – universal context engine for agents

https://github.com/GoogleCloudPlatform/knowledge-catalog
1•modinfo•5m ago•0 comments

Google Investing in 'Backrooms' Studio A24 in AI research partnership

https://www.wsj.com/tech/ai/google-investing-in-backrooms-studio-a24-e7585ebe
1•jaredwiener•8m ago•0 comments

Citroën Ami Is an Ultra Affordable EV (2020)

https://insideevs.com/news/401218/citroen-ami-deliveries-june/
1•rawgabbit•11m ago•0 comments

AI's PR Problem

https://blog.dshr.org/2026/05/ais-pr-problem.html
4•linsomniac•13m ago•0 comments

Frozen Reformer

https://arunc.dev/essays/frozen-reformer/
1•arunc•13m ago•0 comments

Ask HN: How do you make the LLM generate good code?

1•bjourne•15m ago•0 comments

Why AI Is a Bubble

https://federicozebele.substack.com/p/this-is-why-ai-is-a-bubble-and-what
3•stanislavb•16m ago•1 comments

Europe must choose between AI and climate goals, data center lobby says

https://www.politico.eu/article/europe-choose-ai-climate-goals-data-center-chief-warns/
3•cdrnsf•17m ago•1 comments

AI Is Not a Tool

https://theconvivialsociety.substack.com/p/your-ai-is-not-a-tool
3•longdefeat•18m ago•0 comments

Robots will replace 700k delivery workers 'sooner or later' warns JD.com boss

https://www.ft.com/content/465635e2-633b-4311-afe5-9b3bff8c9240
3•momentmaker•20m ago•1 comments

The AI shift in cyber risk: why leaders must act now

https://www.cyber.gov.au/about-us/view-all-content/news/five-eyes-cyber-security-agencies-statement
2•Khaine•20m ago•0 comments

Kya is hiring an AI/ML Engineer

https://www.kyahq.com/careers/software-engineer-ai-ml
1•Johnall_n•21m ago•1 comments

Bipartite Matching Is in NC

https://scottaaronson.blog/?p=9851
2•amichail•23m ago•0 comments

Q.js: modern front-end framework for 2026. No build scripts unlike React et al.

https://www.npmjs.com/package/@qbix/q.js
1•EGreg•24m ago•1 comments

Hyperbolic Discounting

https://en.wikipedia.org/wiki/Hyperbolic_discounting
1•rzk•24m ago•0 comments

Report: Kennedy Space Center not ready for era of super heavy rockets

https://arstechnica.com/space/2026/06/report-kennedy-space-center-not-ready-for-era-of-super-heav...
1•voxadam•26m ago•0 comments

Show HN: FastAPI Cloud is in public beta, deploy apps with FastAPI deploy

https://fastapicloud.com/
1•tiangolo•26m ago•1 comments

Payoff Progress of an Amortizated Loan

https://push.cx/payoff-progress
1•pavel_lishin•29m ago•0 comments

Daybreak

https://openai.com/daybreak/
1•Recursing•31m ago•0 comments

Mod Logs: Save every change, thank yourself later

https://unstack.io/mod-logs-save-every-change-thank-yourself-later
2•ScottWRobinson•31m ago•0 comments

Knowledge Agents: Beat Frontier Models with Better Structure

https://weightythoughts.com/p/knowledge-agents-beat-frontier-models
1•lklinger•31m ago•0 comments

Show HN: Who's in the weights? – which people 13 language models know

https://whos-in-the-weights.vercel.app/
1•heterodoxjedi•32m ago•0 comments

PsychAdapter: Personality in LLM output via trait-language patterns, not prompts

https://github.com/humanlab/psychadapter
1•indynz•34m ago•0 comments

A Source of Mysterious Repeating Radio Signals from Space Has Been Identified

https://www.wired.com/story/a-source-of-mysterious-repeating-radio-signals-from-space-has-been-id...
1•ubutler•35m ago•0 comments

The database that refused to die: How Postgres survived its own creators

https://www.theregister.com/databases/2026/06/22/the-database-that-refused-to-die-how-postgres-su...
2•jnord•36m ago•0 comments

The fake ABC News articles trying to sell you a scam

https://www.abc.net.au/news/2026-06-23/fake-abc-website-scam-facebook-ads/106653690
2•Gaishan•36m ago•0 comments

Vibedrop: Ephemeral Hosting for Agents

https://vibedrop.sh/
1•mormonnegro•38m ago•0 comments

Trump Demands "?" For the "Vandalism" of a $14M Swimming Pool

https://thenewassociationwebmasters.blogspot.com/2026/06/trump-demands-years-in-prison-after.html
4•laurentlof•38m ago•5 comments

Worldfall- a beautiful web novel about change and the diffusion of technology

https://worldfall.ink/
1•pfwitt•38m ago•1 comments
Open in hackernews

Unsloth GLM-5.2 – How to Run Locally

https://unsloth.ai/docs/models/glm-5.2
50•TechTechTech•1h ago

Comments

xrd•46m ago
So close! My machine with 192GB RAM + RTX 3090 24GB can almost run this. It says it needs 24GB of VRAM and 256GB of RAM for MoE offloading.

https://unsloth.ai/docs/models/glm-5.2#usage-guide

In a prior thread, someone said it would take $500k in hardware:

https://news.ycombinator.com/item?id=48629970

mgambati•42m ago
With 2 wouldn’t have good results. Ideal range for coding is at least Q8.
kibibu•34m ago
According to this very article, 4-bit dynamic is essentially lossless
cheema33•29m ago
I have the RAM, but not the VRAM. What kind of speed/tps could you expect from a 3090 with 24GBs of RAM? I am somewhat tempted to pick a GPU with 24GBs of RAM.
zuzululu•29m ago
wonder if AMD's new ai chip can run this with ease? I'm seriously consider buying it. GLM 5.2 is just shy of GPT 5.4 so I would welcome offloading any grunt work locally

I am very excited for local LLMs I think we may have GPT 5.5-xhigh level of performance for under 2000 EUR

This should put more pressure on the frontier models to avoid sitting on any fancy stuff and lower token prices as a whole.

Nothing beats a local LLM disconnected from the cloud.

Iolaum•22m ago
At full quantization GLM 5.2 may be close to GPT 5.4. But at Q2 or whatever one needs in order to run it on a pro-sumer device it will be worse.

Also I m not sure where you are getting the under 2k value. I bought a Framework desktop 128GB last year and my setup was around 2.7k. The same setup now sells for around 4.7k.

kccqzy•18m ago
The AMD 395 chip supports up to 128GB unified RAM. So still not enough even at 1-bit quant unfortunately.
benjiro29•14m ago
"GLM 5.2 is just shy of GPT 5.4"... If your running the full model. As in have 750 (FP8) to 1.5TB(FP16) of memory available.

Do not mix the benchmark results of GLM 5.2 FP16/FP8 with FP4 or FP2.

* FP4 will mean a accuracy loss of about 3%. Not noticeable but more chance for mistakes.

* FP2 ... what is what most people are able to run at home, for a "reasonable" price. Your looking at over 17% loss in accuracy.

At that point, your running at less then claude-sonnet-4.6, as the issues compound with accuracy losses. And reasonable priced is still in the ~ $5000 range (192GB + GPU 32GB active/kv cache system).

For that price your using a Codex / Claude Pro subscription for the next 4+ years with better models (by default), let alone with a FP2 GLM 5.2 version. And your looking at < 10 fps. A MacStudio with 512GB will net you 18 a 20fps+ with FP4, but ... i mean, those used to be $10.000.

Unfortunately the local hardware cost is a major issue for running large models like that.

pheggs•28m ago
I feel like the gap is closing to be able to run good enough models locally even for coding and I would assume it could make some companies a bit nervous. Am I wrong about that?
fny•27m ago
The RAM requirements are still pretty painful.
yieldcrv•9m ago
equilibrium in one or two more years on the consumer/prosumer side

think Apple M6 or M7 with a currently unforeseen denser memory style, 256gb RAM

a couple inference or cache improvements on the algorithmic side, using less ram for context windows and doubling token speed again

denser open source models, packing more experts for smaller active layers

it'll still be expensive but like $8,000 - $13,000 instead of $450,000 worth of B200s

CamouflagedKiwi•23m ago
The hardware requirements to run this locally are still very high. Seems far enough off mainstream for those companies not to be too worried yet.
cogman10•19m ago
I don't think so. I could easily see a company deciding to host and run these models for their own development. If you have a dev team of about 10 people, a one time $50k investment in an LLM server has to be pretty tempting. Unlimited tokens, decent performance, upgrade options, and potential product integrations.

For companies wanting LLMs in their products in general, I have to think going the local llm route is even more tempting. Somewhat dumb models are more than good enough for a lot of the things people are integrating LLMs into their products.

nh43215rgb•9m ago
Even with upcoming AI Max+ PRO 495 we are capped with 192GB, so no...