frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Previewing GPT‑5.6 Sol: a next-generation model

https://openai.com/index/previewing-gpt-5-6-sol/
912•minimaxir•12h ago•556 comments

Why does kinetic energy increase quadratically, not linearly, with speed? (2011)

https://physics.stackexchange.com/questions/535/why-does-kinetic-energy-increase-quadratically-no...
159•ProxyTracer•6h ago•74 comments

WordStar: A Writer's Word Processor (1996)

https://www.sfwriter.com/wordstar.htm
15•droidjj•1h ago•9 comments

U.S. allows Anthropic to release Mythos AI to ‘trusted’ US organizations

https://www.semafor.com/article/06/27/2026/us-releases-powerful-anthropic-model-mythos-to-some-us...
319•bobrenjc93•6h ago•324 comments

AI in mathematics is forcing big questions

https://spectrum.ieee.org/ai-in-mathematics
85•rbanffy•6h ago•52 comments

MicroVMs: Run isolated sandboxes with full lifecycle control

https://aws.amazon.com/blogs/aws/run-isolated-sandboxes-with-full-lifecycle-control-aws-lambda-in...
297•justincormack•3d ago•165 comments

Show HN: Hacker News on a train station-style flip board

https://popflame.quickish.space/hn-flipboard/
38•PaybackTony•4h ago•6 comments

Hellishly Slow Level 13 Deflate Compression

https://kirill.korins.ky/articles/hellishly-slow-level-13-deflate-compression/
22•zX41ZdbW•4d ago•2 comments

The gap between open weights LLMs and closed source LLMs

https://blog.doubleword.ai/frontier-os-llm
162•kkm•8h ago•129 comments

U.S. government will decide who gets to use GPT-5.6

https://www.washingtonpost.com/technology/2026/06/26/openai-says-us-government-will-vet-users-its...
923•alain94040•10h ago•987 comments

Fusion Programming Language

https://fusion-lang.org/
21•efrecon•2d ago•6 comments

A C++ implementation of a fast hash map and hash set using hopscotch hashing

https://github.com/Tessil/hopscotch-map
79•gjvc•7h ago•13 comments

Om

https://daringfireball.net/2026/06/om
238•throw0101a•5h ago•14 comments

Show HN: DBOSify – Drop-in Temporal replacement built on Postgres

https://github.com/dbos-inc/dbosify-py
38•KraftyOne•2d ago•7 comments

We can still stop California's 3D printer surveillance scheme

https://www.eff.org/deeplinks/2026/06/we-can-still-stop-californias-3d-printer-surveillance-scheme
306•hn_acker•8h ago•111 comments

Show HN: Turn images into audio that can be decoded with a spectrogram

https://nsspot.herokuapp.com/imagetoaudio/
5•jupr•2d ago•0 comments

Foreign funds help make housing unaffordable: research

https://news.mccombs.utexas.edu/research/foreign-funds-help-make-housing-unaffordable/
26•hhs•5h ago•5 comments

Anatomy of a Failed (Nation-State?) Attack

https://grack.com/blog/2026/06/25/dissecting-a-failed-nation-state-attack/
8•signa11•2h ago•2 comments

Ultrasound imaging of the brain

https://alephneuro.com/blog/ultrasound-brain
258•rossant•17h ago•107 comments

Making Sense of Proof by Contradiction [pdf]

https://www.foster77.co.uk/Foster,%20Scottish%20Mathematical%20Council%20Journal,%20Making%20sens...
22•surprisetalk•3d ago•6 comments

Show HN: Smart model routing directly in Claude, Codex and Cursor

https://github.com/workweave/router
158•adchurch•12h ago•91 comments

Hightouch (YC S19) Is Hiring

https://hightouch.com/careers#open-positions
1•joshwget•8h ago

Long Wave radio era set to end with Droitwich switch-off

https://www.bbc.com/news/articles/c74yn7v7k4qo
72•speckx•10h ago•30 comments

What Is a Nomogram and Why Would It Interest Me?

https://lefakkomies.github.io/pynomo-doc/introduction/introduction.html#what-is-a-nomogram-and-wh...
100•Eridanus2•11h ago•18 comments

A Tiny Compiler for Data-Parallel Kernels

https://healeycodes.com/a-tiny-compiler-for-data-parallel-kernels
32•healeycodes•1d ago•4 comments

The "Bizarre Headgear" exhibit at the Sam Noble museum

https://svpow.com/2026/05/15/the-bizarre-headgear-exhibit-at-the-sam-noble-museum-is-incredible/
72•surprisetalk•3d ago•7 comments

Pre-Modern Armies for Worldbuilders, Part III: Paying for It

https://acoup.blog/2026/06/26/collections-pre-modern-armies-for-worldbuilders-part-iii-paying-for...
74•jfoucher•11h ago•13 comments

A human postmortem of the 1996 AOL outage

https://ngrok.com/blog/aol-was-down-1996
49•EndEntire•2d ago•10 comments

PlayStation Is Deleting 551 Movies from Customers' Accounts

https://kotaku.com/playstation-store-movies-digital-studio-canal-terminator-2000711013
224•ortusdux•9h ago•128 comments

Ask HN: MacBook vs. Dedicated GPU for LLM

17•mzubairtahir•1h ago•29 comments
Open in hackernews

Ask HN: MacBook vs. Dedicated GPU for LLM

17•mzubairtahir•1h ago
For those who are using llms on macbook, Want to understand how macbook is different than dedicated GPU in running those models? and how to know how much a macbook is capable of running a model?

Comments

JSR_FDED•1h ago
MacBooks with their unified memory behave like a slow GPU with enormous amount of video RAM. So you can run large smart models slowly.

Dedicated GPUs have less video RAM so can run smaller less smart models quickly.

exabrial•1h ago
Do Mac Pros provide more headroom? noob here, noob questions
rho138•59m ago
Idk why you’re being downvoted for asking a question. Pending specs they _could_ provide more headroom for a larger model but they would still be limited by the CPU and it’s associated bus speeds.
JSR_FDED•24m ago
In what sense? Headroom for what?
mzubairtahir•52m ago
how much memory is actually useable by gpu in macbook? as it is shared?
pylotlight•37m ago
roughly ~50–56GB, although this is somewhat configurable with iogpu.wired_limit_mb. By default, macOS reserves ~25% of memory for the system.
visarga•48m ago
Macbook M5 64GB - can run gemma-4-26b-a4b-it-4bit and Qwen3.6-35B-A3B-4bit at about 1500 tps prefix and 45 tps decode on contexts up to 100K tokens using MLX. It's faster than Claude. I was really surprised, chat quality is also similar to Claude for gemma4. Agentic works but does not compare to cloud models, you can still make agents where top level is code.
mzubairtahir•46m ago
sorry but asking again: how much memory is actually useable by gpu in macbook? as it is shared(os and apps also have to use same memory)? and it is different than dedicated gpu memory?
gizajob•36m ago
It’s completely shared so the OS and everything else takes up maybe 8GB of the RAM. On a 64GB machine you can run models about 45GB in size and still have space for those models run other tasks which themselves might need ram. To a user, the GPU appears to just use the RAM as much as it needs same as any other process running on the system. You can see what space your LLMs are taking up in Activity Monitor (or htop) and how much GPU capacity they’re using (all of it)
j45•17m ago
You can adjust the percentage available both on the MacOS side and how much the model uses.
epsteingpt•1h ago
Both are going to be super super slow and low payback.

You gotta really want it right now.

It's still early!

jpgvm•1h ago
If you want a massive MacBook anyway then it's great. They are decent for local LLMs, awesome for local image models and it's a MacBook so AppleCare+ has your back. IMO it's a no brainer if you wanted a MacBook anyway but it's a poor choice if your reason to buy it is to run LLMs.
mzubairtahir•51m ago
are you saying because of speed or it just cant run them?
zepearl•21m ago
I agree. To run an acceptable model (e.g. Qwen/Qwen3.6-27B or google/gemma-4-31B) with a good quantization (minimum Q5) with a good context size (min 64k) you could buy 2 or even 3 GTX 5060 16GiB VRAM for ~550$ each. Fyi the much faster MoE models were useless for my usecases - e.g not able to correctly identify me/I/you, endless thinking loops, etc.

I'm currently running those models using an RTX 5070 12GiB + RTX 5060 16GiB + RTX 3060 12GiB with a 96k context size with MTP/speculative decoding and I'm quite happy (the 5070 is about 4x faster than the 3060, the 5060 is inbetween them so about 2x faster than a 3060).

brcmthrowaway•1h ago
Dual 3090 >>> Any Apple product.
gnabgib•56m ago
> Dual 3090 >>> Any Apple product.

Dual 3090s are terrible airpods

tim-tday•35m ago
I snorted
kylec•52m ago
Doesn't the 3090 cap out at 24GB VRAM? That's not a lot to run a local model
mzubairtahir•48m ago
but still it can run handsome models
nichch•1h ago
My opinion is that you should wait for 6-12 months before making a purchase either way.

Open weight models are getting good. With GLM 5.2 now chasing Opus, I'm very excited to see a smaller model's distillation.

Plus, the OLED MacBook Pro should be released by then.

Frannky•33m ago
This is my opinion too. Even if you buy hardware like a cluster of 8xGB10s or 4 A100s, they'll still be slow and a little dumber than what you're used to. We need to wait a little for better hardware. Lots of companies are pushing the frontier, so hopefully it'll come very soon.

Competition and innovation will hopefully make the bubble pop, and we'll get reasonably priced local hardware to run very intelligent models. Something like Talaas with GLM 5.2 would be pretty cool. Or Apple printing the latest model onto hardware—it would give a new reason to buy a new Mac every year (a new ai model with every new version).

gizajob•23m ago
The hardware is here today for people prepared to tolerate mild amounts of latency. It’s easy to forget that computing tasks used to often take major amounts of time - rendering an audio file, rendering a video, transcoding – all kinds of tasks took minutes or even hours of the computer spinning its fans on maximum just to deliver the result. AI and agentic AI and diffusion is the next round of that - trading a small bit of your waiting time for phenomenal power. The datacentre builders trying to get you hooked on instant responses on the LLM platforms have made you think that a “good” AI responds instantly and completely interactively - they can still be brilliant with a bit of delay. And having a competent agent doing things on my local machine, it doesn’t really matter if it takes ten minutes or an hour or six hours to complete a task while I’m out doing other things.
Frannky•8m ago
cylentwolf•56m ago
I asked a few of my friends that are ML engineers this question and all of them said to run the LLMs in the cloud with their infrastructure because it was going to be way faster. If you just want to tinker around I would look at @JSR_FDD's comment.
gizajob•46m ago
Local LLMs running in LM Studio on a MacBook Pro work great, if you’re prepared to wait for the answers because using an LLM locally is much much slower than having the instant results appear when using an online LLM like ChatGPT or Claude. You can also run OpenClaw on the MacBook and have that act as the front end for the LLM, to get full interactivity and have it install command line tools on your Mac to perform whatever tasks you’ve set it.

If you don’t already have a MacBook, then there’s a bit of a sweet-spot for the AI experimenter right now, which is to buy a second-hand 16” MBP with an M1 Max chip and 64GB of shared ram. Because these are about 5 years old now, they have depreciated to the point where they can be had for around £1100 / €1300 / $1500 and make a phenomenal platform for learning because the 64Gb of shared memory means you can host models up to about 48GB in size, and then task them to do interesting things with coding without ever having to worry about token burn.

The downside is that they’re slow, and prone to having to be nudged to keep them on track, but that’s part of the fun too. The “latency” is atrocious granted - you ask something and the machine thinks for a few minutes before saying anything which is a different experience to using Claude. But… it does work. You can think of yourself more like a manager with a junior member of staff and set the machine running and leave it to do its thing for a couple of hours which can be actually useful work, but this approach will likely be shouted down by some commenters here who treat Claude like some kind of expensive and quick-fire dopamine pump. Can also use a Mac like this for running diffusion models for image generation and suchlike in ComfyUI, even though, again, results will be slow. Spending more money on a more recent MBP with as much RAM as you can afford will deliver the same results more expensively in a quicker and quicker time.

To get the same kind of size of model you’d have to combine a couple of Nvidia 3090 24GB cards in a decent workstation with the PCI capacity to handle them, or hack some kind of solution to hang GPUs off the back of a motherboard on ribbon cables with the GPUs running on their own PSU, which is what I’m building next… the difference is those cards have 24GB of vram and cost about $1000 each second-hand, but will operate much much faster than the M1 Max MBP, or even the most recent M5 because they have so much more bandwidth (because they’re burning 350 watts on GPU compute rather than 140 watts total which is what a super efficient MBP has for the cpu/gpu/screen/everything).

So say you had $6000 to spend today, you could buy a second hand workstation and craft a solution with external GPUs which would completely smoke any Mac in existence, even though macs have the edge in the size of model you’d can run (slowly) due to their shared memory. External GPUs and access to the Nvidia frameworks and general CUDA ecosystem wins out on the performance front though. A real sweet spot is to buy an M1 Max MBP and have that as your front end to a Linux workstation full of GPUs.

But any apple silicon MBP is a totally competent gateway drug to local agentic computing.

Google Gemini could give you an in-depth and useful discussion about this exact question.

browningstreet•20m ago
It’s kind of amazing how steadily this question is asked in every forum where it can be asked. Kind of amazing that the answers previously given can’t reach the next person who’s going to ask it.
g-technology•7m ago
Around February or march I started looking into hardware options to help me start learning about training models and working with them. My budget was limited and an apple refurbished 32 gb Mac mini was far and away the best option for my budget. I wish it was faster but I can let it run 24/7 with no noise and minimal power draw. I just arrange long running tasks for when am asleep or at work. Then as a huge plus I have an awesome daily driver machine for whatever else I want to do
Hmm, I have access to A100s and a GB10, but if I use the models hosted there to code, I waste a lot of time waiting for answers and correcting errors. The amount of work I get done thanks to the quality and speed of frontier hosted models let me be insanely productive and have a lot of free time. I could use the slow local setup, but at what price?
FireBeyond•7m ago
The racks we're deploying are effectively GB300 NVL72s: 72 Blackwell Ultra GPUs 36 Grace CPUs, 20.7TB of unified HBM3e.

Works out to about 1.1exaflops of fp4. Networking is 800gbps.

120kW per rack.