How I bypassed Amazon's Kindle web DRM

https://blog.pixelmelt.dev/kindle-web-drm/
647•pixelmelt•7h ago•204 comments

Claude Skills

https://www.anthropic.com/news/skills
520•meetpateltech•12h ago•290 comments

America’s semiconductor boom

https://www.youtube.com/watch?v=T-jt3qBzJ4A
106•zdw•5h ago•48 comments

Gemini 3.0 spotted in the wild through A/B testing

https://ricklamers.io/posts/gemini-3-spotted-in-the-wild/
300•ricklamers•11h ago•179 comments

Cloudflare Sandbox SDK

https://sandbox.cloudflare.com/
146•bentaber•7h ago•48 comments

A 4k-Room Text Adventure Written by One Human in QBasic No AI

https://the-ventureweaver.itch.io/tlote4111
68•ATiredGoat•4d ago•43 comments

Lead Limited Brain and Language Development in Neanderthals and Other Hominids?

https://today.ucsd.edu/story/did-lead-limit-brain-and-language-development-in-neanderthals-and-ot...
44•gmays•4h ago•11 comments

Your data model is your destiny

https://notes.mtb.xyz/p/your-data-model-is-your-destiny
196•hunglee2•2d ago•29 comments

DoorDash and Waymo launch autonomous delivery service in Phoenix

https://about.doordash.com/en-us/news/waymo
228•ChrisArchitect•14h ago•513 comments

Codex Is Live in Zed

https://zed.dev/blog/codex-is-live-in-zed
191•meetpateltech•12h ago•28 comments

Hyperflask – Full stack Flask and Htmx framework

https://hyperflask.dev/
298•emixam•15h ago•94 comments

Why I have to buy doughnuts with cash

https://www.ft.com/content/8766ef23-3938-4de2-8a37-602c798034aa
10•hhs•5d ago•14 comments

Talent

https://www.felixstocker.com/blog/talent
127•BinaryIgor•10h ago•54 comments

Understanding Spec-Driven-Development: Kiro, Spec-Kit, and Tessl

https://martinfowler.com/articles/exploring-gen-ai/sdd-3-tools.html
46•janpio•6h ago•5 comments

Syntax highlighting is a waste of an information channel (2020)

https://buttondown.com/hillelwayne/archive/syntax-highlighting-is-a-waste-of-an-information/
228•swyx•4d ago•92 comments

Post office in France rolls out croissant-scented stamp

https://www.ctvnews.ca/world/article/french-post-office-rolls-out-croissant-scented-stamp/
101•ohjeez•1w ago•37 comments

Microwave technique allows energy-efficient chemical reactions

https://phys.org/news/2025-10-microwave-technique-energy-efficient-chemical.html
35•rolph•6d ago•1 comment

Elixir 1.19

https://elixir-lang.org/blog/2025/10/16/elixir-v1-19-0-released/
225•theanirudh•20h ago•48 comments

A liver transplant from start to finish

https://press.asimov.com/articles/liver
13•mailyk•4d ago•2 comments

Electricity can heal wounds three times as fast (2023)

https://www.chalmers.se/en/current/news/mc2-how-electricity-can-heal-wounds-three-times-as-fast/
144•mgh2•15h ago•90 comments

Benjie's Humanoid Olympic Games

https://generalrobots.substack.com/p/benjies-humanoid-olympic-games
105•robobenjie•8h ago•78 comments

How to tame a user interface using a spreadsheet

https://blog.gingerbeardman.com/2025/10/11/how-to-tame-a-user-interface-using-a-spreadsheet/
99•msephton•6d ago•21 comments

A conspiracy to kill IE6 (2019)

https://blog.chriszacharias.com/a-conspiracy-to-kill-ie6
169•romanhn•9h ago•100 comments

Lace: A New Kind of Cellular Automata Where Links Matter

https://www.novaspivack.com/science/introducing-lace-a-new-kind-of-cellular-automata
122•airesearcher•14h ago•48 comments

Show HN: Inkeep (YC W23) – Agent Builder to create agents in code or visually

https://github.com/inkeep/agents
64•engomez•15h ago•47 comments

Hacker News – The Good Parts

https://smartmic.bearblog.dev/why-hacker-news/
116•smartmic•7h ago•131 comments

A stateful browser agent using self-healing DOM maps

https://100x.bot/a/a-stateful-browser-agent-using-self-healing-dom-maps
110•shardullavekar•15h ago•54 comments

VOC injection into a house reveals large surface reservoir sizes

https://www.pnas.org/doi/10.1073/pnas.2503399122
91•PaulHoule•5d ago•79 comments

Eon – An Effects-Based OCaml Nameserver

https://ryan.freumh.org/eon.html
54•Bogdanp•5d ago•3 comments

Nvidia DGX Spark and Apple Mac Studio = 4x Faster LLM Inference with EXO 1.0

https://blog.exolabs.net/nvidia-dgx-spark/
40•edelsohn•4h ago•14 comments

Nvidia DGX Spark and Apple Mac Studio = 4x Faster LLM Inference with EXO 1.0

https://blog.exolabs.net/nvidia-dgx-spark/
40•edelsohn•4h ago

Comments

pram•4h ago
Very cool, using the DGX like an “AI eGPU.” I wonder if this could also benefit stuff like Stable Diffusion/WAN etc?
dekhn•4h ago
Are you using USB-C for networking between the Spark and the Mac?
pdpi•2h ago
IP over Thunderbolt is definitely a thing; I don't know whether IP over USB is also a thing. USB4 v2 or TB5 can do 80 Gbit/s symmetrical or 120+40 Gbit/s asymmetrical (and boy is this a poster child for the asymmetrical setup). The Mac definitely supports that fine, so as long as the Spark plays nice, USB is actually a legitimately decent choice.
esseph•1h ago
USB4 is based on Thunderbolt 3.

Yes, it's a thing that works.
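
To put the link speeds mentioned above in perspective, here is a rough sketch of ideal transfer times, reading the figures as gigabits per second. The 1 GB payload is a hypothetical number chosen for illustration, and the calculation ignores protocol overhead and latency, so real transfers would be slower.

```python
def transfer_seconds(payload_bytes: float, link_gbit_per_s: float) -> float:
    """Ideal transfer time; ignores protocol overhead and latency."""
    bits = payload_bytes * 8
    return bits / (link_gbit_per_s * 1e9)


# Hypothetical 1 GB payload (an assumption, not a figure from the article).
payload = 1e9

links = [
    ("symmetric", 80),
    ("asymmetric, fast direction", 120),
    ("asymmetric, slow direction", 40),
]
for label, rate in links:
    print(f"{label}: {transfer_seconds(payload, rate) * 1000:.1f} ms")
```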

mehdibl•3h ago
The gain is only in prefill, and if the task/output is complex the gain will be minor. So the numbers here are quite exaggerated, based on a prompt that takes less than 2s to decode. I guess we are not doing complex tasks with hundreds or a thousand tokens of output here. For the cost of an M3 Ultra + DGX the gain seems minimal. Most of all, exo didn't clarify the model used here; it's for sure not a dense model or an MoE with 1B or 2B experts, otherwise the Mac Ultra too would suffer a lot and the layers would be bigger!
solarkraft•2h ago
Anecdotally, even medium-sized prompts (a few thousand tokens) on pretty small models (8-2B) have resulted in extremely noticeable slowdowns (vast majority of total processing time) on my M1 Mac, leading me to appreciate the significance of the pre-fill step (and difficulty of processing large contexts locally).
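
The disagreement in this sub-thread comes down to how much of the total time is spent in prefill. A minimal Amdahl-style sketch makes that concrete. All timings and the 4x prefill speedup below are hypothetical values picked for illustration, not measurements from the article.

```python
def end_to_end_speedup(prefill_s: float, decode_s: float, prefill_speedup: float) -> float:
    """Overall speedup when only the prefill phase gets faster (Amdahl's law)."""
    baseline = prefill_s + decode_s
    accelerated = prefill_s / prefill_speedup + decode_s
    return baseline / accelerated


# Hypothetical (prefill seconds, decode seconds) pairs; not measured values.
workloads = [(8.0, 2.0), (2.0, 8.0), (2.0, 60.0)]
for prefill_s, decode_s in workloads:
    s = end_to_end_speedup(prefill_s, decode_s, prefill_speedup=4.0)
    print(f"prefill={prefill_s}s decode={decode_s}s -> {s:.2f}x overall")
```

With a prefill-heavy workload the overall gain is large; once decode dominates, the same prefill speedup barely moves the total.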
adam_arthur•3h ago
I'm confused by all the takes implying decode is more important than prefill.

There are an enormous number of use cases where the prompt is large and the expected output is small.

E.g. providing data for the LLM to analyze, after which it gives a simple yes/no Boolean response. Or selecting a single enum value from a set.

This pattern seems far more valuable in practice than the common and lazy open-ended, chat-style implementations (lazy from a product perspective).

Obviously decode will be important for code generation or search, but that's such a small set of possible applications, and you'll probably always do better being on the latest models in the cloud.
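
As a rough sketch of the "large prompt, tiny output" pattern described above: the application sends a long document plus a fixed instruction, then accepts only a reply that matches one value from a small set. The `Verdict` labels and both helper functions are hypothetical names made up for this example, and no model call is shown.

```python
from enum import Enum


class Verdict(Enum):
    """Hypothetical label set; any small closed set would do."""
    APPROVE = "approve"
    REJECT = "reject"
    ESCALATE = "escalate"


def build_prompt(document: str) -> str:
    """Large input (the document) followed by a short, constrained question."""
    options = ", ".join(v.value for v in Verdict)
    return f"{document}\n\nAnswer with exactly one of: {options}."


def parse_verdict(reply: str) -> Verdict:
    """Accept only a reply that is one of the allowed values."""
    cleaned = reply.strip().strip(".").lower()
    return Verdict(cleaned)  # raises ValueError for anything outside the set


print(parse_verdict(" Approve.\n"))
```

Because the expected output is a single short token sequence, nearly all of the processing time in such a workload is spent reading the prompt.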

drodgers•2h ago
This is really cool!

Now I'm trying to stop myself from finding an excuse to spend upwards of $30k on compute hardware...

tuananh•1h ago
if you have $30k to spare, I'm sure there are better options
jsight•38m ago
Yeah, a couple of RTX Pro 6000 cards would blow this away and still leave him with money to spare.
solarkraft•2h ago
This is a wonderful explanation of the two phases! I appreciate the hardware concerns for both now.

Reading the article I wished for a device that just does both things well and on that topic it might be noteworthy that Apple's just-released M5 has approximately 3.5x-ed TTFT performance compared to M4, according to their claims!

daft_pink•2h ago
It’s really sad that exo went private.
storus•1h ago
Wouldn't this restrict memory to 128GB, wasting M3 Ultra potential?
musicale•19m ago
But you could also just get two DGX Spark and get 2 * 1.9x = 3.8x total throughput for two query streams.