Launch HN: General Instinct (YC P26) – Frontier models on edge devices

10•guanming0717•1h ago
Hey HN, Guanming and Bill here from General Instinct (https://general-instinct.com/).

After years of working in robotics, we kept running into the same problem: the best models never fit the hardware we actually had available.

The models that performed best were usually designed around datacenter assumptions: large GPUs, lots of memory bandwidth, and reliable network access. But most physical systems have the opposite constraints.

That led us down the path of figuring out how much of a frontier model could be preserved while still making it practical to run on edge hardware.

As part of that work, we recently open sourced InstinctRazor (https://github.com/General-Instinct/InstinctRazor).

One result we're excited about is compressing Qwen3.5-122B-A10B, a roughly 245 GB BF16 MoE model, into a 48 GiB GGUF. The resulting model is smaller than Gemma-4-26B-A4B while outperforming it on benchmarks such as MMLU-Pro and GPQA-D.

We preserve the parts that are always active (router, norms, Gated-DeltaNet/SSM layers, vision pathway, etc.) and quantize the routed experts much more aggressively. We then use on-policy distillation to recover capability lost during quantization.
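As a rough sanity check on the sizes quoted above, the average bits per weight can be worked out directly. This is only a back-of-the-envelope sketch: the total parameter count of about 122 billion is an assumption read off the "122B" in the model name, and the calculation ignores GGUF metadata overhead.

```python
# Back-of-the-envelope check of the quoted sizes.
# Assumption: ~122e9 total parameters, inferred from "122B" in the model name.
TOTAL_PARAMS = 122e9
BF16_BYTES_PER_PARAM = 2
GGUF_BYTES = 48 * 2**30  # 48 GiB, as quoted in the post

# Size of the unquantized model in decimal gigabytes.
bf16_gb = TOTAL_PARAMS * BF16_BYTES_PER_PARAM / 1e9

# Average storage cost per weight across the whole compressed file.
avg_bits_per_weight = GGUF_BYTES * 8 / TOTAL_PARAMS

# How many times smaller the GGUF is than the BF16 checkpoint.
compression_ratio = TOTAL_PARAMS * BF16_BYTES_PER_PARAM / GGUF_BYTES

print(f"BF16 size: ~{bf16_gb:.0f} GB")
print(f"Average bits per weight: ~{avg_bits_per_weight:.2f}")
print(f"Compression vs BF16: ~{compression_ratio:.1f}x")
```

Under that assumption the file averages a little under 3.4 bits per weight. If the always-active parts are kept at higher precision, as described above, the routed experts have to sit below that average.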

The model can also run in a "small GPU" configuration where experts are streamed from system RAM. With an 8k context window, peak VRAM usage is around 7.6–8 GB.
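To make the idea of streaming experts from system RAM concrete, here is a toy sketch of one generic way to do it: keep a small number of experts resident on the GPU and evict the least recently used one when the router asks for an expert that is not loaded. This is an illustration only, not InstinctRazor's implementation. The `ExpertCache` class, its `capacity` parameter, and the LRU policy are all hypothetical, and plain Python objects stand in for tensors in system RAM and VRAM.

```python
from collections import OrderedDict


class ExpertCache:
    """Toy LRU cache that keeps at most `capacity` experts "on the GPU".

    Hypothetical illustration: `host_experts` stands in for expert weights
    held in system RAM, and `resident` stands in for VRAM.
    """

    def __init__(self, host_experts, capacity):
        self.host_experts = host_experts  # expert_id -> weights (system RAM)
        self.capacity = capacity          # max experts resident at once
        self.resident = OrderedDict()     # expert_id -> weights ("VRAM")
        self.loads = 0                    # simulated host-to-device copies

    def get(self, expert_id):
        if expert_id in self.resident:
            # Cache hit: mark this expert as most recently used.
            self.resident.move_to_end(expert_id)
            return self.resident[expert_id]
        if len(self.resident) >= self.capacity:
            # Cache full: evict the least recently used expert.
            self.resident.popitem(last=False)
        # Cache miss: simulate copying the expert from system RAM.
        self.resident[expert_id] = self.host_experts[expert_id]
        self.loads += 1
        return self.resident[expert_id]


# Usage: 8 experts in "system RAM", room for only 2 in "VRAM".
host = {i: f"weights-{i}" for i in range(8)}
cache = ExpertCache(host, capacity=2)
for routed in [0, 1, 0, 2, 0, 1]:  # expert ids picked by a router, per token
    cache.get(routed)
print(cache.loads, list(cache.resident))
```

In a setup like this, peak VRAM is bounded by the always-resident layers plus `capacity` experts, and the cost is extra transfer time on every cache miss.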

If you're interested in the technical details, we wrote up the approach here: https://general-instinct.com/blog/frontier-moe-sub-4-bit

We're especially interested in hearing from people deploying models onto robots or other edge devices. What models are you trying to run locally today? What has been the biggest bottleneck in getting them into production?

Comments

VikRubenfeld•50m ago
You've likely heard about this - he'd probably like to talk to you and might potentially give you some good PR.

https://www.youtube.com/watch?v=rAzT5lcezPs&t=467s

guanming0717•42m ago
Thanks for sharing! I'd love to chat with him. Would you be open to introducing us? :)

smokel•12m ago
For those too lazy to watch someone talk on video for ages to make a point:

The link is to a famous YouTuber called PewDiePie, who uses a local LLM to parse his email and save time on it. They have an autoreply system and get notified about urgent matters.

XenophileJKO•6m ago
I'm still kind of surprised that people are targeting edge deployment of MoE models. By definition they optimize for computation cost at the expense of memory efficiency. We generally need the opposite on the edge.

I'm hoping to see more work in the other direction with cyclic/looped transformers and other memory dense approaches.

rohansood15•38s ago
Have you benchmarked against other 3-bit dynamic quants like Unsloth? I am sorry but this framing against a full precision, newer, smaller MoE just seems misleading.
