frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you use local LLMs productively?

3•virgildotcodes•1h ago
I've been periodically testing the strongest reported models as they come out, and which can fit on my 32GB M1 Max. I've yet to find one that I feel is genuinely useful.

My latest attempts were with 4 bit quants of Qwen 3.5, both 9b and 35B.

Both, on my very first query, something along the lines of "sup dog" or "how does beer a compare to beer b" led to an endless loop of thinking that I eventually had to manually stop in each case.

And yet I keep seeing passing comments about people using local LLMs to be productive.

Just curious what your strategies are, what the usecases are, and anything I may be missing.

Comments

andsoitis•1h ago
Lots of conversation on this topic yesterday: https://news.ycombinator.com/item?id=47363754
Cytoplast3528•1h ago
I think only Claude Sonnet/Opus, GPT 5.2+, Minimax M2.5 are useful. They are all nearly impossible to self-host, unfortunately.
CamperBob2•33m ago
Qwen 3.5 was plagued by some premature quant releases and unclear/incomplete guidelines for the sampling parameters. Especially if you are having looping problems, make sure you are using the very latest model files, executables, and recommended params.

If all the stars are aligned, Qwen 3.5 will not exhibit outright looping, although it will still burn more thinking tokens than some other models. There are ways to tone down the overthinking or disable it entirely, though, and the models are still quite capable when configured that way.

Show HN: Architecture question: running an LLM as core infrastructure

https://automazionezeli.com
1•senza1dio•1m ago•0 comments

Digg.com Closing Due to Spam

https://digg.com?hn
3•napolux•5m ago•0 comments

Rajon Rondo Profile

https://www.espn.com/espn/feature/story/_/id/12587848/old-questions-surface-new-dallas-mavericks-...
1•marysminefnuf•6m ago•0 comments

When Freemium Is Limiting: My Frustrations with Beehiiv

https://micahblachman.beehiiv.com/p/when-freemium-is-limiting
1•subdomain•8m ago•0 comments

The Download: Early adopters cash in on China's OpenClaw craze, and US batterie

https://www.technologyreview.com/2026/03/12/1134207/the-download-china-openclaw-ai-craze-us-batte...
1•joozio•10m ago•0 comments

Why Little Was Done to Head Off Oil's Strait of Hormuz Problem

https://www.nytimes.com/2026/03/14/business/energy-environment/iran-strait-hormuz-oil-middle-east...
1•mooreds•12m ago•1 comments

Convert JPG Logos to SVG – Stay Sharp at Any Size

https://oneweeb.com/jpg-to-svg.html
1•Zepubo•13m ago•0 comments

Kalshi for People

https://vouchmarket.polsia.app/
1•Marcoven•15m ago•0 comments

Show HN: I built a Chrome extension to block Instagram's feed and keep only DMs

https://chromewebstore.google.com/detail/mindful-instagram/neiedkilefemabefjohneedlemngfdjh
1•Shivam_Dewan•16m ago•0 comments

Piqe – AI marketing co-founder that handles community engagement while you code

https://getpiqe.com
1•tsjose•16m ago•1 comments

Linux Page Faults, MMAP, and userfaultfd for faster VM restores

https://www.shayon.dev/post/2026/65/linux-page-faults-mmap-and-userfaultfd/
2•shayonj•17m ago•0 comments

The State Policy Network

https://www.sourcewatch.org/index.php?title=State_Policy_Network
1•jamesgill•22m ago•0 comments

Locked Up but Not Locked Out: iOS App Pentesting Without Jailbreak

https://www.anvilsecure.com/blog/locked-up-but-not-locked-out-ios-app-pentesting-without-jailbrea...
1•depierre•26m ago•0 comments

I built a platform to help developers find collaborators for new projects

2•deiv2002•27m ago•0 comments

AI Adoption Rapidly Growing in Public Sector

https://www.gallup.com/workplace/702983/adoption-rapidly-growing-public-sector.aspx
1•hn_acker•27m ago•0 comments

GitHome: Local Git repository management at scale

https://crates.io/crates/githome
1•agentk9•29m ago•0 comments

Patrons of Journalism

https://www.hamiltonnolan.com/p/patrons-of-journalism
2•thm•32m ago•0 comments

TTL Exceeded – In Memory of FX

https://phenoelit.de/fx.html
2•_tk_•32m ago•0 comments

50 Years of Thinking Different

https://www.apple.com/50-years-of-thinking-different/
2•mndren•33m ago•0 comments

Show HN: I built a heartbeat and uptime monitoring for developers

https://pulsemon.dev/
3•ramgale•34m ago•0 comments

Sunsetting Jazzband

https://jazzband.co/news/2026/03/14/sunsetting-jazzband
14•mooreds•34m ago•1 comments

Can you see Earth's shadow?

https://www.livescience.com/space/can-you-see-earths-shadow
2•Brajeshwar•35m ago•0 comments

My Claude Settings

https://twitter.com/JoshuaBaer/status/2032666249465942208
2•tosh•38m ago•0 comments

Knowing What Not to Animate

https://micro.bossadizenith.me/writing/animations
3•handfuloflight•38m ago•0 comments

Show HN: I made a tool to pixelate image

https://www.pixelateimage.co/
3•atharvtathe•39m ago•1 comments

The Pentagon Went to War with Anthropic. What's Really at Stake?

https://www.newyorker.com/news/annals-of-inquiry/the-pentagon-went-to-war-with-anthropic-whats-re...
2•mitchbob•40m ago•1 comments

Show HN: VibeNVR – Open-source self-hosted NVR with REST API and Homepage widget

https://github.com/spupuz/VibeNVR
2•spupuz•41m ago•0 comments

Team Human

https://onTeamHuman.com
1•andytratt•42m ago•0 comments

North Korea: secretive nation lands in spotlight at Women's Asian Cup

https://www.theguardian.com/sport/2026/feb/23/north-korea-womens-national-football-team-asian-cup...
1•PaulHoule•42m ago•0 comments

The ArXiv is separating from Cornell University, and is hiring a CEO for 300k/yr

https://mathstodon.xyz/@johncarlosbaez/116223948891539024
4•binsquare•42m ago•2 comments