frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Why Ford Can't Find Mechanics

https://www.wsj.com/opinion/ford-motor-ceo-jim-farley-mechanic-shortage-autos-skilled-workers-334...
1•bookofjoe•2m ago•1 comments

The Crisis of the Portuguese Left

https://jacobin.com/2025/10/portugal-left-socialist-costa-chega/
1•PaulHoule•4m ago•0 comments

Anthropic models available on Microsoft Foundry

https://azure.microsoft.com/en-us/blog/introducing-anthropics-claude-models-in-microsoft-foundry-...
1•kryogen1c•5m ago•0 comments

Pulp (Non)Fiction: A Conversation with Pulp Magazine Collector Dr. Richard Meli

https://intelligentcollector.com/pulp-nonfiction-a-conversation-with-pulp-magazine-collector-dr-r...
1•ohjeez•7m ago•0 comments

Empire of AI Overestimated Datacenter Water Usage by 1000x

https://twitter.com/AndyMasley/status/1990181855753347433
2•kumarharsh•7m ago•0 comments

Court rules Meta purchases of Instagram and WhatsApp did not stifle competition

https://www.reuters.com/sustainability/boards-policy-regulation/meta-defeats-us-antitrust-case-ov...
2•jmsflknr•7m ago•0 comments

There Be Whales Here

https://ironicsans.ghost.io/there-be-whales-here/
1•Amorymeltzer•12m ago•0 comments

Aptible and Opti9: secure, reliable, compliant cloud infrastructure

https://www.aptible.com/blog/announcing-aptible-opti9
1•vandot•12m ago•0 comments

Show HN: Guts – convert Golang types to TypeScript

https://github.com/coder/guts
3•emyrk•12m ago•0 comments

Whatever Happened to UBI?

https://maxread.substack.com/p/whatever-happened-to-ubi
3•Amorymeltzer•13m ago•0 comments

Show HN: DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3

https://leebutterman.com/2025/11/01/prompt-optimization-on-a-raspberry-pi.html
1•lsb•14m ago•0 comments

Gemini 3 Pro Preview on OpenRouter

https://openrouter.ai/google/gemini-3-pro-preview
1•Topfi•15m ago•0 comments

Free PDF "Cheat Sheets" Released to Help Developers Navigate the PDF Spec

https://pdfa.org/new-and-updated-pdf-cheat-sheets/
2•whizzx•15m ago•0 comments

JIT Compiling AI Agents to Code

https://github.com/stanford-mast/a1
1•calebhwin•16m ago•0 comments

How Do Americans View Childhood Vaccines, Vaccine Research and Policy?

https://www.pewresearch.org/science/2025/11/18/how-do-americans-view-childhood-vaccines-vaccine-r...
3•hn_acker•16m ago•0 comments

What Is Success?

https://www.mdubakov.me/what-is-success/
1•tablet•20m ago•0 comments

Google CEO: If an AI bubble pops, no one is getting out clean

https://arstechnica.com/ai/2025/11/googles-sundar-pichai-warns-of-irrationality-in-trillion-dolla...
4•doctorshady•20m ago•0 comments

Generative UI: LLMs Are Effective UI Generators

https://generativeui.github.io/
2•easton•21m ago•0 comments

Show HN: Dataset Factory – Generate RAG evaluation datasets from a text prompt

https://alexjacobs08.github.io/datasetFactory/
1•tacoooooooo•21m ago•0 comments

What Is Work?

https://substack.com/inbox/post/179266883
1•gilfoyle_7•23m ago•0 comments

Active short video use linked to altered attention and brain connectivity

https://www.psypost.org/active-short-video-use-linked-to-altered-attention-and-brain-connectivity/
2•01-_-•25m ago•0 comments

Nestle accused of risking babies' health in Africa

https://www.aljazeera.com/news/2025/11/18/nestle-accused-of-risking-baby-heath-in-africa-asia-and...
1•Qem•25m ago•0 comments

To Be a Leader of Systems

https://hazelweakly.me/blog/to-be-a-leader-of-systems/
1•gpi•25m ago•0 comments

Mapping the future with 3D‑printed titanium Apple Watch cases

https://www.apple.com/newsroom/2025/11/mapping-the-future-with-3d-printed-titanium-apple-watch-ca...
2•throwfaraway4•25m ago•0 comments

Build a full data set using a single web query

https://parallel.ai/blog/introducing-findall-api
6•lukaslevert•26m ago•0 comments

John Henry and the Broken Dishwasher

https://substack.com/inbox/post/179265755
1•mathattack•27m ago•0 comments

Why one of the nation's most prosperous industries is shedding jobs

https://www.washingtonpost.com/business/2025/11/18/big-tech-layoffs-ai/
2•jackallis•28m ago•0 comments

A City Is Broke. Can a Billionaires' Urbanist Dream Offer It a Last Chance?

https://www.nytimes.com/2025/11/18/business/economy/suisun-city-makes-an-offer-to-california-fore...
2•mitchbob•28m ago•1 comments

Feeling Flush with Success – Making Museum Bathrooms into Exhibition Spaces

https://blog.orselli.net/2025/07/feeling-flush-with-success-making.html
1•ripe•28m ago•0 comments

The Connectivity Standards Alliance Announces Zigbee 4.0 and Suzi

https://csa-iot.org/newsroom/the-connectivity-standards-alliance-announces-zigbee-4-0-and-suzi-em...
1•paulatreides•33m ago•0 comments
Open in hackernews

Show HN: LLMKube – Kubernetes for Local LLMs with GPU Acceleration

https://github.com/defilantech/LLMKube
5•defilan•1h ago
Hi HN! I built LLMKube, a Kubernetes operator for deploying GPU-accelerated LLMs in production. One command gets you from zero to inference with full observability.

Why this exists: Regulated industries (healthcare, defense, finance) need air-gapped LLM deployments, but existing tools are either single-node only (Ollama) or lack GPU optimization and SLO enforcement. LLMKube bridges the gap.

What's working:

- 17x speedup with NVIDIA GPUs (64 tok/s on Llama 3.2 3B vs 4.6 tok/s CPU)

- One command: llmkube deploy llama-3b --gpu (auto CUDA setup, scheduling, layer offloading)

- Production observability: Prometheus + Grafana + DCGM GPU metrics out of the box

- OpenAI-compatible API endpoints

- Terraform configs for GKE GPU clusters with auto-scale to zero

Tech: Kubernetes CRDs, llama.cpp with CUDA, NVIDIA GPU Operator, cost-optimized spot instances (~$50-150/mo dev workloads).

Status: v0.2.0 production-ready for single-GPU deployments on standard K8s clusters. Multi-GPU and multi-node model sharding on the roadmap.

Apache 2.0 licensed. Would love feedback from anyone running LLMs in production!

Website: https://llmkube.com

GitHub: https://github.com/Defilan/LLMKube