GPU utilization in Kubernetes is still surprisingly poor for many inference and interactive workloads.
Most clusters either:
allocate exclusive GPUs per pod, or
rely on MIG / vGPU, which introduces rigidity and operational complexity.
I’m experimenting with a different approach: scheduler-level GPU sharing.
Shared Device Group is a Kubernetes extension that lets multiple pods share one or more GPUs, with GPU selection handled by the scheduler instead of hardware partitioning.
High-level idea:
A SharedDeviceGroup CRD defines a logical GPU group (a rough sketch follows this list)
Pods reference the group via an annotation (second sketch below)
The scheduler plugin selects a node + GPU set
Selected GPUs are exposed to the container via NVIDIA_VISIBLE_DEVICES
Optional device-plugin integration for kubelet accounting
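
For concreteness, here's roughly what a group definition could look like. This is a hand-written sketch, not the repo's actual schema: the apiVersion and every field name below are illustrative assumptions.

```yaml
# Illustrative SharedDeviceGroup sketch — apiVersion and field
# names are assumptions, not the project's actual CRD schema.
apiVersion: shareddevicegroup.example.com/v1alpha1
kind: SharedDeviceGroup
metadata:
  name: inference-gpus
spec:
  deviceType: nvidia.com/gpu
  devicesPerPod: 1      # how many GPUs each member pod should see
  maxPodsPerDevice: 4   # sharing fan-out per physical GPU
  nodeSelector:
    gpu-sharing: "true" # confine sharing to dedicated nodes
```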
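And the pod side, again with an assumed annotation key and scheduler name (check the repo for the real ones). The key point is that the pod does not request nvidia.com/gpu in its resources, so kubelet never grants it an exclusive device; the scheduler plugin picks the GPU set and surfaces it through the env var the NVIDIA container runtime already understands.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: llm-inference
  annotations:
    shared-device-group.io/group: inference-gpus  # assumed annotation key
spec:
  schedulerName: shared-device-scheduler          # assumed scheduler name
  containers:
    - name: server
      image: example.com/inference-server:latest  # placeholder image
      # No nvidia.com/gpu in resources — the plugin selects the GPUs
      # and injects something like:
      #   NVIDIA_VISIBLE_DEVICES=GPU-8c12...   (UUIDs) or "0,1" (indices)
      # which the NVIDIA container runtime uses to mount only those devices.
```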
This works best for:
inference workloads
bursty / short-lived GPU tasks
scenarios where strict GPU isolation isn’t required
Trade-offs:
not a replacement for MIG / vGPU
requires deploying and operating a custom scheduler plugin
best suited for dedicated GPU-sharing nodes
Repo: https://github.com/sceneryback/shared-device-group
I’d appreciate feedback, especially from folks running large GPU clusters or inference platforms.