We have just published a short demo of the WoolyAI GPU Hypervisor showcasing VRAM sharing/deduplication: load a single base model into VRAM once, then run multiple isolated LoRA or vLLM stacks against it on the same GPU.
Why this matters
Higher capacity: Share the base model in VRAM; add more adapters or vertical inference stacks per GPU without duplicating the base model's memory footprint.
Isolation & control: Each stack is its own process with independent batching and SLA-aware scheduling.
While vLLM supports multiple adapters in a single vLLM process (roughly as in the sketch below), many teams need predictable per-adapter SLAs. Running independent stacks that share one copy of the base model in VRAM makes that possible on a single GPU.
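For reference, here is a minimal sketch of that in-process route using vLLM's multi-LoRA support; the model and adapter paths are placeholders, and the exact argument names may vary by vLLM version:

```python
# Sketch: multiple LoRA adapters served from ONE vLLM process.
# All adapters share the same engine and batching queue, which is why
# per-adapter SLAs are harder to guarantee in this setup.
# Model name and adapter paths below are placeholders.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct", enable_lora=True, max_loras=4)
params = SamplingParams(max_tokens=128)

# Each request names the adapter it wants; all requests share one scheduler.
out_a = llm.generate(["Summarize our Q3 report."], params,
                     lora_request=LoRARequest("finance", 1, "/adapters/finance"))
out_b = llm.generate(["Draft a support reply."], params,
                     lora_request=LoRARequest("support", 2, "/adapters/support"))
```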
The demo runs LoRA inference with PyTorch, but the same approach applies with vLLM; a rough sketch of what one isolated stack looks like follows.
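This is a minimal sketch of one such stack, using PyTorch with Hugging Face transformers/peft; the model and adapter names are placeholders, and the deduplication of the shared base weights across processes is assumed to happen transparently in the hypervisor rather than in this code:

```python
# Sketch: ONE isolated LoRA stack (run one process like this per adapter).
# Every process loads the same base model; the hypervisor is assumed to
# deduplicate those identical base weights in VRAM across processes.
# Model and adapter names are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE = "meta-llama/Llama-3.1-8B-Instruct"   # same base model in every process
ADAPTER = "/adapters/finance"               # different LoRA adapter per process

tokenizer = AutoTokenizer.from_pretrained(BASE)
base = AutoModelForCausalLM.from_pretrained(
    BASE, torch_dtype=torch.float16, device_map="auto"
)
model = PeftModel.from_pretrained(base, ADAPTER)   # attach this stack's LoRA
model.eval()

inputs = tokenizer("Summarize our Q3 report.", return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```

Each process keeps its own batching, scheduling, and adapter, so one noisy tenant cannot starve another; the shared base weights are the only thing the stacks have in common.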
If you’re scaling LoRA inference across business units or model variants and need predictable latency without overprovisioning GPUs, I’d love your feedback. Comment or DM to chat.