frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: 50+ LLMs on 2 GPUs with 2-Second Swapping? We built AI-Native Runtime

https://github.com/inferx-net/inferx/wiki/The-Cold-Start-Time-To-First-Token-(CS%E2%80%90TTFT)-of-InferX-snapshot-based-container
2•pveldandi•7h ago
We've built InferX, a specialized runtime environment that fundamentally changes how LLMs are served. The core problem we solve is the latency bottleneck in AI inference, especially with large models. Current systems waste resources or suffer from painfully slow cold starts.

InferX's AI-native architecture, with its "snapshot" technology, enables:

* *Sub-2s cold starts:* Spin up models instantly. * *High density:* Serve more LLMs on the same GPUs. * *Optimal efficiency:* Maximize GPU utilization.

This isn't just another API; it's a new execution layer designed from the ground up for the unique demands of LLM inference. We're seeing strong interest from infrastructure teams and AI platform builders.

Would love your thoughts and feedback! What are the biggest challenges you're facing with LLM deployment?

Demo: https://inferx.net/

A rare snail is filmed laying an egg from its neck

https://apnews.com/article/zealand-snail-egg-neck-powelliphanta-augusta-3cb8082547a83b8c47848b6621c06cb0
1•gmays•2m ago•0 comments

Google Worried It Couldn't Control How Israel Uses Project Nimbus, Files Reveal

https://theintercept.com/2025/05/12/google-nimbus-israel-military-ai-human-rights/
1•zhengiszen•2m ago•0 comments

When was peak message in a bottle?

https://interconnected.org/home/2025/05/16/bottle
1•LorenDB•10m ago•0 comments

Soviet Refugee Igor Tulchinsky Became a Hedge Fund Billionaire

https://www.forbes.com/sites/johnhyatt/2025/05/16/this-billionaire-quant-is-turbocharging-his-trading-models-with-chatgpt-style-ai/
2•walterbell•18m ago•0 comments

Is there anything similar to xcancel or nitter but for Bluesky?

1•ranoutofnames•20m ago•0 comments

It's Not Just a Feeling: Data Shows Boys and Young Men Are Falling Behind

https://www.nytimes.com/2025/05/13/upshot/boys-falling-behind-data.html
1•jnord•21m ago•0 comments

Constrained Random Walks

https://github.com/ivanbelenky/pywalker
1•ivanbelenky•22m ago•0 comments

MIT Says It No Longer Stands Behind Student's AI Research Paper

https://www.msn.com/en-us/money/other/mit-says-it-no-longer-stands-behind-student-s-ai-research-paper/ar-AA1EUFwO
2•jnord•22m ago•0 comments

(How) I Use Amp

https://ampcode.com/how-i-use-amp
1•handfuloflight•23m ago•0 comments

Supplements

https://near.blog/supplements/
1•bilsbie•31m ago•0 comments

Phone scammers pretending to be 'from Amazon' trick woman out of $1M

https://www.seattletimes.com/nation-world/phone-scammers-pretending-to-be-from-amazon-trick-woman-out-of-1m/
1•rwc9•31m ago•1 comments

Nintendo's May 2025 Policy Updates

https://consumerrights.wiki/index.php?title=Nintendo%27s_May_2025_Policy_Updates
1•raybb•32m ago•0 comments

The Collapse of GPT

https://cacm.acm.org/news/the-collapse-of-gpt/
16•pseudolus•44m ago•5 comments

Pallene: A statically typed ahead-of-time compiled sister language to Lua, with

https://github.com/pallene-lang/pallene
3•todsacerdoti•45m ago•0 comments

The Connoisseur of Desire

https://www.nybooks.com/articles/2025/05/29/the-connoisseur-of-desire-the-annotated-great-gatsby/
4•samclemens•48m ago•0 comments

AI job alerts that match your skills

https://jobsphere.onboardai.net/
2•Gurnoorsb•51m ago•2 comments

They Were Identical 'Twinnies' Who Charmed Orwell, Camus and More

https://www.nytimes.com/2025/05/04/books/review/the-dazzling-paget-sisters-ariane-bankes.html
10•lermontov•52m ago•1 comments

AI Food Detection and Free Calorie Counter App by Recipe

https://whatthefood.io
2•OdehAhwal•55m ago•0 comments

Nord Stream 2 Enters Debt Restructuring Deal with Creditors

https://oilprice.com/Latest-Energy-News/World-News/Nord-Stream-2-Enters-Debt-Restructuring-Deal-with-Creditors.html
1•PaulHoule•58m ago•0 comments

Better air quality is the easiest way not to die

https://dynomight.net/air/
4•haltingproblem•58m ago•0 comments

Jane Street-Millennium Trade Secrets Fight Ends in Settlement (2024)

https://www.bnnbloomberg.ca/business/international/2024/12/05/jane-street-millennium-settle-india-options-trade-secrets-case/
1•walterbell•1h ago•0 comments

TalwarAI: A suite of autonomous security agents

https://test.pypi.org/project/talwarai-beta/
1•handfuloflight•1h ago•0 comments

Apple Says Fortnite for iOS Isn't Blocked Worldwide, Just the U.S.

https://www.macrumors.com/2025/05/16/apple-fortnite-ios-not-blocked-worldwide/
5•smileybarry•1h ago•1 comments

Typograph: Prompt to Font

https://typograph.studio/en
2•handfuloflight•1h ago•0 comments

Harvard bought a Magna Carta copy for $27. It turned out to be an original

https://www.usatoday.com/story/news/nation/2025/05/15/harvard-magna-carta-1300/83643266007/
5•rmason•1h ago•0 comments

Core War

https://en.wikipedia.org/wiki/Core_War
5•michalpleban•1h ago•0 comments

Reddit is down

3•tom1337•1h ago•2 comments

Yeast-Based LLM Research

1•daly•1h ago•0 comments

Berkshire Hathaway Inc Q4 2024 vs. Q1 2025 13F Holdings Comparison

https://13f.info/13f/000095012325005701-berkshire-hathaway-inc-q1-2025
1•kamaraju•1h ago•0 comments

How to Split Ranges in C++23 and C++26

https://www.cppstories.com/2025/ranges_split_chunk/
2•ibobev•1h ago•0 comments