Show HN: Serve 100 Large AI models on a single GPU with low impact to TTFT

https://github.com/leoheuler/flashtensors
2•leonheuler•1h ago
I wanted to build an inference provider for proprietary AI models, but I did not have a huge GPU farm. I started experimenting with serverless AI inference, but found that cold starts took far too long. I went deep into the research and put together an engine that loads large models from SSD to VRAM up to ten times faster than alternatives. It works with vLLM and transformers, with more integrations coming soon.

With this project you can hot-swap entire large models (32B) on demand.

It's great for:

Serverless AI Inference

Robotics

On-prem deployments

Local Agents

And it's open source.

Let me know if anyone wants to contribute :)

Comparing the Latitude of Europe and America

https://vividmaps.com/comparing-latitude-of-europe-and-america/
1•mooreds•1m ago•0 comments

AI powered stocks CLI tool

https://github.com/Chukwuebuka-2003/stock_cli
1•Chukwuebukaagm•4m ago•1 comment

PostgreSQL deserves better than libpq

https://twitter.com/tildeslash_/status/1987327780217102517
1•sovande•10m ago•0 comments

IRIX Introduction

http://www.sgistuff.net/software/irixintro/index.html
1•naves•11m ago•0 comments

Copy Edit This (2016)

https://www.nytimes.com/2016/07/19/insider/you-be-the-copy-editor.html
1•thundergolfer•12m ago•0 comments

Can peptides give you superpowers?

https://www.economist.com/science-and-technology/2025/11/07/can-peptides-give-you-superpowers
1•andsoitis•13m ago•0 comments

Statistical Estimate of Occurrence of Extraterrestrial Intelligence in Milky Way

https://arxiv.org/abs/2012.07902
1•bryanrasmussen•17m ago•0 comments

Bell Labs Innovations Song [video]

https://www.youtube.com/watch?v=U5V1sxAKu5I
1•gjvc•19m ago•0 comments

Closure Pattern in Zig

https://zig.news/houghtonap/closure-pattern-in-zig-19i3
1•andsoitis•20m ago•0 comments

The File Search Tool in Gemini API

https://blog.google/technology/developers/file-search-gemini-api/
1•gmays•21m ago•0 comments

HTML to Markdown Converter

https://www.htmltomarkdown.io/
1•leadsrocks•23m ago•2 comments

Advances in Threat Actor Usage of AI Tools

https://cloud.google.com/blog/topics/threat-intelligence/threat-actor-usage-of-ai-tools
1•andrescordova•25m ago•1 comment

Debt Has Entered the A.I. Boom

https://www.nytimes.com/2025/11/08/business/dealbook/debt-has-entered-the-ai-boom.html
3•JumpCrisscross•26m ago•0 comments

Receipts: A brief list of prominent articles proclaiming the death of the web

https://zeldman.com/2025/10/25/receipts-a-brief-list-of-prominent-articles-proclaiming-the-death-...
1•cpeterso•26m ago•0 comments

How Airbus Took Off

https://worksinprogress.co/issue/how-airbus-took-off/
1•JumpCrisscross•26m ago•0 comments

He Chunhui's Tiny386 Turns an ESP32-S3 into a Fully-Functional 386-Powered PC

https://www.hackster.io/news/he-chunhui-s-tiny386-turns-the-humble-esp32-s3-into-a-fully-function...
11•HardwareLust•36m ago•3 comments

Lego Icons Star Trek U.S.S. Enterprise NCC-1701-D

https://www.lego.com/en-us/aboutus/news/2025/november/lego-icons-star-trek-u-s-s-enterprise-ncc-1...
3•BiraIgnacio•36m ago•1 comment

The best intro video to blockchain is from 2013

https://www.youtube.com/watch?v=J-ab9was1p0
2•Norcim133•41m ago•0 comments

Judge says Education Dept partisan out-of-office emails violated First Amendment

https://www.npr.org/2025/11/08/nx-s1-5602859/education-department-out-of-office-emails-ruling
25•toomanyrichies•42m ago•5 comments

A 500M-year-old brain "radar" still shapes how you see

https://www.sciencedaily.com/releases/2025/11/251108083858.htm
1•therobots927•44m ago•0 comments

Fixing the Biggest Problem with Mechanical Keyboards

https://www.youtube.com/watch?v=N3FEv1qw4_w
1•todsacerdoti•47m ago•0 comments

Re-creating a rare 80s laptop from the ground up [video]

https://www.youtube.com/watch?v=BilLgXkR_Kw
1•jsheard•47m ago•0 comments

Essential Services Maintenance Act

https://en.wikipedia.org/wiki/Essential_Services_Maintenance_Act
1•SanjayMehta•48m ago•0 comments

Court Judge Rules Flock Safety camera data is not exempt from PRA [WA State]

https://www.goskagit.com/news/local_news/court-denies-request-that-it-find-flock-safety-camera-da...
17•p_ing•52m ago•1 comment

Geoffrey Hinton: Intro to Deep Learning and Deep Belief Nets [video] (2012)

https://www.youtube.com/watch?v=GJdWESd543Y
2•walterbell•56m ago•0 comments

TelUI 1.2: TelUI with fun alignments

1•telui•1h ago•0 comments

Free AI Image Prompt Generator

https://gempix2.photo/prompt-generator/
1•wantering•1h ago•0 comments

Truth Decay: Exploration of the Diminishing Role of Facts and Analysis (2018)

https://www.rand.org/pubs/research_reports/RR2314.html
3•whoknowsidont•1h ago•1 comment

Riyadh's new 176 km metro

https://www.liberallandscape.org/2025/11/08/riyadhs-new-metro-and-some-other-associated-landscape...
3•decimalenough•1h ago•0 comments

Emergency Airworthiness Directive for MD-11

https://drs.faa.gov/browse/excelExternalWindow/DRSDOCID188588539020251108211920.0001
2•garaetjjte•1h ago•0 comments