frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Serve 100 Large AI models on a single GPU with low impact to TTFT

https://github.com/leoheuler/flashtensors
5•leonheuler•5h ago
I wanted to build an inference provider for proprietary AI models, but I did not have a huge GPU farm. I started experimenting with Serverless AI inference, but found out that coldstarts were huge. I went deep into the research and put together an engine that loads large models from SSD to VRAM up to ten times faster than alternatives. It works with vLLM, and transformers, and more coming soon.

With this project you can hot-swap entire large models (32B) on demand.

Its great for:

Serverless AI Inference

Robotics

On Prem deployments

Local Agents

And Its open source.

Let me know if anyone wants to contribute :)

Comments

billconan•2h ago
can you hot swap a portion of an ai model, if my gpu is not large enough to hold the entire model? so that I can run half model first and load the other half.

Show HN: Geofenced chat communities anyone can create

https://vicinity.social/
8•clarencehoward•2h ago•3 comments

Show HN: Hephaestus – Autonomous Multi-Agent Orchestration Framework

https://github.com/Ido-Levi/Hephaestus
7•idolevi•5d ago•0 comments

Show HN: OtterLang – Pythonic scripting language that compiles to native code

https://github.com/jonathanmagambo/otterlang
9•otterlang•6h ago•2 comments

Show HN: I built an HTTP client that perfectly mimics Chrome 142

https://github.com/arman-bd/httpmorph
13•armanified•13h ago•1 comments

Show HN: Find matching acrylic paints for any HEX color

https://acrylicmatch.com/
53•dotspencer•5d ago•19 comments

Show HN: Serve 100 Large AI models on a single GPU with low impact to TTFT

https://github.com/leoheuler/flashtensors
5•leonheuler•5h ago•1 comments

Show HN: I built a website to visualize company financial data

https://myfinsight.com/
4•eadanlin•7h ago•1 comments

Show HN: I scraped 3B Goodreads reviews to train a better recommendation model

https://book.sv
584•costco•3d ago•250 comments

Show HN: qqqa – A fast, stateless LLM-powered assistant for your shell

https://github.com/matisojka/qqqa
157•iagooar•2d ago•84 comments

Show HN: See chords as flags – Visual harmony of top composers on musescore

https://rawl.rocks/
123•vitaly-pavlenko•3d ago•28 comments

Show HN: C++ Quantum Simulator written from scratch

https://github.com/braketware/hilbert-qusim
6•lofri•15h ago•0 comments

Show HN: Three Emojis, a daily word puzzle for language learners

https://threeemojis.com/en-US/play/hex/en-US/2025-11-07
28•knuckleheads•1d ago•25 comments

Show HN: Dynamic code and feedback walkthroughs with your coding Agent in VSCode

https://www.intraview.ai/hn-demo
42•cyrusradfar•2d ago•11 comments

Show HN: Livestream of a coding agent controlled by public chat

https://www.vibecodedbyx.com/
3•fela•12h ago•0 comments

Show HN: Easily reduce GitHub Actions costs with Ubuntu-slim migration

https://github.com/fchimpan/gh-slimify
4•r4mimu•12h ago•0 comments

Show HN: I combine Htmx, LiveView and SolidJS for interactive server components

https://github.com/phucvin/solv-03
3•phucvin•12h ago•1 comments

Show HN: TabPFN-2.5 – SOTA foundation model for tabular data

https://priorlabs.ai/technical-reports/tabpfn-2-5-model-report
71•onasta•2d ago•12 comments

Show HN: Command line YouTube downloader,a universal media solution for everyone

https://github.com/Saffron-sh/m2m
15•saffron-sh•1d ago•7 comments

Show HN: VoxConvo – "X but it's only voice messages"

https://voxconvo.com
10•siim•1d ago•14 comments

Show HN: Ambient light sensor control of keyboard and screen brightness in Linux

https://github.com/donjajo/als-led-backlight
26•donjajo•6d ago•2 comments

Show HN: OSS implementation of Test Time Diffusion that runs on a 24gb GPU

https://github.com/eamag/MMU-RAG-competition
21•eamag•1d ago•0 comments

Show HN: Flutter_compositions: Vue-inspired reactive building blocks for Flutter

https://github.com/yoyo930021/flutter_compositions
45•yoyo930021•2d ago•23 comments

Show HN: Patternia – A compile-time pattern matching DSL for C++

https://github.com/sentomk/patternia
4•sentomk•19h ago•1 comments

Show HN: CoLit – A Collaborative Literature Platform

https://www.colit.app/
5•pujan19•19h ago•0 comments

Show HN: A CSS-Only Terrain Generator

https://terra.layoutit.com
364•rofko•4d ago•82 comments

Show HN: A DevTools-Level JavaScript API for DOM and CSS Style Rules

https://github.com/devtoolcss/chrome-inspector
3•brouser•1d ago•2 comments

Show HN: Strange Attractors

https://blog.shashanktomar.com/posts/strange-attractors
800•shashanktomar•1w ago•78 comments

Show HN: Hacker Reader – A clean, open-source Hacker News client for iOS

https://apps.apple.com/us/app/hacker-reader/id6754137305
2•danielcspaiva•1d ago•0 comments

Show HN: Extending LLM SVG generation beyond pelicans and bicycles

https://gally.net/temp/20251107pelican-alternatives/index.html
7•tkgally•1d ago•0 comments

Show HN: I made a better DOM morphing algorithm

https://joel.drapper.me/p/morphlex/
8•joeldrapper•1d ago•0 comments