frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: We Built a Serverless GPU Platform with Fast Cold Starts

https://dat1.co
3•ayankovsky•21h ago
We built a Serverless GPU platform with low cold starts that is perfect for running your custom ML models (LLMs, image generation etc.).

We started with our own pain. We were running a generative AI startup and needed to run a Stable Diffusion pipeline with custom LoRA. We found that running a custom model on a cloud GPU means either a steep fixed cost (using traditional cloud providers) or extreme cold starts of several minutes (using serverless GPU providers).

We looked at successful non-GPU providers and came up with a hypothesis that still holds true today: we don't need to support custom Docker images, we can create just one environment that will run any model.

Of course, that alone did not solve the cold start. We had to work hard optimizing our platform to load and unload models as quickly as possible. We ended up building a pre-download mechanism and manipulating the page cache to load the predicted next model faster.

We wanted to make it as easy as possible for our clients to migrate and also to learn as much as possible, so we started offering free assistance in adapting models. We learned that improving cold starts is not just about the platform. It also depends on how the model is loaded.

This way we helped several teams running LLMs and image generation improve their ML-related features for users (reducing wait time) and often reduced costs.

Try our platform here: https://dat1.co

We'd love to hear your thoughts on anything related to the subject.

Thanks, Arseny.

Comments

sprocketus•21h ago
That's interesting. I have couple of questions: 1) How long it would take me to try out ? It seems that I cannot copy paste some snippets quickly. 2) What makes you better then let's say modal or replicate ?
ayankovsky•21h ago
Depending on the model, it could take from minutes to a couple of hours to adapt it and deploy to our platform. The process is quite easy if you want to run say an LLM (check an example project here https://github.com/dat1-co/dat1-model-examples/tree/main/lla...).

As to why we're better, I'd say a couple of reasons: lower cold start, more transparent pricing and human-first approach where we will work with you to make your model run in the best way possible.

sprocketus•18h ago
ok, I'll try thank you
nikitos4319•21h ago
Do you have trial or smth to try? I didn't see it right away
ayankovsky•21h ago
Thanks for the question! Yes, we do offer a free first month as a trial period (which we can extend of course). We should make it much more obvious on the website.

Immortal SSH Sessions

https://www.grepular.com/Immortal_SSH_Sessions
1•mike-cardwell•6m ago•0 comments

'Hello World' in Bismuth

https://enikofox.com/posts/hello-world-in-bismuth/
2•Bogdanp•10m ago•0 comments

Does Betteridge's Law Still Apply?

https://medium.com/luminasticity/does-betteridges-law-still-apply-7faf7f3c0bd3
1•bryanrasmussen•10m ago•0 comments

Meta Admits There's a Goldilocks Zone for VR Session Length Due to Form Factor

https://www.roadtovr.com/meta-quest-goldilocks-zone-session-form-factor/
2•LorenDB•10m ago•0 comments

Show HN: WTMF: An AI Companion for Late-Night Thoughts – Launching Next Week

1•ishqdehlvi•11m ago•0 comments

Australia's productivity commission proposes cashflow tax to boost investment

https://www.smh.com.au/politics/federal/big-cut-in-company-tax-would-boost-economy-but-it-comes-with-a-sting-20250731-p5mj5u.html
1•softveda•13m ago•1 comments

OpenCQRS – an open-source CQRS framework for the JVM

https://github.com/open-cqrs/opencqrs
2•goloroden•14m ago•0 comments

Show HN: I made a website to find relevant conversations about your brand

https://socialbrandmonitoring.com
2•tech_nurgaliyev•15m ago•0 comments

My first browser extensions|speed up AEO with generated content to copy & paste

https://aeoadvice.com/
1•scencan•16m ago•1 comments

Amazon DocumentDB Serverless is now available

https://aws.amazon.com/blogs/aws/amazon-documentdb-serverless-is-now-available/
1•mariuz•16m ago•0 comments

Why Won't Anyone Use the Beautiful Corporate Spaces

https://loganmarek.com/why-wont-anyone-use-the-beautiful-corporate-spaces/
2•xvok•16m ago•0 comments

Google ADK and AMD Instinct GPUs: The Dynamic Duo for AI Agents

https://www.amd.com/en/developer/resources/technical-articles/2025/google-adk-amd-instinct-gpus-the-dynamic-duo-for-ai-agents.html
2•mariuz•16m ago•0 comments

How to Build a Satellite?

https://www.youtube.com/watch?v=5voQfQOTem8
1•kehiy•18m ago•0 comments

'This wasn't obvious': the potato evolved from a tomato ancestor

https://www.theguardian.com/science/2025/jul/31/potato-evolved-from-tomato-ancestor-researchers-find
2•defrost•19m ago•0 comments

Onshape – Product Development Platform

https://www.onshape.com/en/
1•kehiy•19m ago•0 comments

Quadratic Voting

https://www.radicalxchange.org/wiki/quadratic-voting/
1•xucian•19m ago•1 comments

Brightest explosion ever seen is still baffling astronomers

https://www.popsci.com/science/biggest-gamma-ray-burst-boat/
1•Bluestein•24m ago•0 comments

Subagents.sh – Share and discover Claude Code sub-agents

https://subagents.sh/
1•augmnt•27m ago•1 comments

Bbor62 – A compact binary-to-text compressor

https://github.com/goudvuur/bbor62
1•beligum•28m ago•1 comments

Top Anonymous Email Services for Privacy Lovers

https://cyble.com/knowledge-hub/anonymous-email-services-for-privacy/
1•cybleinc•33m ago•0 comments

Fujitsu starts development of 10000 plus superconducting quantum computer

https://global.fujitsu/en-global/newsroom/gl/2025/08/01-01
2•donutloop•33m ago•0 comments

I built a free, open-source security scanner with shareable dashboards

https://github.com/Huluti/Secrover
1•hugoposnic•34m ago•1 comments

US Energy Department misrepresents climate science in new report

https://phys.org/news/2025-08-energy-department-misrepresents-climate-science.html
2•OutOfHere•37m ago•0 comments

The Art of Parsing and Comparing Version Strings

https://secalerts.co/news/the-art-of-parsing-and-comparing-version-strings/7bVWMEyNBrMIbBmixgGVsI
2•louisstow•39m ago•0 comments

One diet soft drink daily may increase diabetes risk by more than a third

https://www.monash.edu/news/articles/one-can-of-artificially-sweetened-soft-drink-daily-may-increase-diabetes-risk-by-more-than-a-third
2•t0lo•39m ago•1 comments

Isle FPGA Computer

https://projectf.io/isle/fpga-computer.html
1•z303•41m ago•0 comments

Ask HN: How do I sandbox Gemini Code Assist on Mac from accessing other files?

1•nuker•47m ago•0 comments

China struggles to break its addiction to manufacturing [Financial Times]

https://www.ft.com/content/f7979a8f-874a-4b47-8304-d93d30171980
2•wuschel•53m ago•2 comments

Why Japanese Developers Write Code Differently – Why It Works Better

https://medium.com/@sohail_saifi/why-japanese-developers-write-code-completely-differently-and-why-it-works-better-de84d6244fab
1•zdkaster•55m ago•0 comments

Ubiquiti users report having access to others' UniFi routers, cameras (2023)

https://www.bleepingcomputer.com/news/security/ubiquiti-users-report-having-access-to-others-unifi-routers-cameras/
2•janandonly•57m ago•0 comments