Ask HN: How to serve inference as we do with containers, with cached tokens?

1•elesbao•2h ago
I've been reading about and experimenting with vLLM, but it seems that each day there are more and more articles and AI-generated long-form posts about each part of the stack. I have a few GPUs and work for a private education group. I want to run models internally and distribute access to a research team. I don't want to dedicate one (or more) GPUs per user, nor do I want to train models. Currently I am doing well with a local Qwen on my own single server, but I can't decide which part to tackle first. Right now I am looking at KV caches and building on top of vLLM, but I want something simple and secure that does not leak data from one session to another.

British Columbia to make daylight saving time permanent

https://www.npr.org/2026/03/07/nx-s1-5741076/british-columbia-daylight-saving-time
1•geox•52s ago•0 comments

Prison guards discussed cover-up of Epstein's death, inmate tells FBI

https://www.miamiherald.com/news/local/crime/article314966334.html
1•ParentiSoundSys•5m ago•0 comments

Patching minified Claude Code so it can hear webhooks

https://github.com/Connoropolous/claude-notifications-for-agents
1•connorturland•5m ago•0 comments

Show HN: Navtee – Golf course directory and navigation app

https://navtee.com/
1•metafarer•7m ago•0 comments

ReactScope

https://folio.stage.obvious.ai/obvious/reactscope
1•handfuloflight•9m ago•0 comments

Show HN: Qry – CLI web search that always outputs JSON, with swappable back ends

https://github.com/justEstif/qry
2•justEstif•12m ago•0 comments

Show HN: SafeAgent – exactly-once execution guard for AI agent side effects

2•Lions2026•12m ago•1 comments

Show HN: Open-source personal finance AI that runs locally on your laptop

https://nullbook.ai/
2•jfornear•14m ago•1 comments

Forcing Flash Attention onto a TPU and Learning the Hard Way

https://archerzhang.me/forcing-flash-attention-onto-a-tpu
4•azhng•16m ago•0 comments

Mechanical Movements Animated

https://507movements.com/
2•TigerUniversity•16m ago•0 comments

Show HN: Pipe Checker – paste a sales deal and it checks BANT qualification

https://pipechecker.onrender.com/
1•eghatch92•20m ago•0 comments

Juno – J Web IDE

https://jsoftware.github.io/juno/app/
1•todsacerdoti•20m ago•0 comments

ReverseLM Playground

https://scottinallca.ps/reverse-lm/
1•scottmf•23m ago•0 comments

Integrating AI-Driven Predictive Analytics for Cybersecurity Risk Mitigation [pdf]

https://www.researchgate.net/profile/Adeyinka-Oluwatomisin/publication/401488012_Integrating_AI-D...
1•Olshansky•23m ago•0 comments

Msspproviders.io: a searchable directory of managed security service providers

https://msspproviders.io
1•datacorp•28m ago•1 comments

Old Versions of Programs, Drivers and Games

https://www.oldversion.com/
1•TigerUniversity•31m ago•0 comments

Zero Sum Game

https://code.chuanqisun.com/zero-sum-game/
1•low_tech_punk•33m ago•0 comments

Karabiner-Elements is a powerful tool for customizing keyboards on macOS

https://github.com/pqrs-org/Karabiner-Elements
1•vinhnx•35m ago•0 comments

Coruna: The Mysterious Journey of a Powerful iOS Exploit Kit

https://cloud.google.com/blog/topics/threat-intelligence/coruna-powerful-ios-exploit-kit
1•JumpCrisscross•35m ago•0 comments

How Vinay Prasad Came to Washington, and Why It Was Always Going to End This Way

https://anishkokamd.substack.com/p/how-vinay-prasad-came-to-washington
1•ssivark•37m ago•0 comments

Judge Voids Mass Layoffs at Voice of America

https://www.nytimes.com/2026/03/07/us/politics/judge-kari-lake-voa-layoffs.html
10•JumpCrisscross•38m ago•0 comments

Scaling and controlling an army of devices in parallel with voice commands

https://www.youtube.com/watch?v=kxfT4ZzG2l0
1•GPUboy•39m ago•0 comments

Plenty of AI hype, but not much useful software?

1•YounesDz•39m ago•1 comments

Show HN: Yumo.to, a map of 19,652 onsens in Japan

https://yumo.to/
2•katagamistudio•46m ago•3 comments

Show HN: I built a $5/mo Jobber alternative for solo carpenter

https://fieldflow-nine.vercel.app/auth
1•Mike_Handyman•46m ago•0 comments

Americans Are Now a Target for ICE

https://www.wsj.com/us-news/immigration-protests-noem-minneapolis-0b8bd496
9•JumpCrisscross•48m ago•2 comments

All Bench Leaderboard for Comparing LLMs Across Benchmarks

https://huggingface.co/blog/FINAL-Bench/all-bench
1•seawolf2357•48m ago•0 comments

Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation

https://arxiv.org/abs/2603.04466
1•vaishak2future•49m ago•1 comments

Kalshi and Polymarket Are Each Eyeing Roughly $20B Valuations

https://www.wsj.com/finance/kalshi-and-polymarket-are-each-eyeing-roughly-20-billion-valuations-d...
3•bookofjoe•49m ago•1 comments

State of WASI support for CPython: March 2026

https://snarky.ca/state-of-wasi-support-for-cpython-march-2026/
1•mariuz•49m ago•0 comments