Show HN: LLMOne – Deploy LLMs from bare metal to production in hours

https://github.com/EM-GeekLab/LLMOne
4•pescn•4h ago
I spent days trying to deploy DeepSeek on a server this year. Install Ubuntu, NVIDIA drivers, CUDA, Docker, configure vLLM, debug memory issues, tune performance settings. Every deployment was different. Every server had its own quirks. Worse still, these issues are more pronounced on non-NVIDIA accelerators, such as Ascend or Intel NPU.

So we built LLMOne, which automates this. You can point it at bare metal (via BMC) or, soon, SSH into an existing server, select models, and it handles everything: OS installation, driver setup, inference engine configuration, model deployment, and deployment of applications such as Open WebUI or Dify.

The code is open source (Mulan PSL v2, like Apache 2.0). No vendor lock-in.

A user tutorial video is available: https://youtu.be/P4MgIPW5K70

How it works:

1. Uses the BMC (Redfish) to install the OS remotely on bare metal, rather than PXE, so no DHCP server has to be configured
2. Installs the appropriate drivers (NVIDIA, Huawei Ascend, etc.)
3. Sets up containers and an inference engine (vLLM, MindIE, or OpenVINO; it picks the right one)
4. Deploys models and runs performance benchmarks
5. Can also deploy apps like Open WebUI and Dify alongside the models
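
To make the flow above concrete, here is a rough Python sketch of the five steps as an ordered pipeline. It is not LLMOne's code. The accelerator-to-engine mapping is an assumption (the post only says the tool picks the right engine), and `Deployment`, `pick_engine`, and `run_pipeline` are hypothetical names.

```python
from dataclasses import dataclass, field

# Assumed mapping; the post does not spell out how the engine is chosen.
ENGINE_BY_ACCELERATOR = {
    "nvidia": "vLLM",
    "ascend": "MindIE",
    "intel": "OpenVINO",
}


@dataclass
class Deployment:
    host: str
    accelerator: str
    model: str
    log: list = field(default_factory=list)


def pick_engine(accelerator: str) -> str:
    """Return the inference engine assumed to match the accelerator."""
    try:
        return ENGINE_BY_ACCELERATOR[accelerator.lower()]
    except KeyError:
        raise ValueError(f"no engine known for accelerator {accelerator!r}") from None


def run_pipeline(dep: Deployment) -> list:
    """Record the five steps in order; a real tool would execute each one."""
    engine = pick_engine(dep.accelerator)
    steps = [
        f"install OS on {dep.host} via BMC (Redfish)",
        f"install {dep.accelerator} drivers",
        f"start {engine} container",
        f"deploy {dep.model} and run benchmarks",
        "deploy optional apps (Open WebUI, Dify)",
    ]
    for step in steps:
        dep.log.append(step)
    return dep.log
```

Keeping the steps as plain data makes the unattended run easy to log and resume, which is one plausible way to structure this kind of tool.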

The whole process runs unattended. What used to take me 2-3 days of tweaking now finishes in 1-2 hours.

Technical bits:

We avoid Kubernetes entirely - found it adds complexity without much benefit for single-node LLM deployments. Everything runs in Docker containers with custom orchestration.
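
As an illustration of what single-node orchestration without Kubernetes can look like, the sketch below builds a `docker run` command for vLLM's OpenAI-compatible server image. This is an assumption about the general approach, not LLMOne's actual orchestration. The function name and the model identifier are hypothetical, and the flags follow vLLM's published Docker usage.

```python
import shlex


def vllm_docker_command(model: str, port: int = 8000,
                        image: str = "vllm/vllm-openai:latest") -> list[str]:
    """Build (but do not run) a docker command that serves `model` with vLLM."""
    return [
        "docker", "run", "--detach",
        "--gpus", "all",        # expose all NVIDIA GPUs to the container
        "--ipc=host",           # share host IPC namespace for shared memory
        "-p", f"{port}:8000",   # vLLM's server listens on 8000 in the container
        image,
        "--model", model,       # everything after the image goes to vLLM
    ]


# Hypothetical model identifier, for illustration only.
cmd = vllm_docker_command("example-org/example-model")
print(shlex.join(cmd))
```

Returning the argument list instead of executing it keeps the sketch side-effect free; an orchestrator could pass the list to `subprocess.run` once it has checked the host.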

The BMC integration was tricky. Different servers expose different Redfish capabilities, so we built adapters for some vendors, such as iDRAC from Dell and iBMC from Huawei.
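
A per-vendor adapter layer like the one described could be sketched as follows. The class and function names are hypothetical and this is not LLMOne's interface. The system identifiers are assumptions to check against each vendor's Redfish documentation; only the `/redfish/v1/Systems/` prefix comes from the Redfish standard.

```python
from abc import ABC, abstractmethod


class BmcAdapter(ABC):
    """Hypothetical base class hiding vendor differences behind one interface."""

    vendor: str

    @abstractmethod
    def system_id(self) -> str:
        """Return the vendor-specific Redfish ComputerSystem identifier."""

    def system_path(self) -> str:
        # The Systems collection path is defined by the Redfish standard.
        return f"/redfish/v1/Systems/{self.system_id()}"


class DellIdracAdapter(BmcAdapter):
    vendor = "dell"

    def system_id(self) -> str:
        return "System.Embedded.1"  # assumed iDRAC identifier


class HuaweiIbmcAdapter(BmcAdapter):
    vendor = "huawei"

    def system_id(self) -> str:
        return "1"  # assumed iBMC identifier


ADAPTERS = {cls.vendor: cls for cls in (DellIdracAdapter, HuaweiIbmcAdapter)}


def adapter_for(vendor: str) -> BmcAdapter:
    """Look up an adapter by vendor name, failing loudly for unsupported ones."""
    try:
        return ADAPTERS[vendor.lower()]()
    except KeyError:
        raise NotImplementedError(f"no BMC adapter for {vendor!r}") from None
```

Adding a new vendor then means writing one more subclass and registering it, which fits the stated plan to extend support as hardware becomes available.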

Performance varies by hardware, but we've seen ~2200 tokens/sec on RTX 4090 with TensorRT-LLM backend, ~1900 with vLLM. The system runs Evalscope benchmarks automatically so you know what you're getting.

Why this exists:

We work with chip vendors and AI server resellers. When servers arrive at customer sites, instead of a multi-person support team spending days on deployment and debugging, one person can use this tool to get everything running.

While we focus on LLM deployment, the tech stack can actually deploy anything from bare OS to complex software stacks. The automation layer is generic enough for various workloads.

Current limitations:

BMC support currently covers only Dell iDRAC and Huawei iBMC. We're working on Supermicro support. We'd love to expand to other server vendors, but we need hardware access or a Redfish mock for testing and development.

SSH mode and Apple Silicon support are coming soon.

Looking for feedback. Also, if you're a server vendor and can provide BMC access for testing, we'd appreciate the help expanding hardware support.