
Show HN: I wrote an LLM inference engine in pure Go – 48 tok/s zero dependencies

https://github.com/computerex/dlgo/tree/main
2•computerex•2h ago
dlgo is a pure Go deep learning inference engine. It loads GGUF models and runs them on CPU with no dependencies beyond the standard library (SIMD acceleration is optional via cgo).

I built this because I wanted to add local LLM inference to a Go project without shelling out to Python or linking against llama.cpp. Setup is one command, go get github.com/computerex/dlgo, and you're running models.

It supports LLaMA, Qwen 2/3/3.5, Gemma 2/3, Phi-2/4, SmolLM2, Mistral, and Whisper speech-to-text. Architectures are expressed as a declarative per-layer spec resolved at load time, so adding a new model family is mostly just describing its layer structure rather than writing a new forward pass.
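To make the "declarative per-layer spec" idea concrete, here is a rough sketch of how such a scheme could look. It is not dlgo's actual API: every type and function name below (LayerKind, LayerSpec, ArchSpec, Resolve, hybridExample) is hypothetical, and the example architecture is invented for illustration. The point is only that a model family is described as a rule from layer index to layer description, and that rule is expanded once the layer count is known at load time.

```go
package main

import "fmt"

// LayerKind names the kind of token-mixing block a layer uses.
// Hypothetical names for illustration; not taken from dlgo.
type LayerKind int

const (
	FullAttention LayerKind = iota
	SlidingWindowAttention
)

func (k LayerKind) String() string {
	if k == FullAttention {
		return "full-attention"
	}
	return "sliding-window"
}

// LayerSpec describes a single layer declaratively.
type LayerSpec struct {
	Kind   LayerKind
	Window int // attention window in tokens; 0 means unlimited
}

// ArchSpec describes a model family as a rule from layer index to LayerSpec.
type ArchSpec struct {
	Name     string
	LayerFor func(i int) LayerSpec
}

// Resolve expands the rule into one concrete spec per layer. In a real
// loader, nLayers would come from the model file's metadata.
func Resolve(a ArchSpec, nLayers int) []LayerSpec {
	layers := make([]LayerSpec, nLayers)
	for i := range layers {
		layers[i] = a.LayerFor(i)
	}
	return layers
}

// hybridExample is an invented family: every 4th layer uses full
// attention, all other layers use a 512-token sliding window.
var hybridExample = ArchSpec{
	Name: "hypothetical-hybrid",
	LayerFor: func(i int) LayerSpec {
		if (i+1)%4 == 0 {
			return LayerSpec{Kind: FullAttention}
		}
		return LayerSpec{Kind: SlidingWindowAttention, Window: 512}
	},
}

func main() {
	for i, l := range Resolve(hybridExample, 8) {
		fmt.Printf("layer %d: %s (window=%d)\n", i, l.Kind, l.Window)
	}
}
```

With this shape, a shared forward pass can switch on each resolved LayerSpec, so supporting a new family means writing a new ArchSpec value rather than a new forward function.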

Performance on a single CPU thread with Q4_K_M quantization: ~31 tok/s for LLaMA 3.2 1B, ~48 tok/s for Qwen3 0.6B, ~16 tok/s for Qwen3.5 2B (which has a hybrid attention + Gated Delta Network architecture). Not going to beat llama.cpp on raw speed, but it's fast enough to be useful and the ergonomics of a native Go library are hard to beat.

Supports 25+ GGML quantization formats (Q4_0 through Q8_0, all K-quants, I-quants, F16, BF16, F32). The GGUF parser, dequantization, tokenizer, forward pass, and sampling are all implemented from scratch.
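As an illustration of what dequantization involves, here is a minimal sketch for Q8_0, one of the simplest formats: weights are grouped into blocks of 32 that share a single scale, and each weight is recovered as scale * quant. This is not dlgo's code. The BlockQ8 type and Dequantize function are hypothetical names, and the scale is held as a float32 here for brevity, whereas the on-disk format stores it as a 16-bit float that would need converting first.

```go
package main

import "fmt"

// blockSize is the number of weights that share one scale in Q8_0.
const blockSize = 32

// BlockQ8 is a hypothetical in-memory view of one Q8_0 block.
// Assumption: Scale has already been widened from float16 to float32.
type BlockQ8 struct {
	Scale  float32
	Quants [blockSize]int8
}

// Dequantize expands blocks back into float32 weights: w = scale * q.
func Dequantize(blocks []BlockQ8) []float32 {
	out := make([]float32, 0, len(blocks)*blockSize)
	for _, b := range blocks {
		for _, q := range b.Quants {
			out = append(out, b.Scale*float32(q))
		}
	}
	return out
}

func main() {
	var b BlockQ8
	b.Scale = 0.5
	b.Quants[0] = 4
	b.Quants[1] = -2
	w := Dequantize([]BlockQ8{b})
	fmt.Println(len(w), w[0], w[1]) // prints: 32 2 -1
}
```

The K-quants and I-quants follow the same block-wise principle but pack more structure into each block (sub-block scales, minimums, lookup tables), which is where most of the per-format work goes.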

Code: https://github.com/computerex/dlgo
