frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Rewrite Text – On-Device AI Writing Tool for iOS

https://apps.apple.com/us/app/rewrite-text-ai-writing-tool/id6758913519
1•8mobile•3m ago•0 comments

Investment Supply Chain Analysis

https://investment.binhph.am
1•davedx•8m ago•1 comments

Show HN: Skillscape – Engineering skills matrix without the spreadsheet

https://www.skillscape.dev/
1•danielyefet•14m ago•0 comments

SimpleSteps – TypeScript-to-ASL Compiler

https://github.com/DevNamedZed/simplesteps
1•aman96_54_3•17m ago•0 comments

Demonstration of Network Tap and Packet Filter Using a Security Camera

https://privateisland.tech/dev/betsy-demo-tap-w-cam
1•mindchasers•17m ago•0 comments

I thought freelancers hated invoices. They hated the tools

https://www.indiehackers.com/post/i-thought-freelancers-hated-invoices-they-actually-hated-the-to...
1•allinonetools_•23m ago•0 comments

ThePrimeagen goes back to traditional coding

https://twitter.com/theprimeagen/status/2026771192191824108
2•rob•26m ago•0 comments

When "technically true" becomes "misleading"

https://www.theargumentmag.com/p/when-technically-true-becomes-actually
1•bananaflag•32m ago•0 comments

Australia's WiseTech to cut 2k jobs as AI renders manual coding obsolete

https://www.computerworld.com/article/4137200/australias-wisetech-to-cut-2000-jobs-as-ai-renders-...
2•netfortius•32m ago•1 comments

CleverMock – An AI voice interviewer that interrupts you like a real human

https://www.clevermock.com
1•devinda-dilshan•33m ago•1 comments

Show HN: Programmatic (and self-updating) SaaS demo videos

https://www.rundown.video/
1•guico•34m ago•0 comments

Show HN: Bing Webmaster CLI for Agents and LLMs

https://github.com/NmadeleiDev/bing_webmaster_cli
1•Gregoryy•37m ago•0 comments

A White House Staffer Appears to Run Pro-Trump X Account

https://www.wired.com/story/a-white-house-staffer-appears-to-run-massive-pro-trump-meme-page/
3•doener•41m ago•2 comments

Show HN: Onera – Private LLM Inference Inside AMD SEV-SNP Enclaves

https://onera.chat
1•shreyaspapi•42m ago•1 comments

Next-Token Predictor Is an AI's Job, Not Its Species

https://www.astralcodexten.com/p/next-token-predictor-is-an-ais-job
1•bananaflag•42m ago•0 comments

Tests Are the New Moat

https://saewitz.com/tests-are-the-new-moat
1•vinhnx•45m ago•1 comments

'Access to Insight' is shutting down

https://www.accesstoinsight.org/
1•bifftastic•46m ago•0 comments

The next batch of fixed Epstein files links and notes is live

https://xcancel.com/IAmAnonLegion/status/2026853415863615662?s=20
2•doener•46m ago•0 comments

Programming has changed dramatically due to AI in the last 2 months (Karpathy)

https://twitter.com/karpathy/status/2026731645169185220
2•bakigul•49m ago•0 comments

Demo of an indie AI collaboration app – beyond Codex and Claude Code desktop

1•seeksky•51m ago•1 comments

AIQuotaBar – macOS menu bar app that shows Claude and ChatGPT usage limits

https://github.com/yagcioglutoprak/AIQuotaBar
1•toprak123•56m ago•1 comments

Git City – Your GitHub as a 3D City

https://www.thegitcity.com/
1•duck•56m ago•2 comments

Mumsnet campaign demands ban on social media for under-16s

https://www.theguardian.com/society/2026/feb/26/mumsnet-campaign-demands-ban-social-media-under-16s
2•pmg101•59m ago•0 comments

Shipcast – Turn your Git commits into tweets, automatically

https://shipcast.dev/
1•guoyu•59m ago•0 comments

Show HN: LucidExtractor – Extract web data in plain English, no selectors

https://lucidextractor.liceron.in
1•yukendiran_j•1h ago•0 comments

A larger cage: about the ongoing calls for "digital sovereignty"

https://www.structural-integrity.eu/a-larger-cage-about-the-ongoing-calls-for-digital-sovereignty/
1•doener•1h ago•0 comments

Earth's heat to power 10k homes in renewable energy first for UK

https://www.bbc.co.uk/news/articles/cewzg77k721o
2•RobinL•1h ago•0 comments

Show HN: Snaplake – Query past database states without restoring backups

https://snaplake.clroot.io
1•clroot•1h ago•0 comments

Show HN: Context Harness – Local first context engine for AI tools

https://github.com/parallax-labs/context-harness
1•__parallaxis•1h ago•0 comments

Perplexity Computer

https://www.perplexity.ai/hub/blog/introducing-perplexity-computer
1•kamaal•1h ago•1 comments
Open in hackernews

vLLM-mlx – 65 tok/s LLM inference on Mac with tool calling and prompt caching

https://github.com/raullenchai/vllm-mlx
3•raullen•1h ago

Comments

raullen•1h ago
I've been working on a fork of vllm-mlx (OpenAI-compatible LLM server for Apple Silicon) to make it actually usable for coding agents. The upstream project is great but was missing production-grade tool calling, reasoning separation, and multi-turn performance.

  What I added (37 commits):

  - Tool calling that works — streaming + non-streaming, supports MiniMax and Hermes/Qwen3 formats. 4/4 accuracy on structured function calling benchmarks.
  - Reasoning separation — MiniMax-M2.5 mixes reasoning into its output with no tags. Built a heuristic parser that cleanly separates reasoning from content (0% leak rate, was 60%
   with the generic parser).
  - Prompt cache for SimpleEngine — persistent KV cache across requests. On 33K-token coding agent contexts: TTFT goes from 28s to 0.3s on cache hit. This is the single biggest
  improvement for multi-turn use.
  - 1500+ tests — parsers, engine, server, tool calling. The upstream had minimal test coverage.

  Benchmarks (Mac Studio M3 Ultra, 256GB):

  Qwen3-Coder-Next-6bit (80B MoE, 3B active):
  - Decode: 65 tok/s
  - Prefill: 1090-1440 tok/s
  - TTFT (cache hit, 33K context): 0.3s

  MiniMax-M2.5-4bit (229B MoE):
  - Decode: 33-38 tok/s
  - Deep reasoning with tool calling

  I built this to run OpenClaw locally on my Mac instead of paying for cloud APIs. Qwen3-Coder-Next at 65 tok/s with tool calling is genuinely usable — not a toy demo.

  Quick start:

  pip install git+https://github.com/raullenchai/vllm-mlx.git
  python -m vllm_mlx.server \
    --model lmstudio-community/Qwen3-Coder-Next-MLX-6bit \
    --tool-call-parser hermes --port 8000

  GitHub: https://github.com/raullenchai/vllm-mlx