frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

vLLM-MLX – Run LLMs on Mac at 464 tok/s

https://github.com/waybarrios/vllm-mlx
2•waybarrios•1h ago

Comments

waybarrios•1h ago
Hey HN! I built vLLM-MLX alike framework on macOS, which is painfully slow on Apple Silicon machines.

vLLM-MLX brings native GPU acceleration using Apple's MLX framework, with:

  • OpenAI-compatible API (drop-in replacement)
  • Multimodal: Text, Images, Video, Audio in one server
  • Continuous batching for concurrent users (3.4x speedup)
  • TTS in 10+ languages (Kokoro, Chatterbox)
  • MCP tool calling support

  Performance on M4 Max:
  - Llama-3.2-1B-4bit: 464 tok/s
  - Qwen3-0.6B: 402 tok/s
  - Whisper STT: 197x real-time
Quick start: pip install -e . vllm-mlx serve mlx-community/Llama-3.2-3B-Instruct-4bit

Works with standard OpenAI SDK. Happy to answer questions!

GitHub: https://github.com/waybarrios/vllm-mlx

Running Factorio from over 1k floppy disks

https://www.pcgamer.com/hardware/running-factorio-from-over-1-000-floppy-disks-is-a-masochistical...
1•Tomte•9m ago•0 comments

The One Simple Thing That Makes the U.S. Economy Unmanageable

https://www.thebignewsletter.com/p/the-one-simple-thing-that-makes-the
1•connor11528•12m ago•0 comments

Podpdf – Ultra-fast, zero-dependency PDF generation for Node.js and Bun

https://github.com/herolabid/podpdf
2•javatuts•15m ago•0 comments

Title: A simple dinner meeting led to a sophisticated iOS eKYC bypass

https://medium.com/@ryu360i/potential-for-iphone-ekyc-face-id-hacking-how-passcode-shouldering-se...
2•ryuzaburo•17m ago•1 comments

India Issues Final Warning to Apple in Ongoing Antitrust Case

https://www.macobserver.com/news/india-issues-final-warning-to-apple-in-ongoing-antitrust-case/
2•Brajeshwar•17m ago•0 comments

The Executive Assistant Paradox: Why AI Makes This Role Critical, Not Obsolete

https://vleech.substack.com/p/the-executive-assistant-paradox-why
3•connor11528•19m ago•0 comments

100's of hours of GTM research in seconds

https://dev.dashboard.chainfuse.ai/team/01981bfb-b6fa-78a8-937e-822018bddac0/dataspace/019a6f6c-1...
2•sushidata•20m ago•1 comments

Show HN: The viral speed read at 900wpm app

https://wordblip.com
1•Gillinghammer•20m ago•0 comments

Learning Latent Action World Models in the Wild

https://arxiv.org/abs/2601.05230
1•saswatms•22m ago•0 comments

Antigravity down for Ultra plan accounts

https://discuss.ai.google.dev/t/antigravity-broken-getting-only-agent-execution-terminated-due-to...
2•fmnxl•23m ago•1 comments

European Sovereign Cloud

https://www.chrisfarris.com/
1•weinzierl•28m ago•0 comments

OpenAI Codex with Ollama

https://ollama.com/blog/codex
1•meetpateltech•28m ago•0 comments

OpenAI Used Kenyan Workers on Less Than $2 per Hour to Make ChatGPT Less Toxic

https://time.com/6247678/openai-chatgpt-kenya-workers/
2•pabs3•29m ago•0 comments

I tricked my partner into caring about finances

https://www.indiehackers.com/post/how-i-tricked-my-partner-into-caring-about-finances-dff051a4cb
1•abbster52•38m ago•1 comments

Simulation: Jupiter holds 1.5 times more oxygen than the sun

https://phys.org/news/2026-01-jupiter-hidden-depths-simulation-planet.html
1•wglb•39m ago•1 comments

Behind Trump vs. Powell Is a Battle over US Empire's Future

https://jacobin.com/2026/01/trump-powell-fed-europe-dollars
4•kaycebasques•43m ago•0 comments

It Can Apply and Positive in Favor the Newton III Law on an Engine System Device

1•monterrey•46m ago•0 comments

State Ofthe Art Novel InFlow 1Gearturbine/Reaction 2Imploturbocompressor/Impulse

1•monterrey•48m ago•0 comments

San Francisco to offer free childcare to people making up to $230000

https://www.theguardian.com/us-news/2026/jan/15/san-francisco-childcare-families
8•darth_avocado•49m ago•2 comments

Podcasting Could Use a Good Asteroid

https://www.joanwestenberg.com/podcasting-could-use-a-good-asteroid/
2•zdw•51m ago•0 comments

Ask HN: What are Claude's skills/what skills does Claude possess?

2•Obscurity4340•52m ago•0 comments

Glyphhanger – Your web font utility belt

https://www.zachleat.com/web/glyphhanger/
1•doodlesdev•54m ago•0 comments

The Myth of the ThinkPad

https://innovintageblog.wordpress.com/2026/01/08/the-myth-of-the-thinkpad/
9•volemo•56m ago•2 comments

Jeff Bezos Needs to Speak Up

https://www.theatlantic.com/ideas/2026/01/raid-washington-post/685621/
3•JumpCrisscross•57m ago•2 comments

Ericsson Silent Layoffs in the US

4•allabouttech•1h ago•1 comments

Trump Moves to Make Tech Giants Pay for Surging Power Costs

https://www.bloomberg.com/news/articles/2026-01-15/trump-to-direct-key-us-grid-operator-to-hold-e...
3•jmcdonald-ut•1h ago•1 comments

America's Throwaway Spies: How the CIA Failed Iranian Informants in Tehran

https://www.reuters.com/investigates/special-report/usa-spies-iran/
5•koolhead17•1h ago•0 comments

Mark Carney and Xi Jinping meet to mend ties as Donald Trump disrupts globe

https://www.ft.com/content/9eeff245-2081-4f97-bc8e-6bbdaf59074e
4•KnuthIsGod•1h ago•0 comments

Fontello – Combine icon webfonts for your own project

https://github.com/fontello/fontello
1•doodlesdev•1h ago•0 comments

Is there any way we can help Stack Overflow Website get back up?

https://stackoverflow.com/questions/79867766/is-there-any-way-we-can-help-stack-overflow-website-...
1•nomilk•1h ago•0 comments