frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Speakrs Full PyAnnotate pipeline in Rust/ONNX 20-37x times faster macOS

https://github.com/avencera/speakrs
2•praveenperera•1h ago
Speakrs implements the full pyannote community-1 style diarization pipeline in Rust: segmentation, powerset decode, overlap-add aggregation, binarization, embedding, PLDA, and VBx clustering.

There is no Python runtime in the library path. Inference runs on ONNX Runtime or native CoreML, and the rest of the pipeline stays in Rust.

It is 20x-30x faster on macOS, but only 2-3x faster on linux/cuda (depending on CPU).

Few reasons its faster:

1. Speakrs is using coreml versions of the models. I exported the models specifically to run on coreml. PyAnnote just runs the same the same PyTorch versions through MPS (Metal) on macOS.

2. PyAnnote is not a single model, its a few different models put together in a pipeline, the readme has some info on the full pipeline.

3. Speakrs optimizes the pipeline so different parts can run on CPU, Neural Engine and GPU. Speakrs has a batch mode, where you can run on multiple files at once, doing this also lets you keep CPU/GPU/ANE all fully utilized.

This is why on linux/cuda its not that much faster, PyAnnotate is already optimized to run on cuda, the speed improvements we get on cuda is by running some stuff on cpu while the other stuff runs on the GPU. The speedup on linux will depend on how powerful the CPU is.

There is also a fast mode, that sacrifices some speed for accuracy, that can be up to 50x faster, and for some types of audio doesn't sacrifice that much accuracy. The benchmarks have more info on this.

Show HN: An LLM translator whose source is a single prompt

https://github.com/hamsterbase/llm-translator
3•Cassandra99•3m ago•0 comments

Show HN: Cross-agent messaging and shared memory over the local filesystem

https://oacp.dev
2•haoranchg•41m ago•1 comments

Show HN: Speakrs Full PyAnnotate pipeline in Rust/ONNX 20-37x times faster macOS

https://github.com/avencera/speakrs
2•praveenperera•1h ago•0 comments

Show HN: A website that tracks every stock trade Congress makes

https://congress.kadoa.com/
4•hubraumhugo•1h ago•1 comments

Show HN: Write your BPF programs in Go, not C

https://github.com/boratanrikulu/gobee
107•boratanrikulu•5d ago•51 comments

Show HN: NeuroFlow 55.8x video inference speedup for Vision Transformers PyTorch

https://github.com/ynnk-research/-NeuroFlow
3•ynnk•2h ago•0 comments

Show HN: Mind-expander, a visual workspace for coding with AI agents

https://github.com/mbbill/mind-expander
2•mbbill•2h ago•0 comments

Show HN: OpenBrief – Local-first video downloader/summarizer

https://github.com/tantara/openbrief
78•tantara•20h ago•13 comments

Show HN: I open-sourced two AI agents with real memory (chat and voice, MIT)

https://github.com/SynapCores/synapcores-agent
5•SQLv2•3h ago•0 comments

Show HN: Harbor v0.4.19 – harbor launch –back end vLLM –web codex

https://github.com/av/harbor/releases/tag/v0.4.19
4•everlier•3h ago•0 comments

Show HN: Local-first PDF redaction for permanently removing data

3•daoxiaoyue2012•3h ago•2 comments

Show HN: Speed up taking macOS promo screenshots

https://bendansby.com/apps/beautyshot.html
2•webwielder2•47m ago•0 comments

Show HN: Kakeibo – a simple budget tracking app for simple people

https://getkakeibo.com/en/
2•palpfiction•4h ago•0 comments

Show HN: WYSIWYG markdown editor for any GitHub repo

https://dunkdown.com
4•ramoz•4h ago•2 comments

Show HN: Treats Human and AI the Same

https://github.com/haozeli2009/Hands-and-Claws
4•haozeli•4h ago•0 comments

Show HN: Sifter – Does your CV *actually* stand out to LLMs?

https://www.sifter.sh/
5•benjosaur•4h ago•0 comments

Show HN: Audiomass – a free, open-source multitrack audio editor for the web

https://audiomass.co/?multitrack=1
529•pantelisk•2d ago•115 comments

Show HN: Geomatic – A command-driven geometry studio enabled with autodiff

https://www.tinyvolt.com/geomatic
70•nivter•1d ago•16 comments

Show HN: MetaStrip – Client-side metadata removal for photos, video, audio, docs

https://metastrip.app
3•lars-dev•6h ago•0 comments

Show HN: Fungible – A local personal finance app in the terminal

https://github.com/tomfunk/fungible
15•tomfunk•20h ago•8 comments

Show HN: TryPost – open-source Social Media Scheduler

https://trypost.it/en
11•paulocastellano•22h ago•6 comments

Show HN: Volt – front end tooling for Phoenix that runs inside the BEAM

https://github.com/elixir-volt/volt
24•dannote•1d ago•1 comments

Show HN: Riot, a modern multicore actor-based ecosystem for OCaml

https://riot.ml
5•leostera•11h ago•0 comments

Show HN: skills-for-humanity – 171 structured reasoning skills for Claude Code

https://github.com/human-avatar/skills-for-humanity
10•finnworks•12h ago•0 comments

Show HN: Lily Design System: Components for React, Vue, Svelte, HTML, More

https://lilydesignsystem.github.io/
4•jph•14h ago•1 comments

Show HN: Rapel – chunked resumable downloads in unstable networks

https://github.com/redraw/rapel
3•autorun•15h ago•1 comments

Show HN: Pgcraft – a lazygit-style TUI for Postgres

https://github.com/lucasfrederico/pgcraft
6•lucasfrederico•15h ago•0 comments

Show HN: Freenet, a peer-to-peer platform for decentralized apps

https://freenet.org/
383•sanity•5d ago•269 comments

Show HN: Anyone interested in a tool helps to explore C++ ASTs

https://uvic-aurora.github.io/acav-manual/index.html
48•leomicv•4d ago•4 comments

Show HN: ShadowCat – file transfer through QR Codes in a Browser

https://github.com/unprovable/ShadowCat
165•unprovable•4d ago•64 comments