frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

The Audio Stack Is a Crime Scene

https://fireborn.mataroa.blog/blog/i-want-to-love-linux-it-doesnt-love-me-back-post-2-the-audio-stack-is-a-crime-scene/
83•todsacerdoti•2mo ago

Comments

PaulHoule•2mo ago
Windows is better but not much. My big PC at home isn't completely reliable to attach to Bluetooth headphones, some of which might be the fault of the particular headphones, but some of which seems to be the fault of having various sorts of "virtual" drivers installed such as for Steam, MQ 3, Immersed, etc.
mystified5016•2mo ago
Windows' Bluetooth stack is an absolute dumpster fire. They trashed the perfectly good BT stack from windows 7 and apparently had an unpaid intern write the replacement.

My day job is building widgets that connect to a windows PC over Bluetooth. The situation is so bad that we're building a dedicated RF adapter so we can have a sane stack under our control.

If your program is scanning for a particular device in the background, that device will never show up in the windows BT pairing menu. I can't even imagine how that happens. Many API calls do nothing or return garbage, many BT features are just not exposed at all, despite windows clearly having that data internally.

W10 never even had BT audio sink. You could not play audio from a remote device to your PC. W7 had it and I think W11 finally got it a few years back. Linux has always had it.

Windows' Bluetooth stack is no contest the worst available on the market. It's astonishing how poor quality Microsoft products are these days.

Macha•2mo ago
Yeah, I always feel surprised when people call out the Linux audio experience as uniquely bad. Meanwhile on Windows I need to jam voicemeter in the middle to separate inputs and outputs so I don't have games crashing to desktop because my bluetooth headset ran out of battery, while Mac makes each individual application implement volume control UI.
Ekaros•2mo ago
After upgrading to Intel Bluetooth/Wlan module I have had lot less issues.

But in general Bluetooth is just bad protocol audio...

Artoooooor•2mo ago
I'm still baffled by difference between Audacity on Windows and Linux. On Windows I have n+1 recording devices, where n is number of microphones, the 1 being stereo mix. On Linux I have gazillion cryptic names and I pray for each of them to even work. And neither of them is stereo mix, for such advanced feature I have to enable external mixer.
loa_in_•2mo ago
Stereo mix is a surprisingly neutral nontrivial feature. How do you mix the channels, are you okay with limitations of software mixer, do you want to hardware mix it on supported audio chipsets, what stereo mix means when you have three or more microphones?
pseudosaid•2mo ago
A stereo mix is in fact trivial and roughly assumes a left right speaker arrangement. Software summing is math and is the same across digital audio workstations because math is math. Are there 3 or more microphones? doesnt matter, each mic is a mono input. trying to capture a facsimile of a perceived stereo field? use ORTF mic spacing and hard pan those mono inputs appropriately.

Stereo is just dual mono. its that simple. Summing is just math, its that trivial.

Creating a mix of multiple inputs for a stereo lr output? also completely trivial from a technical standpoint. Hardly trivial from a creative pov, but that isnt what this is about.

okdood64•2mo ago
Precisely the reason I don't understand how people can daily drive Linux on their laptop. (I say laptop because additionally: trackpad issues, Bluetooth issues, etc.)

Is it much better than 10 or 20 years ago? Yes.

Is it still annoying enough and sometimes colossal waste of time? Yes. Just use a Macbook and be done with it.

xrhobo•2mo ago
I think it depends on what laptop.

Asus laptops have always worked flawlessly for me.

I am writing this on an Asus laptop with linux mint. Everything just works perfect out of the box.

I could buy three of these laptops instead of the same setup on a mac is the major selling point for me.

ringeryless•2mo ago
the cause of all this? single client ALSA driver model.

coreaudio doesn't need pulse nor jack nor pipewire to allow a single device to be opened by multiple applications.

ALSA additionally specifies every 2 channels as separate 2channel devices, so your 8 channel audio interface looks like 4.

the confusion is added to by device tree overlays such that you may find your pro audio 8 channel device is always seen as a surround setup, replete wirh highpass filtering on the principle pair of outputs, and you don't know why your poor desktop environments sound device doesn't show this...

patrakov•1mo ago
> ALSA additionally specifies every 2 channels as separate 2channel devices, so your 8 channel audio interface looks like 4.

There is no such specification. It is device-dependent and abstracted through ALSA configuration. Creative Labs indeed implemented their cards as three separate codecs, while ForteMedia used a single 6-channel codec that you can open as such at the hardware level. In both cases, you can open a single virtual "surround51" device that routes the channels correctly.

duped•2mo ago
> The stack is clean. The interfaces are better. But the expectations are still stuck in 2012.

The interfaces are absolutely not better, they're a bunch of ~~undocumented~~ C preprocessor macros that hide a pit of complexity and indirection. Pulse is mature enough that you can figure out how to interface with it, pipewire has a handful of examples and some reference documentation that doesn't make sense if you don't already know what you're looking for. Good luck if those examples don't fit nicely into your existing applications' architectures.

Pipewire is very impressive as an achievement, but the work needs to be put in to make it mature enough as a software project (meaning: documentation and well typed interfaces) before existing software can work with it.

The architecture may well be better equipped to provide stable and elegant interfaces to complex media routing problems in application software. But I couldn't tell you if that's true or not, because it's sparsely documented. An undiscoverable API is a nonexistent one.

edit: I wrote this ignorantly before looking back at the pipewire docs. They've improved since I last looked, but I still find the docs lacking (comparable to Apple's docs, which is not a compliment) and the overall interface design of Pipewire a massive challenge to grasp. It might be better, but I don't know. The last time I tried to implement direct support for pipewire in a Linux app I gave up because of its design and lack of useful documentation.

amy214•2mo ago
the interfaces are shite, from the same clown who invented systemd

just use one file per stream eg /dev/dsp/pid_tempfile eg /dev/dsp/778_gJG86h, move mix to kernel, give it a /proc interface. the unix way

nesarkvechnep•2mo ago
Another area where FreeBSD excels.
AHTERIX5000•2mo ago
The Linux audio stack is just baffling. After all these years it's still unreliable. I'm running Fedora 41 and often toggle between two audio devices, USB DAC and HDMI output. Sometimes when I change the output via Gnome settings my Flatpak apps don't care and keep playing with the previous output and other apps change as expected.

I've never edited any audio configs on this machine because I expected my 2 device setup and Gnome settings would be trivial enough for the latest Linux desktop audio solution.

thedanbob•2mo ago
A few years ago I was helping run sound at a conference which involved recording and rebroadcasting multiple audio streams simultaneously. The provided equipment included a Linux computer running a real-time kernel and routing was handled by JACK.

It was a disaster. If the USB audio interface ever disconnected, JACK crashed and wiped my routing configuration. After the first day I ditched the Linux computer and ran the whole setup through my MBA, using Reaper for both recording and routing.

wormius•2mo ago
I got bit by linux audio issues after moving from Win10 to CachyOS (arch based distro). It seemed to be working fine, but I noticed when switching to speakers a terrible digital distortion that wasn't there previously. (I'm not a linux noob, have used since Debian Bo in '97, for example, plus many other distros off and on through the years)

I spent so much time messing with alsamixer, removing pipewire, installing pulse, uninstalling pulse, readding pipewire, resetting ports and connections.

I realized it had something to do with the audio buffer (due to the echo and the distortion was "shaped" like the feedback/if that makes sense). like a half second delay feedback.

Ultimately I read up on the buffering mechanism and found that switching to the Bore-LTO kernel fixed everything. I was so desperate I was literally swapping ends of my cables between the PC and speaker, in theory it shouldn't matter but before realizing the issue when I noticed the noise was based on a delay/feedback, I had no other alternative and was at my wits end.

esbranson•1mo ago
Facts. I still don't think pavucontrol, the GTK volume control tool for PulseAudio, has been replaced by a PipeWire native tool. (I have no idea what Helvum is for, and pavucontrol has minimal backing.) And since I messed with it one time now using headphones is all jacked.

Sega Enterprises Ltd. vs. Accolade, Inc., 977 F.2d 1510 (9th Cir. 1992)

https://en.wikipedia.org/wiki/Sega_v._Accolade
1•Bluestein•2m ago•0 comments

UK graduates share their job-hunting woes amid the AI fallout

https://www.theguardian.com/money/2025/jul/13/student-debt-graduates-share-job-hunting-woes-ai-fallout
1•mykowebhn•6m ago•0 comments

Nearly two-thirds of UK workers turning to drink because of job pressures

https://news.sky.com/story/nearly-two-thirds-of-uk-workers-turning-to-drink-because-of-job-pressures-survey-says-13395997
3•austinallegro•9m ago•0 comments

Gmail AI hallucinates, distorts email contents

https://www.t-online.de/digital/aktuelles/id_100811852/gmail-fantasiert-googles-mail-programm-verfaelscht-fremde-inhalte.html
1•disentanglement•10m ago•0 comments

Show HN: TXT OS – Open-Source AI Reasoning, One Plain-Text File at a Time

https://github.com/onestardao/WFGY/tree/main/OS
3•TXTOS•15m ago•2 comments

Empirical evidence of LLM's influence on human spoken communication

https://arxiv.org/abs/2409.01754
1•ransom_rs•16m ago•0 comments

I built Findly.tools – a successor to 1000.tools

https://findly.tools/
1•drdruide•17m ago•1 comments

A (Limited) Defence of Footnotes

https://www.kryogenix.org/days/2025/07/03/a-limited-defence-of-footnotes/
2•jruohonen•20m ago•0 comments

Retrieval Embedding Benchmark

https://huggingface.co/spaces/embedding-benchmark/RTEB
1•fzliu•23m ago•0 comments

Stop memoizing Hash lookups in Ruby

https://nithinbekal.com/posts/ruby-hash-memoization/
2•thunderbong•24m ago•0 comments

Summary of Bluesky Whitepaper

https://twitter.com/RealmOfSoftware/status/1904941497910505833
1•krsoninikhil•24m ago•0 comments

Vibe-Engineering: When AI Does All the Coding, What Do We Do?

https://img.ly/blog/vibe-engineering-when-ai-does-all-the-coding-what-do-we-actually-do/
2•tosh•25m ago•0 comments

Show HN: We developed an AI tool to diagnose car problems

https://autoai.help
2•mariyan250•27m ago•0 comments

Ask HN: I own wtf.store -what to build that makes people say WTF (in gud way)?

1•deep_thinker26•28m ago•0 comments

Windows Update forced new version of Chengjie which is unusable to users

https://old.reddit.com/r/Windows11/comments/1lvelyb/warning_to_windows_11_24h2_chengji_users_for_july/
2•charlieyu1•29m ago•1 comments

Moiré materials based on M-point twisting

https://www.nature.com/articles/s41586-025-09187-5
2•Bluestein•30m ago•0 comments

Iota Legacy Migration Tool Issue

https://old.reddit.com/r/Iota/comments/1kx02eb/iota_legacy_migration_tool_issue/
1•Bluestein•38m ago•0 comments

Turns out you can just hack any train in the USA

https://twitter.com/midwestneil/status/1943708133421101446
4•lyu07282•38m ago•1 comments

Kawase Hasui Catalogue

https://mokuhanga1.com/kawase-hasui/
1•vinhnx•38m ago•0 comments

Trump announces 30% tariffs on EU and Mexico, starting Aug. 1

https://www.cnbc.com/2025/07/12/trump-tariffs-european-union-eu-mexico-trade.html
3•kamaraju•49m ago•5 comments

Show HN: I built a entire blogging platform just for myself lol

https://lykhari.com
1•yaKashif•53m ago•0 comments

Ask HN: How important is to learn Software Defined Networking?

1•shivajikobardan•56m ago•0 comments

Govern Amazon Bedrock Gen AI Models in SageMaker Unified Studio

https://aws.amazon.com/blogs/machine-learning/configure-fine-grained-access-to-amazon-bedrock-models-using-amazon-sagemaker-unified-studio/
1•funnyguy678•59m ago•0 comments

Hill Space: Neural nets that do perfect arithmetic (to 10⁻¹⁶ precision)

https://hillspace.justindujardin.com/
2•peili7•1h ago•1 comments

Dynamic Chunking for End-to-End Hierarchical Sequence Modeling

https://arxiv.org/abs/2507.07955
1•fzliu•1h ago•0 comments

Show HN: Invoicesplitter – Split and Rename invoices in seconds

https://www.invoicesplitter.com/
1•Protonu3102•1h ago•0 comments

The Jackpot Age

https://twitter.com/thiccyth0t/status/1943773395209924769
2•Michelangelo11•1h ago•0 comments

Migrating My NAS from CoreOS/Flatcar Linux to NixOS

https://michael.stapelberg.ch/posts/2025-07-13-nixos-nas-network-storage-config/
3•Bogdanp•1h ago•0 comments

Using terminal-notifier in Claude Code to get custom notifications

https://www.andreagrandi.it/posts/using-terminal-notifier-claude-code-custom-notifications/
1•andreagrandi•1h ago•0 comments

Commodore 64 Ultimate gets released

https://www.tomshardware.com/video-games/retro-gaming/the-commodore-64-ultimate-computer-is-the-companys-first-hardware-release-in-over-30-years-pre-orders-start-at-usd299
2•smingo•1h ago•1 comments