Show HN: I made AI earphones remember everything (auto-sync to Obsidian)

11•Paddyz•5d ago
Hey HN!

I built this after getting frustrated with losing great ideas while driving, cooking, or exercising. You know that moment when inspiration strikes but your hands are occupied?

The Problem: Doubao AI earphones (popular in China, similar to AirPods but with built-in AI) are great for Q&A, but all conversations disappear after listening. It's a closed ecosystem with no way to export valuable content.

My Solution: A Python tool that monitors the Doubao web interface and automatically syncs voice notes to Obsidian in real-time.

What makes it interesting:

30+ speech variation recognition - Works even if you say "note" instead of "take note" or use filler words like "um, note this down"

Hands-free operation - Just say "Doubao, take a note, [your content]" and it appears in Obsidian instantly

Smart deduplication - Won't create duplicate entries for similar content

Cross-platform - Works on Windows, macOS, Linux

Technical approach:

Uses Playwright to monitor DOM changes and network requests

Regex engine handles speech variations and colloquialisms

SQLite for deduplication logic

Async I/O for real-time file operations

Real use cases I've tested:

Capturing meeting insights while driving between offices

Recording workout thoughts during runs

Noting recipe improvements while cooking

Quick idea capture during walks

The tool essentially breaks the walled garden of AI earphones and turns them into a proper knowledge management device. It's like having a voice assistant that actually remembers and organizes everything you tell it.

Demo: Say "Doubao, note this: remember to research async patterns" → Instantly appears in Inbox/Voice Notes/2026-01-21.md
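
The trigger phrases mentioned above ("note", "take note", "um, note this down") could be matched roughly like this. This is a minimal sketch, not the project's actual regex engine: the pattern `TRIGGER`, the helpers `extract_note` and `daily_note_path`, the filler-word list, and the vault root are all assumptions for illustration. Only the Inbox/Voice Notes/YYYY-MM-DD.md layout comes from the demo.

```python
import re
from datetime import date
from pathlib import Path
from typing import Optional

# Hypothetical trigger pattern: optional filler words, optional wake word,
# one of a few "note" phrasings, then the content to save.
TRIGGER = re.compile(
    r"""^\s*
    (?:(?:um+|uh+|er+|okay|ok)[\s,]+)*     # leading filler words
    (?:doubao[\s,]+)?                      # optional wake word
    (?:(?:um+|uh+|er+)[\s,]+)*             # filler after the wake word
    (?:please\s+)?
    (?:take\s+a\s+note|take\s+note|note\s+this\s+down|note\s+this|note)\b
    \s*[:,]?\s*
    (?P<content>.+?)\s*$
    """,
    re.IGNORECASE | re.VERBOSE,
)


def extract_note(utterance: str) -> Optional[str]:
    """Return the note content if the utterance starts with a trigger, else None."""
    match = TRIGGER.match(utterance)
    return match.group("content") if match else None


def daily_note_path(vault: Path, day: date) -> Path:
    """Build the daily file path, following the layout shown in the demo."""
    return vault / "Inbox" / "Voice Notes" / f"{day.isoformat()}.md"
```

With this pattern, the demo utterance yields "remember to research async patterns", while a plain question with no trigger phrase returns None and would be ignored.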

Built with Python + Playwright + SQLite. MIT licensed.
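
For the SQLite deduplication mentioned above, one simple approach is to store a hash of the normalized text and let a primary key reject repeats. This is a sketch under assumptions, not the repository's code: the table name `seen_notes`, the helper functions, and the normalization rules are hypothetical. It only catches transcripts that differ in case, punctuation, or spacing, which is narrower than the "similar content" matching described in the post.

```python
import hashlib
import re
import sqlite3


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace before hashing."""
    text = re.sub(r"[^\w\s]", "", text.lower())
    return " ".join(text.split())


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open the database and create the (hypothetical) seen_notes table if missing."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS seen_notes ("
        "digest TEXT PRIMARY KEY, "
        "first_seen TEXT DEFAULT CURRENT_TIMESTAMP)"
    )
    return conn


def is_new_note(conn: sqlite3.Connection, text: str) -> bool:
    """Record the note's digest; return True only the first time it is seen."""
    digest = hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()
    cur = conn.execute(
        "INSERT OR IGNORE INTO seen_notes (digest) VALUES (?)", (digest,)
    )
    conn.commit()
    # rowcount is 1 when the row was inserted, 0 when the primary key already existed.
    return cur.rowcount == 1
```

A caller would write to the vault only when `is_new_note` returns True, so a transcript that shows up twice in the page is saved once.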

What do you think? Have you faced similar issues with voice assistants that don't persist information? Would love to hear about your workflows for capturing ideas on-the-go!

GitHub: https://github.com/GptsApp/doubao-earphone-to-obsidian

Comments

llbbdd•5d ago
I've long wanted this kind of always-on logging, but I fear the social element is the hardest part to crack. Besides keeping a record of the substantial amount I reason out loud to myself, it'd be valuable to really remember everything I'm present for at that level of detail. But I'd feel awkward if the recording device were obvious and subversive if it were hidden, and people may not much like having someone in their life who has notes on every interaction to refer to.
Paddyz•1d ago
Great point! This really is a nuanced and important issue.

I totally get the concern—continuous recording is definitely a gray area in everyday life. My tool is more focused on active recording (like saying "Doubao, take a note..."), not passive monitoring. The key differences are:

Intent - You clearly know what you're recording. It feels more like "writing something down" than "recording audio"

Visibility - Earbuds are already so commonplace that they don't draw attention like a dedicated voice recorder would

Social boundaries - I typically only record personal thoughts, not conversations with others (which would require clear consent)

netsharc•1h ago
God damn, trying to make fiction reality? https://www.youtube.com/watch?v=RRNYmFrfbCg
netsharc•1h ago
Cool, almost a "build your own ecosystem" Siri/voice assistant...

The two-line explanation is sort of vague, but from the code I surmise the Python "app" watches a webpage (configured as https://www.doubao.com/chat/624642496948226). Every time the DOM there is modified, it reads the new prompt, looks for the word "note", and if it finds it, creates an Obsidian note with the transcription of the prompt.

    CHAT_URL: str = "https://www.doubao.com/chat/624642496948226"
    [...]
    await page.goto(CHAT_URL, timeout=120000, wait_until="domcontentloaded")
Alexa has "build your own app", this seems less convoluted.

Google Gemini also records my prompts (under My Activity), I guess with an always-listening Gemini Assistant and a similar Python script that monitors https://myactivity.google.com/product/gemini (I'm guessing this page needs a hard reload to update), it's possible to build something similar.

I don't have my phone set up to respond to "Hey Google", but I have an alarm clock that does (not Gemini, but Google Assistant), and I often tell it to "Remind me about [...] in x hours". I just tested the phrase "Add a note about...", and it added a note in Google Keep. But with an analogous Python script one could trigger many more things.
