Show HN: Dayflow – A git log for your day

https://github.com/JerryZLiu/Dayflow

1•jerryliu12•1h ago

Hi HN! I've been building Dayflow, a macOS app that automatically tracks what you're actually working on (not just which apps you have open).

Here's what it does:

- It creates a semantic timeline of your day;

- It does this by understanding the content on your screen (with local or cloud VLMs);

- This allows you to see exactly where your time went without any manual logging.

Traditional time trackers tell you "3 hours in Chrome", which isn't very helpful. Dayflow understands whether you're reading documentation, debugging code, or scrolling HN. Instead of "Chrome: 3 hours", you get "Reviewed PR comments: 45min", "Read HN thread about Rust: 20min", "Debugged auth flow: 1.5hr".
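To make the "git log" idea concrete, here is a minimal sketch of how labeled activities could be rendered in the format above. The `Activity` type and both helper functions are hypothetical names for illustration, not Dayflow's actual data model.

```python
from dataclasses import dataclass


@dataclass
class Activity:
    """One entry in the day's timeline (hypothetical shape)."""
    title: str
    minutes: int


def format_duration(minutes: int) -> str:
    """Format minutes the way the examples above do: 45min, 1.5hr."""
    if minutes < 60:
        return f"{minutes}min"
    # ":g" drops trailing zeros, so 90 minutes becomes "1.5" and 60 becomes "1".
    return f"{minutes / 60:g}hr"


def render_log(activities: list[Activity]) -> list[str]:
    """Turn activities into one log line each."""
    return [f"{a.title}: {format_duration(a.minutes)}" for a in activities]


day = [
    Activity("Reviewed PR comments", 45),
    Activity("Read HN thread about Rust", 20),
    Activity("Debugged auth flow", 90),
]
for line in render_log(day):
    print(line)
```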

I was an early Rewind user but rarely used the retrieval feature. I built Dayflow because I saw other interesting uses for screen data. I find that it helps me stay on track while working: I check it every few hours to make sure I’m spending my time the way I intended, and if I’m not, I try to course-correct.

Here’s what you need to know about privacy:

- Runs 100% locally using Qwen2.5-VL-3B (~4GB model)

- No cloud uploads, no account

- Full source available under MIT license (https://github.com/JerryZLiu/Dayflow)

- Optional: BYO Gemini API key for better quality (stored in Keychain, with free-tier workaround to prevent training on your data)

The tech stack is pretty simple: SwiftUI with a local SQLite DB. It uses native macOS APIs for efficient screen capture. Since most people who run LLMs locally already have their tool of choice (Ollama, LM Studio, etc.), I decided not to embed an LLM into Dayflow.
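For readers who haven't talked to a local model server before, here is a rough sketch of what sending one screenshot to Ollama can look like. It uses Ollama's `/api/generate` endpoint on its default port. The model tag, the prompt, and the `build_request` helper are assumptions for illustration, and Dayflow itself may well call the server differently.

```python
import base64
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint
MODEL_TAG = "qwen2.5vl:3b"  # assumed tag; use whatever your local server lists


def build_request(frame_png: bytes, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a vision request for one captured frame."""
    payload = {
        "model": MODEL_TAG,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(frame_png).decode("ascii")],
        "stream": False,  # ask for a single JSON response instead of a stream
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_request(b"fake-png-bytes", "Describe what the user is doing.")

# Sending it requires a running Ollama server with the model pulled:
# with urllib.request.urlopen(request) as resp:
#     print(json.loads(resp.read())["response"])
```

Building the request separately from sending it keeps the sketch runnable without a server installed.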

By far the biggest challenge was adapting from SOTA vision models like Gemini 2.5 Pro to small, local models. My constraints were that it had to take up <4GB of RAM and have vision capabilities. I had to do a lot of evals to figure out that Qwen2.5-VL-3B was the best balance of size and quality, but there was still a sizable tradeoff in quality that I had to accept. I also got creative with sampling rates and prompt chunking to deal with the 100x smaller context window. Processing a 15 minute segment takes ~32 local LLM calls vs 2 Gemini calls!
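As a rough illustration of why a small context window multiplies the number of calls, here is a sketch of sampling plus chunking. The sampling interval and frames-per-call values are made-up numbers, not Dayflow's real settings, and the function names are hypothetical.

```python
def sample_timestamps(segment_seconds: int, interval_seconds: int) -> list[int]:
    """Capture times (in seconds) kept after downsampling the recording."""
    return list(range(0, segment_seconds, interval_seconds))


def chunk(frames: list[int], frames_per_call: int) -> list[list[int]]:
    """Split frames into batches small enough for one model call each."""
    return [
        frames[i:i + frames_per_call]
        for i in range(0, len(frames), frames_per_call)
    ]


# Assumed settings: one frame every 15 seconds, 2 frames per local call.
frames = sample_timestamps(15 * 60, 15)
local_calls = chunk(frames, 2)

print(len(frames))       # 60 frames in a 15 minute segment
print(len(local_calls))  # 30 calls when only 2 frames fit per call
```

With a large-context model the same frames could go out in one or two requests, which is the gap described above. Any extra calls to merge the per-chunk descriptions would come on top of this count.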

Here’s what I’m working on next:

- Distillation: using Gemini's high-quality outputs as training data to teach a local model the patterns it needs, hopefully closing the quality gap.

- Custom dashboards where you can track answers to any question, like "How long did I spend on HN?" or "Hours until my first deep work session of the day".
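A dashboard question like "How long did I spend on HN?" could boil down to a query over the local SQLite DB. The sketch below assumes a hypothetical `timeline` table with start and end times in minutes since midnight. The real schema is not described here, so treat every table and column name as an assumption.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the app's local database
conn.execute(
    "CREATE TABLE timeline (start_min INTEGER, end_min INTEGER, title TEXT)"
)
conn.executemany(
    "INSERT INTO timeline VALUES (?, ?, ?)",
    [
        (540, 585, "Reviewed PR comments"),
        (585, 605, "Read HN thread about Rust"),
        (605, 695, "Debugged auth flow"),
        (695, 705, "Scrolled HN"),
    ],
)


def minutes_matching(db: sqlite3.Connection, keyword: str) -> int:
    """Total minutes across entries whose title contains the keyword."""
    row = db.execute(
        "SELECT COALESCE(SUM(end_min - start_min), 0) "
        "FROM timeline WHERE title LIKE ?",
        (f"%{keyword}%",),
    ).fetchone()
    return row[0]


print(minutes_matching(conn, "HN"))  # 30 (20 + 10 minutes)
```

A plain keyword match is the simplest possible version. Questions like "hours until my first deep work session" would need some notion of categories on each entry.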

I'd love to hear your thoughts, especially if you've struggled with productivity tracking or have ideas for what you'd want from a tool like this.

Comments

tiernano•1h ago

wait... isn't this pretty much what Microsoft was doing with Recall?

jerryliu12•1h ago

Recall (and Rewind) are similar in the sense that they both use screen data, but they're designed for retrieving specific things you saw, not semantically summarizing your time. My opinion is that they're completely different feature sets.

chewhongjun96•1h ago

Is it possible to include wearables as a data source?

e.g. Apple Watch for sleep, running, and activity levels? It could really give a 360 view of your life.

jerryliu12•1h ago

That would be really cool, but for the foreseeable future there's still a lot of room to improve how screen data is used so I'll mostly be focused on that.
