Myself, mostly. Trying to wrestle with realizing how much time I've not been spending on my supposedly main project[1] and questioning whether it's really worth doing.
> Any new ideas that you're thinking about?
Way too many. Writing todo lists is part of working on myself.
[1]: PAPER, a pure-Python ~(pip/pipx replacement), from scratch with an emphasis on simplicity and elegance. https://github.com/zahlman/paper . There's more locally that I haven't pushed, including factoring some stuff out into a separate project and planning more of the same. But yeah.
An AI based time tracker: reconstructs your day from whatever it sees you doing. Screenshot based but never stores them.
The same tech stack is pretty easily adaptable to openclaw tracking. If anybody would like to try, DM
Also looking into AI based security tools for monitoring security of DoneThat. Thinking of using zeropath would love to hear if people tried them / have other suggestions
This feels like it will very easily segway into corporate "spyware" if you ever start doing enterprise plans.
What's your take on that?
So spyware in the sense of getting information without the employee knowing would be impossible and not something I’d ever want to do.
It does enable transparency on a very abstracted level: your team could see a six bullet point summary of your day if you opt in. I believe this kind of transparency can actually help more teams go remote, cut down on sync meetings, etc.
I’m currently experimenting with a feature that shows relative time spent only, not absolute - so e.g. 30% on project X, 20% on admin, etc. That could be the sweet spot on visibility vs privacy.
It's a free USCIS form-filling web-app(no Adobe required). USCIS forms still use XFA PDFs, which don’t let you edit in most browsers. Even with Adobe, fields break, and getting the signature is hard.
So I converted the PDF form into modern, browser-friendly web forms - and kept every field 1:1 with the original. You fill the form, submit it, and get the official USCIS PDF filled.
I found out simplecitizen offers a DIY plan for $529 (https://www.simplecitizen.com/pricing/)
So, a free (and local-only) version might be a good alternative
Full project: https://euzoia.org
Tried to be super low-tech: Notion, super.so, Spotify creators, riverside.
Now thinking of building an email-based agent for behaviour change accountability. Would love any pointers to good UX for email-based AI assistants.
It's an infinite canvas that runs SQL.
I've been working with data my entire career. I feel like we need to alt+tab so much. What if we just put it all on a canvas?
Currently very WIP, but there's a simple titanic demo available!
Built with tldraw and duckdb wasm, running on cloudflare durable objects
Lots of work left to do, but happy to have a working version up. It's an interactive map that currently shows all the routes and stops for SF Muni, BART, Caltrain, samTrans, and VTA. There are many more agencies (official and unofficial) in the bay, so I'll be adding those throughout the next few days as I sort out the data.
Finding the data and cleaning/normalizing it is a real pain, so if anyone knows a good place to find them (and normalize them), please do share
Behind the scenes I’m rebuilding the sync engine to properly support offline mode. Trying to get to instant opens for the app (and of course work offline). It’s probably my 5th sync engine. It’s been really fun to see how much easier, faster, better, etc each new iteration is.
(And the project at large is https://phrasing.app - a language learning app for polyglots. It’s like anki but designed to be enjoyed)
It's a daily puzzles website focused on logic puzzles at this moment. I have about 70 subscribers, and it's online since Dec/25.
im building Satori to fix this -https://www.usesatori.sh/
would love feedback!
https://www.inclusivecolors.com/
Unlike most tools based around autogenerating colors, this is more of an editor that lets you fully customise all the tint/shades to your liking with a focus on accessibility. This is important when you've got existing brand colors to include and want to find accessible color combinations that work together.
Would love feedback in general and especially from designers/devs who have different needs in how they go about creating branded palettes!
Thanks! Any problems you've found with this approach or it's usually good enough?
For me, I couldn't find a tool that would let me customize multiple color scales at once, check they look good together on a mockup, and also be accessible. It's one of those problems where you can autogenerate something that gets you most of the way there, but then for it to be usable you need need to see how it looks on designs and fine tweak it.
Current coverage is the US, more countries coming soon.
Interpretation of SysML activity diagrams as temporal logic for use with state machine specifications.
Module system for state machine with scoping, ownership type system and attendant theorems to carry proofs of LTL properties about individual parts forward after composition.
“Compiles” to SQL, but with a different structural paradigm.
Also, watching a bunch of videos and reading docs on OpenClaw. I had thought I'd do an install of it sometime this weekend, but I don't know if I'll get to that at this point or not.
And lastly, messing with Spring AI[2]. I wanted to get a local build of that going so I can dig into the bowels of it and hack on it a bit. So I got that repo cloned and ran a quick build, and now I plan to start exploring the codebase.
I'm also working on a new strength gains-tracking app that is a lot more intuitive, motivating and friend first. I've been using it with some friends for the last 10 weeks and everyone making is consistent gains. It is my first full PWA, vanillaJs, backend is Lucee & MySQL. Works great on iOS and Android, no one has any complaints. The web stack has come a long way I am probably not going to do a native mobile app for a while. I'll probably make it public in a couple weeks.
It’s been fun to come back to, most of the code I wrote still drives the business (it’s just far outdated).
I was pretty early on in my career when I wrote it, so seeing my mistakes and all the potential areas to improve has been very interesting. It’s like buying back your old high school Camaro that you used to wrench on.
A platform for probers, alerts, playbooks, incidents .etc
Trying to make it as easy as possible to follow SRE procedures
The idea is to get tons of reps in, across varied situations, with excellent advice to build good intuitions and decision making abilities. Or to stop making bad or terrible decisions. Or just play poker for free.
I'd like to monetize with at least the hand history format open sourced. Ping me if you would like to get involved with GTM and the revenue side of things.
>Ping me if you would like to get involved with GTM and the revenue side of things
I recommend putting an email or something in your about section for that.
1. An app for personalized interactive audiobooks for kids - https://www.vivid.cx
2. A book about the edge of the thinkable - https://www.unthinkable.net
Game idea: DroneCraft is a third-person drone exploration game where players scout the world for parts, craft powerful upgrades, and trade strategically to evolve their build.
Whats coming: Core mechanics are up and running. First playable version planned within a month, alongside open-sourcing the full codebase.
On-and-off again working on a Mystery Dungeon style game but I have a lot of obligations taking me away from it.
Planning on making demoscene entries this year.
I've been pretty bummer out by Rainbow 6 Siege X announcing they will never support Linux due to a lack of kernel-level anti-cheat support. While I can use NVIDIA shield to play from my Windows pc, id rather play something natively with friends (for context, we usually play 3v3's for funsies.
My goal is not to make an exact clone, but to make a smaller map version for 3v3 that is a bit more quick paced.
For context, it's a bomb defusal game where the main goal is intel and gadgets. You need to make the other side waste their gadgets so it comes down to a gun v gun fight.
I'm also experimenting with coverage-guided PBT input generation in the same library, AFL-style -- right now elm-test only has random input generation.
You can bookmark a job description (it will be parsed), then paste a question and it generates an answer based on your resume, the job description, and your previously given answers for similar questions in other applications. The generated answer can be refined through a follow-up chat and exported as a PDF. It also works as a simple job application tracker.
Saves me tons of time and effort every day!
1. Trying to improve the translation quality by giving LLM more context.
2. Fixing the issue where PowerPoint slides layout may become a bit messy after transition because of different text density between western and CJK languages.
Did I get that right?
- building an independent line of communication with your audience
- predictive, just in time notifications through push or email delivered when we predict that specific viewer has the time to view videos on YouTube, ensuring you stay on top of their notification stack and don't disappear amongst a flood of notifications.
I've got replicas now working with DML proxy. This essentially means I can now have a cluster of primaries, and then spin up replicas on demand and nodes talking to local host will never see their mutation work pretty transparently from readonly-replicas. While PoC works now the snapshot restore is extremely inefficient IMO yet.
My big takeaway lesson from this is that the APIs are clumsy, the frameworks are very rough, and we're still very much in the territory of having to roll your own bespoke solutions for everything instead of the whole thing "just working". For example:
Large file uploads are very inconsistent between providers. You get fun issues like a completed file upload being unusable because there's an extra "processing" step that you have to poll-wait for. (Surprise!)
The vendors all expose a "list models" API, none of which return a consistent and useful list of metadata.
Automatic context caching isn't.
Multi-modal inputs are still very "early days". Models are terrible at mixed-language input, multiple speakers, and also get confused by background noises, music, and singing.
You can tell an AI to translate the subtitles to language 'X', and it will.. most of the time. If you provide audio, it'll get confused and think that it is being asked to transcribe it! It'll return new English subtitles sometimes.
JSON schemas are a hint, not a constraint with some providers.
Some providers *cough*oogle*cough* don't support all JSON Schema constructs, so you can't safely use their API with arbitrary input types.
If you ask for a whole JSON document back, you'll get timeout errors.
If you stream your results, you have to handle reassembly and parsing yourself, the frameworks don't handle this scenario well yet.
You'd think a JSON list (JSONL) schema would be perfect for this scenario, but they're explicitly not supported by some providers!
Speaking of failures, you also get refusals and other undocumented errors you'll only discover in production. If you're maintaining a history or sliding window of context, you have to carefully maintain snapshots so you can roll back and retry. With most APIs you don't even know if the error was a temporary or permanent condition, of if your retry loop is eating into your budget or not.
Context size management is extra fun now that none of the mainstream models provide their tokenizer to use offline. Sometimes the input will fit into the context, sometimes it won't. You have to back off and retry with various heuristics that are problem-specific.
Ironically, the APIs are so new and undergoing so much churn that the AI models know nothing about them. And anyway, how could they? None of them are properly documented! Google just rewrote everything into the new "GenAI" SDK and OpenAI has a "Responses" API which is different from their "Chat" API... I don't know how. It just is.
Mainly I'm working on a task dispatch dashboard called Prompter Hawk that is designed to be the best UI for task management with agents. If you've been trying to parallelize by running multiple claude code terminals or codex terminals at once, this tool replaces those terminals and fits them all into one view with an AI task tracking board. It sounds more complicated than it is. It's a harness for Claude / Gemini / GPT models with a GUI that speeds up all your workflows. Rather than using sustained chat mode, all Prompter Hawk tasks are fire-and-forget. You just give the task description and come back when it's done. Parallelism first.
Some example highlight features:
-One dashboard view that shows all your parallel sessions and which tasks each agent has in progress and in their queue. Also shows recently completed tasks and outputs. This is my attempt at the ideal "pilot's cockpit view" for agentic development.
-Tasks are well tracked by the manager: see their status, file changes, and git commits. One click task retry. Get breakdowns on cost per run. Tasks can be set to automatically recur on a given schedule. Everything goes into a persistent local DB so you can easily pull up task data from months ago. Far far better user experience than trying to pull up old chat histories IMO.
-Timeline view and analytics views that give you hard stats on your velocity and how effectively your agents are using and updating your codebase. See unique stats like which of your files your agents read the most and how many daily LOC and commit changes you're doing. See how well you're parallelizing workloads at a simple glance.
-Automatic system diagram generation
-Task suggestion feature. If your agents are idle, they can draft tentative tasks to carry out next, based on the project history and your goals. This makes keeping multiple agents spinning actually much easier than you'd think. You don't need to be a multitasking context-switching god to do this.
I haven't shared it much (not even a Show HN) because the landing page isn't converting well at all yet, though I have some reddit ads doing well. I've had a bunch of free users sign up and a handful of paying users too. Looking for users or just feedback on anything! Sorry for wall of text.
Pre-codex:
Local card game: there's a very specific card game played in my country, there's online game rooms, but I want to get something like lichess.org or chess.com scale, oriented towards competitive play, with ELO (instead of social aspects), ideally I would get thousands of users and use it as a portfolio piece while making it open source.
cafetren.com.ar: Screen product for coffee shops near train stations with real time train data.
Post-codex:
SilverLetterai.com: Retook a project for an autonomous sales LLM assistant, building a semi-fake store to showcase the product (I can fulfill orders if they come by dropshipping), but I also have a friend and family order which I should do after this. 2 or 3 years late to the party, but there's probably a lot of work in this space for years to come.
Retook Chess Engine development, got unstuck by letting the agent do the boring busywork, I wish I would have done it without, but I don't have the greatest work ethic, hopefully one day I will manually code it.
Finally, like everyone else, I'm not quite 100% content with the coding agents, so I'm trying to build my own. Yet another coding agent thingy. But tbf this is more for myself than as a product. If it gets released it's as-is do what you want with it.
Pasture takes each signup, enriches it (title, company size, funding, tech stack, and more), and scores it 0-100 against your ICP. Alerts go to Slack with full context. You can also track which channels bring quality vs. junk over time, which has been the most useful part so far.
So I'm building Taskplan (https://taskplan.run) - it's like Ansible, but for people. Build a plan, assign tasks to people or teams, and get a real-time dashboard to track progress as the work happens.
I'd love feedback from anyone who deals with the same issues or works on ops-heavy projects.
In multi-agent setups, we kept running into issues where agents either hoarded resources or exhausted shared budgets unpredictably. So we built a control layer where agents operate using virtual credits, can temporarily rebalance budgets or split shared API costs, but everything stays under explicit human-defined limits with full audit logs and kill switches.
It’s intentionally not real money and not a financial product — more like infrastructure for coordinating agent spend safely. Mostly exploring how much autonomy you can give agents before cost becomes the real bottleneck.
Dimensionally accurate AI 3D modelling. My grandpa has a 3D printer but struggles to use any complex tools. So I am working on this chat interface to allow him to do some simple models.
So far he has triggered more than 150 generations. It’s getting better every model cycle and gives me something I enjoy working on.
The goal is to build cool, interesting sites for my newsletter to show that the old web is still alive and well.
I train BJJ and kept hearing the same pain points from academy owners regarding attendance tracking, communications, missing payments, etc.
So I built a tool for martial arts academies in 2024 with belts progression, automated payments, attendance tracking, and a tablet check-in system. Nowadays I'm still onboarding new academies every week and working a bit more on the marketing side to keep growing.
Used to pay $8/month, now I use around $4!
Orchestrates your local Claude Code, uses GitHub as the interface and state store.
Simple compared to Gas Town or Loom, less CLI, more human-in-the-loop, a little easier to hack.
My main goal is not just a "the model made code, yay!" setup, but verifiable outputs that can show degradation as percentages.
i.e. have the model make something like a connect 4 engine, and then run it through a lot of tests to see how "valid" it's solution is. Then score that solution as NN/100% accurate. Then do many runs of the same test at a fixed interval.
I have ~10 tests like this so far, working on more.
ebhn•3h ago
4b11b4•10m ago