Show HN: Turn .cursorrules / repo guidelines into GitHub pre-merge checks (OSS)

3•dkargatzis•1h ago

Building an open-source repo with deterministic validators backed by agent evaluation loops to turn agent instructions (e.g. .cursorrules, claude-guidelines.md, copilot-prompts.md) into hard guarantees.

Comments

dkargatzis•1h ago

Hi HN!

Maintainers are increasingly dealing with large volumes of low-context PRs, especially as AI coding tools make it easier to generate changes without the surrounding intent.

Many repos already define contribution expectations in places like .cursorrules, CONTRIBUTING.md, or internal guidelines, but those rules rarely translate into something enforceable during review.

We built Watchflow to bridge that gap: an open-source GitHub check that translates repository guidelines into enforceable validation logic that runs on every PR.

The model is intentionally hybrid:

- ~95% deterministic validators (fast, predictable, pure Python)

- Optional LLM checks for semantic cases like description / diff alignment

- LLM layer fails open so merges never block on model issues
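
To illustrate the fail-open behavior described above, here is a minimal sketch. It is not taken from the Watchflow codebase: `CheckResult`, `run_fail_open`, and `flaky_llm_check` are hypothetical names, and the real project may structure this differently. The idea is only that an error in the optional LLM check is recorded as a skipped pass rather than a blocking failure.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class CheckResult:
    passed: bool
    reason: str


def run_fail_open(check: Callable[[], CheckResult]) -> CheckResult:
    """Run an optional check; any exception counts as a pass (fail-open)."""
    try:
        return check()
    except Exception as exc:
        # Assumption: model outages or timeouts surface as exceptions.
        return CheckResult(True, f"skipped (fail-open): {exc}")


def flaky_llm_check() -> CheckResult:
    # Hypothetical stand-in for an LLM call that is currently unavailable.
    raise TimeoutError("model unavailable")


result = run_fail_open(flaky_llm_check)
```

With this shape, a check that does return normally still decides pass or fail on its own; only errors in the model call are downgraded to a skip.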

Example rules:

- Require a linked issue

- Require CODEOWNERS review

- Block PRs above a LOC threshold

- Ensure PR descriptions align with code changes

- Validate contribution patterns defined in .cursorrules or agent guidelines
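
As a rough idea of what deterministic validators for the first and third rules could look like, here is a small sketch. It is an assumption-laden illustration, not Watchflow's actual API: the `PullRequest` type, the function names, the 500-line default, and the choice to detect linked issues via closing keywords in the description are all hypothetical.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class PullRequest:
    description: str
    lines_changed: int


# Assumption: a "linked issue" is a closing keyword followed by an issue
# reference in the PR description, e.g. "Fixes #12".
ISSUE_REF = re.compile(
    r"\b(?:close[sd]?|fix(?:e[sd])?|resolve[sd]?)\s+#\d+", re.IGNORECASE
)


def has_linked_issue(pr: PullRequest) -> bool:
    return bool(ISSUE_REF.search(pr.description))


def within_loc_limit(pr: PullRequest, max_lines: int = 500) -> bool:
    return pr.lines_changed <= max_lines


def validate(pr: PullRequest, max_lines: int = 500) -> List[str]:
    """Return a list of human-readable failures; empty means all checks pass."""
    failures = []
    if not has_linked_issue(pr):
        failures.append("no linked issue found in description")
    if not within_loc_limit(pr, max_lines):
        failures.append(f"{pr.lines_changed} changed lines exceeds {max_lines}")
    return failures


good = validate(PullRequest("Fixes #12: tidy parser", 40))
bad = validate(PullRequest("misc changes", 900))
```

Both checks are pure functions of the PR data, which is what keeps them fast and predictable compared with the LLM-backed checks.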

The goal is to close the loop between repository intent -> code diff -> enforceable governance.

Repo: https://github.com/warestack/watchflow

If there are review patterns you’re seeing that aren’t covered yet, happy to turn them into new validators!

adithya07•1h ago

Is it self-hostable for private repos?

DevanandGowda•1h ago

Looks like it. I tried it on my vibe-coded project and it looks pretty good so far.

dkargatzis•1h ago

Yes, both public and private. There is a guide with instructions in the repo.
