Show HN: Git bayesect – Bayesian Git bisection for non-deterministic bugs

https://github.com/hauntsaninja/git_bayesect
85•hauntsaninja•4d ago

Comments

hauntsaninja•4d ago
git bisect works great for tracking down regressions, but relies on the bug presenting deterministically. But what if the bug is non-deterministic? Or worse, your behaviour was always non-deterministic, but something has changed, e.g. your tests went from somewhat flaky to very flaky.

In addition to the repo linked in the title, I also wrote up a little bit of the math behind it here: https://hauntsaninja.github.io/git_bayesect.html
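To make the idea concrete, here is a minimal sketch of Bayesian bisection under a simplifying assumption: the failure rates before and after the change (`p_old`, `p_new`) are known and fixed. This is not git_bayesect's actual model or API (the linked write-up covers the real math); the function name and parameters are hypothetical and only illustrate how noisy pass/fail observations shift belief about which commit introduced the change.

```python
import math


def posterior_over_first_bad(n_commits, observations, p_old, p_new):
    """Posterior over the index of the first commit with the new failure rate.

    Hypothesis b: commits [0, b) fail with probability p_old and
    commits [b, n_commits) fail with probability p_new.
    observations is a list of (commit_index, failed) pairs.
    Assumes 0 < p_old < 1 and 0 < p_new < 1, and a uniform prior over b.
    """
    log_post = []
    for b in range(n_commits):
        lp = 0.0  # log of the uniform prior, up to a constant
        for idx, failed in observations:
            p = p_new if idx >= b else p_old
            lp += math.log(p if failed else 1.0 - p)
        log_post.append(lp)

    # Normalise in log space to avoid underflow with many observations.
    m = max(log_post)
    weights = [math.exp(lp - m) for lp in log_post]
    total = sum(weights)
    return [w / total for w in weights]


# Example: tests went from "somewhat flaky" (5%) to "very flaky" (50%).
observations = [
    (2, False), (2, False), (3, False),
    (4, True), (5, True), (5, False), (5, True),
]
posterior = posterior_over_first_bad(8, observations, p_old=0.05, p_new=0.5)
most_likely = max(range(len(posterior)), key=posterior.__getitem__)
```

Unlike plain `git bisect`, a single pass or fail never rules a commit out; it only reweights the hypotheses, so repeated observations on the same commit remain informative.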

Myrmornis•1h ago
This is really cool! Is there an alternative way of thinking about it involving a hidden markov model, looking for a change in value of an unknown latent P(fail)? Or does your approach end up being similar to whatever the appropriate Bayesian approach to the HMM would be?

supermdguy•3d ago
Okay this is really fun and mathematically satisfying. Could even be useful for tough bugs that are technically deterministic, but you might not have precise reproduction steps.

Does it support running a test multiple times to get a probability for a single commit instead of just pass/fail? I guess you’d also need to take into account the number of trials to update the Beta properly.

hauntsaninja•3d ago
Yay, I had fun with it too!

IIUC the way you'd do that right now is just repeatedly recording the individual observations on a single commit, which effectively gives it a probability + the number of trials to do the Beta update. I don't yet have a CLI entrypoint to record a batch observation of (probability, num_trials), but it would be easy to add one.

But ofc part of the magic is that git_bayesect's commit selection tells you how to be maximally sample efficient, so you'd only want to do a batch record if your test has high constant overhead.
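As a rough illustration of why repeated single observations and a batch record carry the same information, here is a sketch of the standard Beta-Bernoulli update for one commit's failure probability. The function names are hypothetical and are not part of git_bayesect's CLI or internals; the sketch assumes a Beta(alpha, beta) belief where alpha counts failures and beta counts passes.

```python
def record_one(alpha, beta, failed):
    """Update a Beta(alpha, beta) belief with a single pass/fail observation."""
    return (alpha + 1, beta) if failed else (alpha, beta + 1)


def record_batch(alpha, beta, fail_rate, num_trials):
    """Update the same belief from a (probability, num_trials) summary.

    The summary is converted back to counts, so it must come from real
    trials: fail_rate * num_trials should be (close to) a whole number.
    """
    fails = round(fail_rate * num_trials)
    return alpha + fails, beta + (num_trials - fails)


# Ten runs on one commit, three of which failed, starting from Beta(1, 1).
state = (1, 1)
for failed in [True, False, False, True, False, False, False, True, False, False]:
    state = record_one(*state, failed)

batched = record_batch(1, 1, fail_rate=0.3, num_trials=10)
posterior_mean = state[0] / (state[0] + state[1])
```

Both routes end at the same Beta parameters, which is why a batch entrypoint would be a convenience rather than a change to the model.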

__s•19m ago
recompiling can be high constant overhead

Retr0id•1h ago
Super cool!

A related situation I was in recently was where I was trying to bisect a perf regression, but the benchmarks themselves were quite noisy, making it hard to tell whether I was looking at a "good" vs "bad" commit without repeated trials (in practice I just did repeats).

I could pick a threshold and use bayesect as described, but that involves throwing away information. How hard would it be to generalize this to let me plug in a raw benchmark score at each step?
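One way such a generalization could look, purely as a hypothetical sketch and not something git_bayesect is stated to support: replace the pass/fail likelihood with a Gaussian likelihood on the raw score. The sketch assumes the benchmark mean before and after the regression (`mean_old`, `mean_new`) and the noise level (`sigma`) are known, which is a strong simplification; all names are illustrative.

```python
import math


def gaussian_log_pdf(x, mean, sigma):
    """Log density of a normal distribution with the given mean and std dev."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (x - mean) ** 2 / (2 * sigma ** 2)


def posterior_from_scores(n_commits, scores, mean_old, mean_new, sigma):
    """Posterior over the first regressed commit, given raw benchmark scores.

    Hypothesis b: commits [0, b) score around mean_old and
    commits [b, n_commits) score around mean_new.
    scores is a list of (commit_index, score) pairs; the prior over b is uniform.
    """
    log_post = []
    for b in range(n_commits):
        lp = 0.0
        for idx, score in scores:
            mean = mean_new if idx >= b else mean_old
            lp += gaussian_log_pdf(score, mean, sigma)
        log_post.append(lp)

    # Normalise in log space for numerical stability.
    m = max(log_post)
    weights = [math.exp(lp - m) for lp in log_post]
    total = sum(weights)
    return [w / total for w in weights]


# Noisy timings in ms: roughly 100 before the regression, 120 after.
scores = [(1, 101.0), (6, 118.0), (3, 99.0), (4, 121.0), (4, 117.0)]
score_posterior = posterior_from_scores(8, scores, mean_old=100.0, mean_new=120.0, sigma=5.0)
likely_regression = max(range(len(score_posterior)), key=score_posterior.__getitem__)
```

Compared with thresholding each run into "good" or "bad", a score far from both means contributes little either way, while a score close to one mean counts strongly, so no information is discarded.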

davidkunz•1h ago
Useful for tests with LLM interactions.

SugarReflex•2m ago
I hope this comment is not out of place, but I am wondering what the application for all this is? How can this help us or what does it teach us or help us prove? I am asking out of genuine curiosity as I barely understand it but I believe it has something to do with probability.
