frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Git bayesect – Bayesian Git bisection for non-deterministic bugs

https://github.com/hauntsaninja/git_bayesect
63•hauntsaninja•4d ago

Comments

hauntsaninja•3d ago
git bisect works great for tracking down regressions, but relies on the bug presenting deterministically. But what if the bug is non-deterministic? Or worse, your behaviour was always non-deterministic, but something has changed, e.g. your tests went from somewhat flaky to very flaky.

In addition to the repo linked in the title, I also wrote up a little bit of the math behind it here: https://hauntsaninja.github.io/git_bayesect.html

Myrmornis•1h ago
This is really cool! Is there an alternative way of thinking about it involving a hidden markov model, looking for a change in value of an unknown latent P(fail)? Or does your approach end up being similar to whatever the appropriate Bayesian approach to the HMM would be?
supermdguy•3d ago
Okay this is really fun and mathematically satisfying. Could even be useful for tough bugs that are technically deterministic, but you might not have precise reproduction steps.

Does it support running a test multiple times to get a probability for a single commit instead of just pass/fail? I guess you’d also need to take into account the number of trials to update the Beta properly.

hauntsaninja•3d ago
Yay, I had fun with it too!

IIUC the way you'd do that right now is just repeatedly recording the individual observations on a single commit, which effectively gives it a probability + the number of trials to do the Beta update. I don't yet have a CLI entrypoint to record a batch observation of (probability, num_trials), but it would be easy to add one

But ofc part of the magic is that git_bayesect's commit selection tells you how to be maximally sample efficient, so you'd only want to do a batch record if your test has high constant overhead

Retr0id•1h ago
Super cool!

A related situation I was in recently was where I was trying to bisect a perf regression, but the benchmarks themselves were quite noisy, making it hard to tell whether I was looking at a "good" vs "bad" commit without repeated trials (in practice I just did repeats).

I could pick a threshold and use bayesect as described, but that involves throwing away information. How hard would it be to generalize this to let me plug in a raw benchmark score at each step?

davidkunz•40m ago
Useful for tests with LLM interactions.

You're still signing data structures the wrong way

https://blog.foks.pub/posts/domain-separation-in-idl/
23•malgorithms•47m ago•5 comments

Windows 95 defenses against installers that overwrite a file with an older one

https://devblogs.microsoft.com/oldnewthing/20260324-00/?p=112159
41•michelangelo•3d ago•6 comments

EmDash – a spiritual successor to WordPress that solves plugin security

https://blog.cloudflare.com/emdash-wordpress/
349•elithrar•4h ago•249 comments

Ask HN: Who is hiring? (April 2026)

142•whoishiring•5h ago•116 comments

TurboQuant KV Compression and SSD Expert Streaming for M5 Pro and IOS

https://github.com/SharpAI/SwiftLM
65•aegis_camera•2h ago•37 comments

Show HN: Git bayesect – Bayesian Git bisection for non-deterministic bugs

https://github.com/hauntsaninja/git_bayesect
63•hauntsaninja•4d ago•6 comments

AI for American-produced cement and concrete

https://engineering.fb.com/2026/03/30/data-center-engineering/ai-for-american-produced-cement-and...
99•latchkey•3h ago•81 comments

StepFun 3.5 Flash is #1 cost-effective model for OpenClaw tasks (300 battles)

https://app.uniclaw.ai/arena?tab=costEffectiveness&via=hn
98•skysniper•4h ago•39 comments

An Introduction to Writing Systems and Unicode

https://r12a.github.io/scripts/tutorial/part2
35•mariuz•3d ago•7 comments

Show HN: Zerobox – Sandbox any command with file, network, credential controls

https://github.com/afshinm/zerobox
63•afshinmeh•2d ago•64 comments

The Anti-Intellectualism of Silicon Valley Elites

https://www.elizabethspiers.com/the-anti-intellectualism-of-silicon-valley-elites/
3•speckx•17m ago•0 comments

CERN levels up with new superconducting karts

https://home.cern/news/news/engineering/cern-levels-new-superconducting-karts
365•fnands•13h ago•80 comments

Show HN: Real-time dashboard for Claude Code agent teams

https://github.com/simple10/agents-observe
57•simple10•4h ago•20 comments

The OpenAI Graveyard: All the Deals and Products That Haven't Happened

https://www.forbes.com/sites/phoebeliu/2026/03/31/openai-graveyard-deals-and-products-havent-happ...
171•dherls•4h ago•134 comments

The AI Marketing BS Index

https://bastian.rieck.me/blog/2026/bs/
66•speckx•2h ago•9 comments

Apple at 50

https://www.apple.com/
61•janandonly•1h ago•30 comments

NASA Artemis II moon mission live launch broadcast

https://plus.nasa.gov/scheduled-video/nasas-artemis-ii-crew-launches-to-the-moon-official-broadcast/
243•apitman•3h ago•148 comments

Is BGP safe yet?

https://isbgpsafeyet.com/
213•janandonly•7h ago•75 comments

Random numbers, Persian code: A mysterious signal transfixes radio sleuths

https://www.rferl.org/a/mystery-numbers-station-persian-signal-iran-war/33700659.html
91•thinkingemote•9h ago•91 comments

Ukrainian Drone Holds Position for 6 Weeks

https://defenceleaders.com/news/ukrainian-combat-robot-holds-frontline-position-for-six-weeks-in-...
90•AftHurrahWinch•2h ago•61 comments

Wasmer (YC S19) Is Hiring – Rust and DevRel Positions

https://www.workatastartup.com/companies/wasmer
1•syrusakbary•8h ago

Ada and Spark on ARM Cortex-M – A Tutorial with Arduino and Nucleo Examples

http://inspirel.com/articles/Ada_On_Cortex.html
47•swq115•4d ago•15 comments

Intuiting Pratt Parsing

https://louis.co.nz/2026/03/26/pratt-parsing.html
129•signa11•2d ago•42 comments

Claude Wrote a Full FreeBSD Remote Kernel RCE with Root Shell (CVE-2026-4747)

https://github.com/califio/publications/blob/main/MADBugs/CVE-2026-4747/write-up.md
216•ishqdehlvi•15h ago•97 comments

Consider the Greenland Shark (2020)

https://www.lrb.co.uk/the-paper/v42/n09/katherine-rundell/consider-the-greenland-shark
74•mooreds•5d ago•31 comments

Randomness on Apple Platforms (2024)

https://blog.xoria.org/randomness-on-apple-platforms/
46•surprisetalk•5d ago•1 comments

Show HN: CLI to order groceries via reverse-engineered REWE API (Haskell)

https://github.com/yannick-cw/korb
183•wazHFsRy•2d ago•78 comments

Claude Code Unpacked : A visual guide

https://ccunpacked.dev/
991•autocracy101•15h ago•353 comments

Chess in SQL

https://www.dbpro.app/blog/chess-in-pure-sql
172•upmostly•3d ago•42 comments

SpaceX confidentially files to go public at $1.75T, reports say

https://www.theguardian.com/technology/2026/apr/01/spacex-public-offering-stock-market
14•bookofjoe•1h ago•4 comments