frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Over 36,500 killed in Iran's deadliest massacre, documents reveal

https://www.iranintl.com/en/202601255198
63•mhb•55m ago•15 comments

Clawdbot - open source personal AI assistant

https://github.com/clawdbot/clawdbot
49•KuzeyAbi•1h ago•24 comments

First, make me care

https://gwern.net/blog/2026/make-me-care
318•andsoitis•6h ago•105 comments

Scientists identify brain waves that define the limits of 'you'

https://www.sciencealert.com/scientists-identify-brain-waves-that-define-the-limits-of-you
20•mikhael•1h ago•0 comments

A macOS app that blurs your screen when you slouch

https://github.com/tldev/posturr
463•dnw•10h ago•160 comments

Show HN: A small programming language where everything is pass-by-value

https://github.com/Jcparkyn/herd
37•jcparkyn•2h ago•12 comments

Case study: Creative math – How AI fakes proofs

https://tomaszmachnik.pl/case-study-math-en.html
27•musculus•2h ago•14 comments

Guix for Development

https://dthompson.us/posts/guix-for-development.html
23•clircle•5d ago•1 comments

Doom has been ported to an earbud

https://doombuds.com
341•arin-s•13h ago•106 comments

Oneplus phone update introduces hardware anti-rollback

https://consumerrights.wiki/w/Oneplus_phone_update_introduces_hardware_anti-rollback
345•validatori•4h ago•155 comments

Spanish track was fractured before high-speed train disaster, report finds

https://www.bbc.com/news/articles/c1m77dmxlvlo
129•Rygian•6h ago•114 comments

Using PostgreSQL as a Dead Letter Queue for Event-Driven Systems

https://www.diljitpr.net/blog-post-postgresql-dlq
161•tanelpoder•9h ago•50 comments

The '3.5% rule': How a small minority can change the world (2019)

https://www.bbc.com/future/article/20190513-it-only-takes-35-of-people-to-change-the-world
172•choult•4h ago•129 comments

I was right about ATProto key management

https://notes.nora.codes/atproto-again/
111•todsacerdoti•6h ago•65 comments

Show HN: An interactive map of US lighthouses and navigational aids

https://www.lighthouses.app/
30•idd2•7h ago•8 comments

Show HN: FaceTime-style calls with an AI Companion (Live2D and long-term memory)

https://thebeni.ai/
7•summerlee9611•2h ago•0 comments

Bitwise conversion of doubles using only FP multiplication and addition (2020)

https://dougallj.wordpress.com/2020/05/10/bitwise-conversion-of-doubles-using-only-floating-point...
19•vitaut•10h ago•1 comments

Show HN: Elo ranking for landing pages

https://landingleaderboard.com/
10•Intragalactic•53m ago•1 comments

Turbopack: Building faster by building less

https://nextjs.org/blog/turbopack-incremental-computation
16•feross•5d ago•11 comments

Web-based image editor modeled after Deluxe Paint

https://github.com/steffest/DPaint-js
177•bananaboy•12h ago•15 comments

The behavioral cost of personalized pricing

https://digitalseams.com/blog/the-behavioral-cost-of-personalized-pricing
54•bobbiechen•6h ago•32 comments

Infinite pancakes, anyone?

https://www.nytimes.com/2026/01/20/science/infinite-pancake-math-puzzle.html
18•cainxinth•3d ago•4 comments

ICE using Palantir tool that feeds on Medicaid data

https://www.eff.org/deeplinks/2026/01/report-ice-using-palantir-tool-feeds-medicaid-data
866•JKCalhoun•7h ago•515 comments

Introduction to PostgreSQL Indexes

https://dlt.github.io/blog/posts/introduction-to-postgresql-indexes/
291•dlt•17h ago•14 comments

Show HN: Bonsplit – Tabs and splits for native macOS apps

https://bonsplit.alasdairmonk.com
206•sgottit•13h ago•26 comments

Hackable personal news reader in bash pipes

https://github.com/haron/news.sh
22•haron•5d ago•5 comments

Optimizing GPU Programs from Java Using Babylon and Hat

https://openjdk.org/projects/babylon/articles/hat-matmul/hat-matmul
26•pjmlp•5d ago•2 comments

Show HN: Netfence – Like Envoy for eBPF Filters

https://github.com/danthegoodman1/netfence
41•dangoodmanUT•10h ago•6 comments

Nango (YC W23, Dev Infrastructure) Is Hiring Remotely

https://jobs.ashbyhq.com/Nango
1•bastienbeurier•13h ago

A flawed paper in management science has been cited more than 6k times

https://statmodeling.stat.columbia.edu/2026/01/22/aking/
625•timr•16h ago•329 comments
Open in hackernews

Case study: Creative math – How AI fakes proofs

https://tomaszmachnik.pl/case-study-math-en.html
27•musculus•2h ago

Comments

benreesman•1h ago
They can all write lean4 now, don't accept numbers that don't carry proofs. The CAS I use for builds has a coeffect discharge cert in the attestation header, couple lines of code. Graded monads are a snap in CIC.
dehsge•28m ago
There are some numbers that are uncomputable in lean. You can do things to approximate them in lean however, those approximates may still be wrong. Leans uncomputable namespace is very interesting.
fragmede•1h ago
> a session with Gemini 2.5 Pro (without Code Execution tools)

How good are you at programming on a whiteboard? How good is anybody? With code execution tools withheld from me, I'll freely admit that I'm pretty shit at programming. Hell, I barely remember the syntax in some of the more esoteric, unpracticed places of my knowledge. Thus, it's hard not to see case studies like this as dunking on a blindfolded free throw shooter, and calling it analysis.

blibble•1h ago
> How good are you at programming on a whiteboard?

pretty good?

I could certainly do a square root

(given enough time, that one would take me a while)

htnthrow11220•1h ago
It’s like that but if the blindfolded free throw shooter was also the scorekeeper and the referee & told you with complete confidence that the ball went in, when you looked away for a second.
cmiles74•39m ago
It's pretty common for software developers to be asked to code up some random algorithm on a whiteboard as part of the interview process.
semessier•1h ago
that's not a proof
groundzeros2015•32m ago
I think it’s a good way to prove x = sqrt(y). What’s your concern?
frontfor•31m ago
Agreed. Asking the AI to do a calculation isn’t the same as asking it to “prove” a mathematical statement in the usual meaning.
v_CodeSentinal•1h ago
This is the classic 'plausible hallucination' problem. In my own testing with coding agents, we see this constantly—LLMs will invent a method that sounds correct but doesn't exist in the library.

The only fix is tight verification loops. You can't trust the generative step without a deterministic compilation/execution step immediately following it. The model needs to be punished/corrected by the environment, not just by the prompter.

zoho_seni•1h ago
I've been using codex and never had a compile time error by the time it finishes. Maybe add to your agents to run TS compiler, lint and format before he finish and only stop when all passes.
rakmo•42m ago
Is this hallucination, or is this actually quite human (albeit a specific type of human)? Think of slimy caricatures like a used car salesman, isn't this the exact type of underhandedness you'd expect?
bwfan123•40m ago
I am actually surprised that the LLM came so close. I doubt it had examples in its training set for these numbers. This goes to the heart of "know-how". The LLM should should have said: "I am not sure" but instead gets into rhetoric to justify itself. It actually mimics human behavior for motivated reasoning. At orgs, management is impressed with this overconfident motivated reasoner as it mirrors themselves. To hell with the facts, and the truth, persuation is all that matters.
threethirtytwo•29m ago
You don’t need a test to know this we already know there’s heavy reinforcement training done on these models so it optimizes for passing the training. Passing the training means convincing the person rating the answers and that the answer is good.

The keyword is convince. So it just needs to convince people that’s it’s right.

It is optimizing for convincing people. Out of all answers that can convince people some can be actual correct answers, others can be wrong answers.