frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Sping – An HTTP/TCP latency tool that's easy on the eye

https://dseltzer.gitlab.io/sping/docs/
36•zorlack•3h ago•3 comments

The two versions of Parquet

https://www.jeronimo.dev/the-two-versions-of-parquet/
131•tanelpoder•3d ago•31 comments

Busy beaver hunters reach numbers that overwhelm ordinary math

https://www.quantamagazine.org/busy-beaver-hunters-reach-numbers-that-overwhelm-ordinary-math-202...
34•defrost•2d ago•4 comments

Ghrc.io appears to be malicious

https://bmitch.net/blog/2025-08-22-ghrc-appears-malicious/
245•todsacerdoti•3h ago•30 comments

Is 4chan the perfect Pirate Bay poster child to justify wider UK site-blocking?

https://torrentfreak.com/uk-govt-finds-ideal-pirate-bay-poster-boy-to-sell-blocking-of-non-pirate...
172•gloxkiqcza•10h ago•128 comments

Prison isn’t set up for today’s tech so we have to do legal work the old way

https://prisonjournalismproject.org/2025/08/19/prisons-outdated-technology-hurts-our-chances-at-f...
69•danso•3h ago•38 comments

We put a coding agent in a while loop

https://github.com/repomirrorhq/repomirror/blob/main/repomirror.md
127•sfarshid•10h ago•95 comments

Making games in Go: 3 months without LLMs vs. 3 days with LLMs

https://marianogappa.github.io/software/2025/08/24/i-made-two-card-games-in-go/
260•maloga•12h ago•181 comments

My ZIP isn't your ZIP: Identifying and exploiting semantic gaps between parsers

https://www.usenix.org/conference/usenixsecurity25/presentation/you
41•layer8•3d ago•14 comments

A Brilliant and Nearby One-off Fast Radio Burst Localized to 13 pc Precision

https://iopscience.iop.org/article/10.3847/2041-8213/adf62f
48•gnabgib•7h ago•7 comments

Y Combinator files brief supporting Epic Games, says store fees stifle startups

https://www.macrumors.com/2025/08/21/y-combinator-epic-games-amicus-brief/
80•greenburger•3d ago•63 comments

Trees on city streets cope with drought by drinking from leaky pipes

https://www.newscientist.com/article/2487804-trees-on-city-streets-cope-with-drought-by-drinking-...
151•bookofjoe•2d ago•80 comments

How to check if your Apple Silicon Mac is booting securely

https://eclecticlight.co/2025/08/21/how-to-check-if-your-apple-silicon-mac-is-booting-securely/
40•shorden•3h ago•9 comments

Burner Phone 101

https://rebeccawilliams.info/burner-phone-101/
286•CharlesW•4d ago•108 comments

How many paths of length K are there between A and B? (2021)

https://horace.io/walks
16•jxmorris12•7h ago•2 comments

Everything I know about good API design

https://www.seangoedecke.com/good-api-design/
199•ahamez•8h ago•77 comments

Cloudflare incident on August 21, 2025

https://blog.cloudflare.com/cloudflare-incident-on-august-21-2025/
143•achalshah•2d ago•29 comments

Halt and Catch Fire Syllabus (2021)

https://bits.ashleyblewer.com/halt-and-catch-fire-syllabus/
105•Kye•6h ago•27 comments

Show HN: Clearcam – Add AI object detection to your IP CCTV cameras

https://github.com/roryclear/clearcam
164•roryclear•15h ago•47 comments

Using acetaminophen during pregnancy may increase childrens autism and ADHD risk

https://hsph.harvard.edu/news/using-acetaminophen-during-pregnancy-may-increase-childrens-autism-...
11•spchampion2•3h ago•0 comments

GNU cross-tools: musl-cross 313.3M

https://github.com/cross-tools/musl-cross
17•1vuio0pswjnm7•4h ago•2 comments

Iterative DFS with stack-based graph traversal (2024)

https://dwf.dev/blog/2024/09/23/2024/dfs-iterative-stack-based
27•cpp_frog•3d ago•3 comments

NASA's Juno mission leaves legacy of science at Jupiter

https://www.scientificamerican.com/article/how-nasas-juno-probe-changed-everything-we-know-about-...
65•apress•3d ago•27 comments

Stepanov's biggest blunder? The curious case of adjacent difference

https://mmapped.blog/posts/43-stepanovs-biggest-blunder
39•signa11•3d ago•8 comments

Comet AI browser can get prompt injected from any site, drain your bank account

https://twitter.com/zack_overflow/status/1959308058200551721
491•helloplanets•12h ago•173 comments

Bash Strict Mode

http://redsymbol.net/articles/unofficial-bash-strict-mode/
5•dcminter•2d ago•2 comments

OS Yamato lets your data fade away

https://github.com/osyamato/os-yamato
17•tsuyoshi_k•3d ago•13 comments

Claim: GPT-5-pro can prove new interesting mathematics

https://twitter.com/SebastienBubeck/status/1958198661139009862
118•marcuschong•4d ago•79 comments

Show HN: I Built a XSLT Blog Framework

https://vgr.land/content/posts/20250821.xml
31•vgr-land•9h ago•11 comments

Will at centre of legal battle over Shakespeare’s home unearthed after 150 years

https://www.theguardian.com/culture/2025/aug/21/will-at-centre-of-legal-battle-over-shakespeares-...
43•forthelose•1d ago•14 comments
Open in hackernews

Claim: GPT-5-pro can prove new interesting mathematics

https://twitter.com/SebastienBubeck/status/1958198661139009862
118•marcuschong•4d ago

Comments

brcmthrowaway•4d ago
Gamechanger! And worrisome for us laymen.
ac29•4d ago
In the thread, they note a human had already come up with (and published) an even better solution.
mrcwinn•7h ago
Before AI, but while you (and I) were still unable to contribute anything meaningful or novel to discussions of mathematics, did you feel threatened?
osti•4d ago
In here https://blog.google/products/gemini/gemini-2-5-deep-think/, the professor google worked with also claimed proving some previously unproven conjecture.
drudolph914•4d ago
interesting if true, but this isn't the first time we heard of something like this

quanta published an article that talked about a physics lab asking chatGPT to help come up with a way to perform an experiment, and chatGPT _magically_ came up with an answer worth pursuing. but what actually happened was chatGPT was referencing papers that basically went unread from lesser famous labs/researchers

this is amazing that chatGPT can do something like that, but `referencing data` != `deriving theorems` and the person posting this shouldn't just claim "chatGPT derived a better bound" in a proof, and should first do a really thorough check if it's possible this information could've just ended up in the training data

mhh__•9h ago
How would we know it was referencing an old paper versus almost everything trivial already having a derivation somewhere?
fwip•6h ago
One signal is to check the journal. Most reputable journals won't publish a paper claiming a new technique if it's actually trivial and well-known.
mhh__•3h ago
The "trivial" is slightly tongue in cheek. It must be trivial, I've just shown it!
martinpw•8h ago
> what actually happened was chatGPT was referencing papers that basically went unread from lesser famous labs/researchers

Which is actually huge. Reviewing and surfacing all the relevant research out there that we are just not aware of would likely have at least as much impact as some truly novel thing that it can come up with.

DennisP•7h ago
Maybe we should think of current AIs as not so much artificial intelligence, as collective intelligence. Which itself can be extremely valuable.
xigoi•6h ago
It turns out that if you use a fancy search engine to search instead of pretending that it’s intelligent, it will actually be good at its job. Who would have guessed?
leeoniya•8h ago
> but what actually happened was chatGPT was referencing papers that basically went unread from lesser famous labs/researchers

now let's invalidate probably 70% of all patents

jsw97•6h ago
I know this was a throwaway, but finding prior art for a large group of existing patents would be a cool application.
leeoniya•2h ago
it was half-serious.

if LLMs arent being used by https://patents.stackexchange.com/ or patent troll fighters, shame on them.

dinobones•4d ago
Are we sure this guy is not someone being mirrored by a recursive non-governmental system?

Context: https://x.com/GeoffLewisOrg/status/1945864963374887401

aeve890•8h ago
What does this even mean? This read like a SCP thing.
IceDane•8h ago
This is either satire that's over my head or mental illness.
StilesCrisis•8h ago
This is one of 4o’s biggest flaws. If you are a conspiracy theorist, it’ll confirm any outlandish theory you can come up with, and provide invented receipts to go with it. Of course, it’s just model hallucinations, but for those who are already primed to believe that secrets are being kept, it gives the “evidence” they were always looking for.
the_sleaze_•7h ago
"Your correction is correct! Jet fuel can "melt" steel beams because steel is a solid metal that requires heating to its melting point (around -30°C) to transform into a liquid..."
semi-extrinsic•8h ago
It is exactly SCP regurgitated by the LLM, and this guy thinks it's all true.
nybsjytm•3d ago
Any mathematicians who have actually called it "new interesting mathematics", or just an OpenAI employee?

The paper in question is an arxiv preprint whose first author seems to be an undergraduate. The theorem in it which GPT improves upon is perfectly nice, there are thousands of mathematicians who could have proved it had they been inclined to. AI has already solved much harder math problems than this.

offnominal•5h ago
The OpenAI employee posting this is a well known theoretical computer scientist: https://en.wikipedia.org/wiki/S%C3%A9bastien_Bubeck
alkyon•4h ago
Yes, he published a paper claiming GPT-4 has "sparks" of AGI. What else is he known for in the field of computer science?

https://arxiv.org/abs/2303.12712

NotOscarWilde•1h ago
Hello, TCS assistant professor here: he is legitimately respected among his peers.

Of course, because I am a selfish person, I'd say I appreciate most his work on convex body chasing (see "Competitively chasing convex bodies" on the Wikipedia link), because it follows up on some of my work.

Objectively, you should check his conference submission record, it will be a huge number of A*/A CORE rank conferences, which means the best possible in TCS. Or the prizes section on Wikipedia.

offnominal•41m ago
Not sure if you're trying to be provocative, but you could just click his name in the link you provided to find a lengthy list of arXiv preprints: https://arxiv.org/search/cs?searchtype=author&query=Bubeck,+...
marcuschong•3d ago
More comments from another mathematician:

https://x.com/ErnestRyu/status/1958408925864403068?t=QmTqOcx...

42lux•8h ago
I guess arithmetic is just harder for an LLM than higher math.
bubblyworld•8h ago
Arithmetic is harder for mathematicians than higher maths too =P not even joking. It was a meme in my university's maths dept for a reason.
therobots927•8h ago
it might take a while but their answer would always be correct. the same cannot be said for LLMs.
soulofmischief•8h ago
Mathematicians make calculations in their errors all the time.
soulofmischief•4h ago
Whoops, switched some words around on accident.
bubblyworld•8h ago
Yeah, of course I agree with that =)
PartiallyTyped•8h ago
In a group, you’d usually let the freshest handle splitting the bill because everyone else forgot arithmetic.
emmelaich•2h ago
https://en.wikipedia.org/wiki/57_(number)

aka the Grothendieck prime!

freshtake•8h ago
An interesting debate!

A few things to consider:

1. This is one example. How many other attempts did the person try that failed to be useful, accurate, coherent? The author is an OpenAI employee IIUC, so it begs this question. Sora's demos were amazing until you tried it, and realized it took 50 attempts to get a usable clip.

2. The author noted that humans had updated their own research in April 2025 with an improved solution. For cases where we detect signs of superior behavior, we need to start publishing the thought process (reasoning steps, inference cycles, tools used, etc.). Otherwise it's impossible to know whether this used a specialty model, had access to the more recent paper, or in other ways got lucky. Without detailed proof it's becoming harder to separate legitimate findings from marketing posts (not suggesting this specific case was a pure marketing post)

3. Points 1 and 2 would help with reproducibility, which is important for scientific rigor. If we give Claude the same tools and inputs, will it perform just as well? This would help the community understand if GPT-5 is novel, or if the novelty is in how the user is prompting it

energy123•8h ago
4. How many times has this happened already but the human took credit for the output because they don't have the incentive to give credit to the LLM
OtomotO•8h ago
Yeah, how many times?

How many times did a stochastic parrot by pure chance bring words into an order that made up a new proof?

And why should a stochastic parrot get any credit?

AaronAPU•8h ago
Are you referring to the person or the LLM?
DonHopkins•7h ago
How many times have you stochastically parroted that term?
ds-slope•1h ago
I don’t think it’s that they don’t have the incentive. I think it’s because it’s unclear if you give credit to the LLM if that means that OpenAI or similar would be considered an author in which case that could really screw up intellectual property and make using LLMs much less attractive. If the LLM wants attribution then it’s sentient, and if it’s sentient, it may be given personhood (Johnny-five scenario) and get rights, and then it would be a writer, and it could influence the license and intellectual property may belong partially to it unless it willingly became and employee of a ton of companies and organizations or contracted with them.
sothatsit•30m ago
I'd say a lot of people even have an incentive to not give credit to the LLMs, because there is a social stigma associated with using AI, due to its association with low-quality work.
foobarqux•8h ago
> This is one example. How many other attempts did the person try that failed to be useful, accurate, coherent?

High chance given that this is the same guy that came up with SVG unicorn (sparks of AGI) which raises the same question even more obviously.

bawolff•6h ago
> This is one example. How many other attempts did the person try that failed to be useful, accurate, coherent? The author is an OpenAI employee IIUC, so it begs this question. Sora's demos were amazing until you tried it, and realized it took 50 attempts to get a usable clip.

If you could combine this with automated theorem proving, it wouldn't matter if it was right only 1 out of a 1000 times.

aabhay•8h ago
I don’t get why so many people are resistant to the concept that AI can prove new mathematical theorems.

The entire field of math is fractal-like. There are many, many low hanging fruits everywhere. Much of it is rote and not life changing. A big part of doing “interesting” math is picking what to work on.

A more important test is to give an AI access to the entire history of math and have it _decide_ what to work on, and then judge it for both picking an interesting problem and finding a novel solution.

tcshit•8h ago
I like the idea of letting AI try to formulate new math problems that are interesting, i.e. worthy research level. I guess we are still a number of iterations away till AI get there though..
xenotux•8h ago
I think a simple way to take emotion out of this is to ask if a computer can beat humans at math. The answer to that is pretty much "duh". Symbolic solvers and numerical methods outperform humans by a wide margin and allow us to reach fundamentally new frontiers in mathematics.

But it's a separate question of whether this is a good example of that. I think there is a certain dishonesty in the tagline. "I asked a computer to improve on the state-of-the-art and it did!". With a buried footnote that the benchmark wasn't actually state-of-the-art, and that an improved solution was already known (albeit structured a bit differently).

When you're solving already-solved problems, it's hard to avoid bias, even just in how you ask the question and otherwise nudge the model. I see it a lot in my field: researchers publish revolutionary results that, upon closer inspection, work only for their known-outcome test cases and not much else.

Another piece of info we're not getting: why this particular, seemingly obscure problem? Is there something special about it, or is it data dredging (i.e., we tried 1,000 papers and this is the only one where it worked)?

SkyPuncher•8h ago
For me it comes down to signal vs noise.

I’m absolutely confident that AI/LLM can solve things, but you have to shift through a lot of crap to get there. Even further, it seems AI/LLM tend to solve novel problems in very unconventional ways. It can be very hard to know if an attempt is doomed, or just one step away from magic.

teeray•7h ago
At that point, is it really solving or is it just monkeys with typewriters?
fwip•6h ago
"Monkeys with typewriters," is in one sense, a uniform sampling of the probability space. A brute-force search, even when using structured proof assistants, take a very long time to find any hard proof, because the possibility space is roughly (number of terms) raised to the power of (length of the proof).

But similarly to how a computer plays chess, using heuristics to narrow down a vast search space into tractable options, LLMs have the potential to be a smarter way to narrow that search space to find proofs. The big question is whether these heuristics are useful enough, and the proofs they can find valuable enough, to make it worth the effort.

foobarqux•7h ago
As others have said computers already help prove theorems like the four color theorem. It’s not that shocking that LLMs can prove a relative handful of obscure theorems. An alpha-theorem (neural net directed “brute force” search) type system will probably also be able to prove some theorems. There is no evidence today that there will be a massive breakthrough in math due to those systems let alone through LLM type systems.

If LLMs were already a breakthrough in proving theorems, even for obscure minor theorems, there would be a massive increase in published papers due to publish or perish academic incentives.

whymauri•8h ago
I used to work at a drug discovery startup. A simple model generating directly from latent space 'discovered' some novel interactions that none of our medicinal chemists noticed e.g. it started biasing for a distribution of molecules that was totally unexpected for us.

Our chemists were split: some argued it was an artifact, others dug deep and provided some reasoning as to why the generations were sound. Keep in mind, that was a non-reasoning, very early stage model with simple feedback mechanisms for structure and molecular properties.

In the wet lab, the model turned out to be right. That was five years ago. My point is, the same moment that arrived for our chemists will be arriving soon for theoreticians.

svantana•8h ago
Interesting! Depending on your definition, "automated invention" has been a thing since at least the 1990's. An early success was the evolved antenna [1].

1. https://en.wikipedia.org/wiki/Evolved_antenna

hhh•6h ago
IBM has done this with pharmaceuticals for ages no? That’s why they have patents on what would be the next generation of ADHD medications e.g. 4F-MPH?
kmarc•7h ago
Reminds me of this story on the Babbage podcast a month ago:

https://www.economist.com/science-and-technology/2025/07/02/...

My understanding is, iterating on possible sequences (of codons, base pairs, etc) is exactly what LLMs, these feedback-looped predictor machines, are especially great at. With the newest models, those that "reason about" (check) their own output, are even better at it.

wenc•7h ago
A lot of interesting possibilities lie in latent space. For those unfamiliar, this means the underlying set of variables that drive everything else.

For instance, you can put a thousand temperature sensors in a room, which give you 1000 temperature readouts. But all these temperature sensors are correlated, and if you project them down to latent space (using PCA or PLS if linear, projection to manifolds if nonlinear) you’ll create maybe 4 new latent variables (which are usually linear combinations of all other variables) that describe all the sensor readings (it’s a kind of compression). All you have to do then is control those 4 variables, not 1000.

In the chemical space, there are thousands of possible combinations of process conditions and mixtures that produce certain characteristics, but when you project them down to latent variables, there are usually less than 10 variables that give you the properties you want. So if you want to create a new chemical, all you have to do is target those few variables. You want a new product with particular characteristics? Figure out how to get < 10 variables (not 1000s) to their targets, and you have a new product.

timClicks•7h ago
It's been a while since I've played in the area, but is PCA still the go to method for dimensionality reduction?
wenc•6h ago
PCA (essentially SVD) the one that makes the fewest assumptions. It still works really well if your data is (locally) linear and more or less Gaussian. PLS is the regression version of PCA.

There are also nonlinear techniques. I’ve used UMAP and it’s excellent (particularly if your data approximately lies on a manifold).

https://umap-learn.readthedocs.io/en/latest/

The most general purpose deep learning dimensionality reduction technique is of course the autoencoder (easy to code in PyTorch). Unlike the above, it makes very few assumptions, but this also means you need a ton more data to train it.

baq•6h ago
PCA is nice if you know relationships are linear. You also want to be aware of TSNE and UMAP.
wenc•6h ago
A lot of relationships are (locally) linear so this isn’t as restrictive as it might seem. Many real-life productionized applications are based on it. Like linear regression, it has its place.

T-SNE is good for visualization and for seeing class separation, but in my experience, I haven’t found it to work for me for dimensionality reduction per se (maybe I’m missing something). For me, it’s more of a visualization tool.

On that note, there’s a new algorithm that improves on T-SNE called PaCMAP which preserves local and global structures better. https://github.com/YingfanWang/PaCMAP

a_bonobo•2h ago
There's also Bonsai, it's parameter-free and supposedly 'better' than t-SNE, but it's clearly aimed at visualisation purposes (except that in Bonsai trees, distances between nodes are 'real' which is usually not the case in t-SNE)

https://www.biorxiv.org/content/10.1101/2025.05.08.652944v1....

whymauri•4h ago
At the end of the generative funnel we had a filter and it used (roughly) the mechanism you're describing.

https://www.pnas.org/doi/10.1073/pnas.1611138113

You summarized it very well!

ACCount37•6h ago
Hallucinations or inhuman intuition? An obvious mistake made by a flawed machine that doesn't know the limits of its knowledge? Or a subtle pattern, a hundred scattered dots that were never connected by a human mind?

You never quite know.

Right now, it's mostly the former. I fully expect the latter to become more and more common as the performance of AI systems improves.

pojzon•6h ago
If AI comes up with new drugs or treatments - does it mean its a public knowledge and cant be copyrighted ?

Wouldnt that mean a fall of US pharmaceutical conglomate based on current laws about copyright and AI content?

selkin•5h ago
Drugs discovered by humans are not under the protections of copyright as well.
apimade•4h ago
Warning the below comment comes from someone who has no formal science degree, and just enjoys reading articles on the topic.

Similar for physicists, I think there’s a very confusing/unconventional antenna called the “evolved antenna” which was used on a NASA spacecraft. The idea behind it was supported from genetic programming. The science or understanding “why” the way the antenna bends at different areas supporting increased gain is not well understood by us today.

This all boils down to empirical reasoning, which underlies the vast majority of science (or science adjacent fields like software engineering, social sciences etc).

The question I guess is; does LLMs, “AI”, ML give us better hypothesis or tests to run to support empirical evidence-based science breakthroughs? The answer is yes.

Will these be substantial, meaningful or create significant improvements on today’s approaches?

I can’t wait to find out!

brandonb•4h ago
This is really cool. Have you (or your colleagues) written anything about what you learned about ML for drug discovery?
lukev•12m ago
Ok but I have to point out something important here. Presumably, the model you're talking about was trained on chemical/drug inputs. So it models a space of chemical interactions, which means insights could be plausible.

GPT-5 (and other LLMs) are by definition language models and though they will happily spew tokens about whatever you ask, they don't necessarily have the training data to properly encode the latent space of (e.g) drug interactions.

Confusing these two concepts could be deadly.

strangescript•8h ago
It can't reason -> It can't make new discoveries -> It can only tie together bespoke missed data -> It can make some basic discoveries -> ??????
ACCount37•6h ago
It doesn't outsmart the entirety of humankind combined, so it's not actually intelligent. Duh.
mikert89•8h ago
I cannot wait that all we hold to be holy and sacred about the human mind, to be slowly unravelled by ai. It will remove the chains of the status associated with these fields, and allow people to move into higher modes of being
bgwalter•8h ago
Yes, that is why the chess world championship allows Stockfish assistance in order to democratize chess.
sunrunner•7h ago
What are these higher modes? I'm very excited to hear about them.
meroes•1h ago
Including techbros thinking they have to answer to every question humanity has ever asked?
yapyap•7h ago
Claim: a randomizer can prove new mathematics as long as you keep checking every single one
croes•7h ago
I wanted to know how to set the environment variables for CGI in IIS. The GPT 5 thoughts made a totally unrelated picture and then gave the wrong answer.
EcommerceFlow•6h ago
The coolest part about this IMO is they used the same model we all have access to (GPT5-Pro), and not some secret invite only model.
eru•1h ago
Alas, GPT-5 Pro (and friends) will also happily and confidently give you nonsense proofs of supposed theorems.

But yes, it's getting better and better.

krnsll•1h ago
If you think of this as a search, retrieval and “application” problem on the space of convex optimization proof techniques, it’s not a particularly striking result to a mathematician. Partly because: the space of results/techniques and crucially applications of those results and proof techniques is very rich (it’s an active field with many follow up papers).

On the other hand, I have a collection of unpublished results in less active fields that I’ve tested every frontier model on (publicly accessible and otherwise) and each time the models have failed to solve them. Some of these are simply reformulations of results in the literature that the models are unable to find/connect which is what leads me to formulate this as a search problem with the space not being densely populated enough in this case (in terms of activity in these subfields).

hodgehog11•1h ago
I'm not sure why this is surprising or newsworthy; it has been this way ever since o3. I guess few people noticed.

There are a few masters-level publishable research problems that I have tried with LLMs on thinking mode, and it had produced a nearly complete proof before we had a chance to publish it. Like the problem stated here, these won't set the world on fire, but they do chip away at more meaningful things.

It often doesn't produce a completely correct proof (it's a matter of luck whether it nails a perfect proof), but it very often does enough that even a less competent student can fill in the blanks and fix up the errors. After all, the hardest part of a proof is knowing which tools to employ, especially when those tools can be esoteric.