
Dumb statistical models, always making people look bad

https://statmodeling.stat.columbia.edu/2025/04/18/dumb-statistical-models-always-making-people-look-bad/
118•hackandthink•4mo ago

Comments

delichon•4mo ago
> why it’s often hard to demonstrate the value of human knowledge once you have a decent statistical model.

This seems to be a near restatement of the bitter lesson. It's not just that large enough statistical models outperform algorithms built from human expertise, they also outperform human expertise directly.

gopalv•4mo ago
> they also outperform human expertise directly

When measured statistically.

This is not a takedown of that statement, but the reason we've trouble with this idea is that it works in the lab and not always in real life.

To set up a clean experiment, you have to define what success looks like before you conduct the experiment, i.e. the output variable has to be defined.

Once you know what to measure ahead of time to determine success, then statistical models tend to not be as random as a group of humans in achieving that target.

The variance is bad in an experiment, but variance jitter is needed in an ever changing world even if most variants are worse off.

For example, if you can predict someone's earning potential from their birth zipcode, it is not wrong and often more right than otherwise.

And then if you base student loans and business loan interest rates on the basis of birth zipcodes, the original prediction does become more right.

In the experimental version that's a win, but in real life that's a terrible loss to society.

bobsomers•4mo ago
> > they also outperform human expertise directly

> When measured statistically.

THANK YOU. It's mildly infuriating how often people forget that one of the things most human experts are good at is knowing when they are looking at something that is likely in distribution vs. out of distribution (and thus, updating their priors).

jonahx•4mo ago
The original article discusses this explicitly.
AstralStorm•4mo ago
Ah yes, the self fulfilling prophecies or hallucinations based on models trained on models. Overfitting. Ending up in an evolutionary dead end...

The Type 4 error of not asking a question one should have asked also exists.

So thing is, suppose you're handling the common cases right - you have software that's say 95% correct. The important bit is how critical the remaining 5% failures are. If one of them happens to be "I give up my computer and data to the exploit" or "everything is destroyed" or "a lot of people die", then the extra 1% better average is no good to any inside observer.

It so happens that a lot of people believe themselves to be outside observers, especially the rich.

(What's the success bonus for someone getting treated nicely?)

nitwit005•4mo ago
You don't even need a statistical model. We make checklists because we know we'll fail to remember to check things.

Humans are tool users. If you make a statistical table to consult for some medical issue, you're using a tool.

taeric•4mo ago
I was going to say that it doesn't have to be a statistical model. Notable that statistical models are already seen as less than complete analytical models, for many people. (I think that is almost certainly a poor way of wording it? Largely just trying to say that F=ma and such are also models that don't have conditional answers.)

At any rate, I'm curious about some of the readings this post brings up. I'm also vaguely remembering that humans can have some odd behaviors where requiring justification or reasoning of decisions can sometimes provide more predictable decisions; but at a cost that you may not fully explore viable decisions.

dominicq•4mo ago
As a matter of practicality, it seems that you professionally now want to be firmly in the tails of the data distribution for your field, e.g. expert in those things that happen rarely.

Or maybe even be in a domain which, for whatever reason, is poorly represented by a statistical model, something where data points are hard to get.

genewitch•4mo ago
> expert in those things that happen rarely

Replacement bolt: 15¢ Knowing which bolt had to be replaced: $9,999.85

rawgabbit•4mo ago
OTOH. The blog mentions that humans excel at novel situations. Such as when there is little training data, when envisioning alternate outcomes, or when recognizing the data is wrong.

The most recent example I can think of is "Frank". In 2021, JPMorgan Chase acquired Frank, a startup founded by Charlie Javice, for $175 million. Frank claimed to simplify the FAFSA process for students. Javice asserted the platform had over 4 million users, but in reality, it had fewer than 300,000. To support her claim, she allegedly hired a data science professor to generate synthetic data, creating fake user profiles. JPMorgan later discovered the discrepancy when a marketing campaign revealed a high rate of undeliverable emails. In March 2025, Javice was convicted of defrauding JPMorgan.

IMO a data expert could have recognized the fake user profiles through the fact he has seen e.g., how messy real data is, knows the demographics of would-be users of a service like Frank (wealthy, time-stressed families), knows telltale signs of fake data (clusters of data that follow obvious "first principles").

willvarfar•4mo ago
> a data expert could have recognized the fake user profiles through the fact he has seen e.g., how messy real data is, knows the demographics of would-be users of a service like Frank (wealthy, time-stressed families), knows telltale signs of fake data

perhaps the data science professor who generated the fake data was quite well versed in all this and put effort into deliberately adding messiness and skew etc?

3abiton•4mo ago
It's unfortunate how underappreciated statistics is. In nearly all the (non-academic) positions I've held, mostly in technical roles interacting with non-technical stakeholders, anecdotal evidence always took priority over statistically backed data in decision making. It's absurd sometimes.
bsder•4mo ago
This is because the correct answer is rarely the politically palatable answer.
TheAceOfHearts•4mo ago
Anecdotally, the way I've heard many stats related tools described is as follows: if the tool confirms something that we already knew then it's a waste of time or money because it doesn't tell us anything new, and if it doesn't agree with what we already know then it's obviously wrong.

I don't think it's a trivial problem though. It's notoriously easy to twist stats to sell any narrative. And Goodhart's Law all but guarantees that any meaningful metric will get hacked.

gwern•4mo ago
> There are a few ways to look at this from the standpoint of information that is available to the decision-maker. One is that human knowledge is valuable for guiding developing the model, but once you have a statistical model, it’s a better aggregator of the information. This is echoed by research on judgmental bootstrapping (https://gwern.net/doc/statistics/decision/1974-dawes.pdf), where a statistical model trained on a human expert’s past judgments will tend to outperform that expert.

By the way, note that this applies to LLMs too. One of the biggest pons asinorums that people get hung up on is the idea that "it just imitates the data, therefore, it can never be better than the average datapoint (or at least, best datapoint); how could it possibly be better?"

Well, we know from a long history that this is not that hard: humans make random errors all the time, and even a linear model with a few parameters or a little flowchart can outperform them. So it shouldn't be surprising or a mystery if some much more complicated AI system could too.
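The judgmental bootstrapping effect described above can be illustrated with a small simulation. This is only a sketch under made-up assumptions: a single cue, a hypothetical expert who uses the right rule but applies it inconsistently, and Gaussian noise. None of the numbers or names come from the Dawes paper.

```python
import random

def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def mse(preds, targets):
    """Mean squared error between two equal-length lists."""
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(preds)

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical setup: one cue drives the outcome, plus irreducible noise.
cues = [rng.uniform(0, 10) for _ in range(500)]
outcomes = [2.0 * x + rng.gauss(0, 1) for x in cues]

# The simulated expert knows the rule but is inconsistent from case to case.
expert = [2.0 * x + rng.gauss(0, 1) for x in cues]

# Fit the model to the expert's JUDGMENTS, never to the true outcomes.
slope, intercept = fit_line(cues, expert)
bootstrapped = [slope * x + intercept for x in cues]

expert_mse = mse(expert, outcomes)
model_mse = mse(bootstrapped, outcomes)
print(f"expert MSE: {expert_mse:.2f}, model-of-expert MSE: {model_mse:.2f}")
```

In this toy setup the model of the expert scores better than the expert because the fit averages away the expert's case-by-case inconsistency while keeping the underlying rule. It says nothing about cases where the expert sees cues the model does not.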

AIPedant•4mo ago
> One of the biggest pons asinorums that people get hung up on is the idea that "it just imitates the data, therefore, it can never be better than the average datapoint (or at least, best datapoint); how could it possibly be better?"

Hmm - the phrasing that perhaps holds more water is that LLMs just imitate the data, which means that novel ideas / code tend to be smashed against the force of averaging when fed into an LLM. E.g. NotebookLM summaries/podcasts are good infotainment but they tend to flatten unconventional paragraphs into platitudes or common wisdom. Obviously this is very subjective and hard to benchmark.

airstrike•4mo ago
> Obviously this is very subjective and hard to benchmark.

I agree, but it also feels very obvious once you've been exposed to it enough times. The internet is full of written or spoken AI slop that can generally be spotted with ease by trained eyes and ears.

jon_richards•4mo ago
The problem with making a bear-proof trash can is that there's significant overlap between the smartest bears and the dumbest tourists.
roenxi•4mo ago
> and even a linear model with a few parameters

Using a simple average of past performance to predict future performance is also a technique that is often disturbingly effective vs. standard practice. I suppose technically that is a linear model, but really deserves its own class.

AstralStorm•4mo ago
Up to a point where the prediction runs afoul of the time horizon and changing unmodelled circumstances.

They do not have sufficient explicit risk or variance management. Makes them highly fragile. There are more robust versions of the estimators... Still have a problem.

Remember 2008? That market ran on these easy models.

gwern•4mo ago
Yes, exponential smoothing in forecasting is another classic example of the robustness of simple models. You can throw all your fancy ARIMAs and Box-Cox transforms at a time-series, and much of the time, it is hard to distinguish from a simple moving average.

Specifically, the Makridakis M forecasting competitions (https://en.wikipedia.org/wiki/Makridakis_Competitions) have shown for a long time that beating the baselines is shockingly difficult.

In fact, classic machine learning only really started to convincingly win with the second-to-last, M5: https://www.sciencedirect.com/science/article/pii/S016920702... ; and neural methods only just sort of began working with the latest one, M6: https://www.sciencedirect.com/science/article/pii/S016920702... . (Possibly with M7 we'll see scaled-up meta-learning Transformers finally start beating the Bayesian or decision-tree forecasters. But I don't know if or when a M7 might be held.)
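For reference, the two baselines mentioned above (a simple moving average and simple exponential smoothing) fit in a few lines. This is a minimal sketch: the function names, the sample series, and the choices of `window` and `alpha` are made up for illustration and are not taken from the M competitions.

```python
def moving_average_forecast(series, window):
    """One-step-ahead forecast: the mean of the last `window` observations."""
    if window <= 0 or window > len(series):
        raise ValueError("window must be between 1 and len(series)")
    return sum(series[-window:]) / window

def exponential_smoothing_forecast(series, alpha):
    """One-step-ahead forecast from simple exponential smoothing.

    The level starts at the first observation and is updated as
    level = alpha * value + (1 - alpha) * level for each later value.
    """
    if not series:
        raise ValueError("series must not be empty")
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    level = series[0]
    for value in series[1:]:
        level = alpha * value + (1 - alpha) * level
    return level

# Hypothetical toy series, just to exercise both functions.
demand = [10, 12, 11, 13]
print(moving_average_forecast(demand, window=2))
print(exponential_smoothing_forecast(demand, alpha=0.5))
```

Both forecasters have a single tuning knob each, which is part of why they are hard to overfit and serve well as baselines.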

senkora•4mo ago
> pons asinorums

This is a new one for me, so, in the spirit of the article, I will "act in the world to acquire more information as needed".

> An obstacle which will defeat a beginner or foolish person. [from 17th c.]

> From New Latin pons asinorum, from Latin pōns (“bridge”) + genitive plural of asinus (“donkey”). Literally, “bridge of donkeys”.

https://en.wiktionary.org/wiki/pons_asinorum

mwkaufma•4mo ago
User "Anoneuoid" from the source's own comment thread:

  There is another aspect here where those averaged outcomes are also the output of statistical models. So it is kind of like asking whether statistical models are better at agreeing with other statistical models than humans.
AstralStorm•4mo ago
You need to compare on both different variables and additionally produce actual error estimates on the comparison.

Say, suppose you're measuring successful treatments. You would have to use both the count, perhaps even signed (subtracting abject failures such as deaths), and the cost (financial or number of visits), then verify these numbers with a follow-up.

See, the definition of success is critical here. OR and NNT are not evaluating side effects negatively, for example.

So it may turn out that you're comparing completely different ideas of better instead of matching models.
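To make the point about OR and NNT concrete, here is a sketch of how the two are computed from a two-arm trial. The trial counts are hypothetical, and "events" means bad outcomes. Neither formula has any input for side effects, so two treatments with very different harm profiles can get identical scores.

```python
def odds_ratio(events_treated, n_treated, events_control, n_control):
    """Odds of a bad outcome under treatment divided by the odds under control."""
    odds_treated = events_treated / (n_treated - events_treated)
    odds_control = events_control / (n_control - events_control)
    return odds_treated / odds_control

def number_needed_to_treat(events_treated, n_treated, events_control, n_control):
    """NNT = 1 / absolute risk reduction (undefined when the risks are equal)."""
    risk_reduction = events_control / n_control - events_treated / n_treated
    return 1 / risk_reduction

# Hypothetical trial: 100 patients per arm, 10 bad outcomes treated vs 20 control.
trial = dict(events_treated=10, n_treated=100, events_control=20, n_control=100)

# Side effects are recorded here only to show that neither metric reads them.
side_effects_in_treated_arm = 35

print(f"OR  = {odds_ratio(**trial):.3f}")
print(f"NNT = {number_needed_to_treat(**trial):.1f}")
```

Changing `side_effects_in_treated_arm` to any value leaves both numbers untouched, which is the sense in which these metrics encode one particular idea of "better".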

whatever1•4mo ago
At least when humans are wrong we own it. Statistical models can be wrong 100% of the times you used them and the claim is ‘oh this is how statistics work, you did not query the model infinite times’.

My point is that on many occasions being right on average is less important than being right on the tail.
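A toy comparison shows how the average and the tail can disagree. The error values below are invented purely for illustration: predictor B is better on average but has one catastrophic miss.

```python
def mean(values):
    """Arithmetic mean of a non-empty list."""
    return sum(values) / len(values)

# Hypothetical absolute errors over 100 cases.
errors_a = [1.0] * 100          # consistently mediocre
errors_b = [0.5] * 99 + [40.0]  # usually better, with one disastrous case

for name, errors in (("A", errors_a), ("B", errors_b)):
    print(f"{name}: mean error = {mean(errors):.3f}, worst case = {max(errors):.1f}")
```

Ranking by mean error picks B; ranking by worst case picks A. Which ranking matters depends on how costly the single bad case is, and the aggregate score alone cannot tell you that.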

vintermann•4mo ago
> Minimizing loss over aggregates is what a statistical model is designed to do, so if you evaluate human judgment against statistical predictions in aggregate on data similar to what the model was trained on, then you should expect statistical prediction to win

This reminds me of the many years machine translation was evaluated on BLEU towards reference translations, because they didn't know any better ways. Turns out that if you measure translation quality by n-gram precision towards a reference translation, then methods based on n-gram precision (such as the old pre-NMT Google translate) were really hard to beat.
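The n-gram precision at the core of BLEU can be sketched as follows. This is only the clipped ("modified") precision component for a single reference; full BLEU also combines several n-gram orders and applies a brevity penalty, both omitted here. The sentences are standard toy examples, not real system output.

```python
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate against one reference.

    Each candidate n-gram is credited at most as many times as it
    appears in the reference.
    """
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return clipped / total

reference = "the cat is on the mat".split()
candidate = "the cat sat on the mat".split()

print(modified_precision(candidate, reference, 1))  # unigram precision
print(modified_precision(candidate, reference, 2))  # bigram precision
```

A system that directly optimizes n-gram overlap is, by construction, favored by a metric like this, regardless of whether a human reader would prefer its output.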

reedf1•4mo ago
If there is no human-explainable reason a model has made a prediction, and it's just a statistical blob in multi-dimensional feature space (which we cannot introspect), then perceived improvement over humans is simply overfitting. It will be extremely good at finding the median issue, or following a decision tree in a more exacting way than a human. What a human can do is expand the degrees of freedom of their internal model at will, integrate out-of-sample data, and have a natural human bias toward the individual at the expense of the median. I'd rather have that...
bicepjai•4mo ago
Someone had to say this. All models are dumb, but some are useful.
kreyenborgi•4mo ago
Versus https://predictive-optimization.cs.princeton.edu/