
Anscombe's Quartet

https://en.wikipedia.org/wiki/Anscombe%27s_quartet
133•gidellav•5mo ago

Comments

djoldman•5mo ago
A classic.

See also:

https://en.wikipedia.org/wiki/Datasaurus_dozen

djoldman•5mo ago
The scary thing is that yea we can see these in 2D and maybe 3D. But ...

usually there are more than 2 or 3 columns in our data :(

imurray•5mo ago
It's clearly hard, but there are tools for doing exploratory visualization of high-dim data. GGobi http://ggobi.org/ and all the ones that arrange points but try to get local neighborhoods correct (t-sne, umap, et al.).
lamename•5mo ago
Yeah, but still "scary" because you have to be really careful to not fool yourself and pay attention even with those algorithms. For example, a good demonstration with tsne https://distill.pub/2016/misread-tsne/?hl=cs
sunrunner•5mo ago
Content warning: This is a baker’s dozen not a regular dozen, in case anyone clicks through expecting to find twelve and is mildly and briefly perturbed.
dejj•5mo ago
“The Datasaurus Dozen”:

https://blog.revolutionanalytics.com/2017/05/the-datasaurus-...

efavdb•5mo ago
The example shows that the usual stats aren't enough to pin down the true data. But in practice I imagine / wonder if these stats really are reasonable "sufficient stats" because the probability of seeing data with strong structure is unlikely in most contexts. In other words...

p(data | stats) = p(stats | data) * p(data) / p(stats).

and p(data) is only strong for a "blob / cloud" of points, so when there's some correlation the observed stats tell you that you likely have a blob having some degree of correlation.

aredox•5mo ago
>But in practice I imagine / wonder if these stats really are reasonable "sufficient stats" because the probability of seeing data with strong structure is unlikely in most contexts.

We just spent five years since COVID appeared to argue about statistics, with tons of bad analysis of very complicated data fuelling political rage up to this day.

The US health secretary is currently using data with "strong structure" to deny vaccines and to falsely pin down convenient targets for everything from cancer to autism.

throw0101d•5mo ago
Thought this would be about the 'other' Anscombe:

* https://en.wikipedia.org/wiki/G._E._M._Anscombe

:)

pablobaz•5mo ago
Or:

https://en.wikipedia.org/wiki/Gareth_Anscombe

:-)

flpm•5mo ago
And check this one, which is a generalization of the Datasaurus where you can define your own shapes :D

https://github.com/stefmolin/data-morph

moi2388•5mo ago
From now on I won’t trust any statistic unless I can transform it into a panda.
jihadjihad•5mo ago
Often there is little or no substitute for plotting the data to see how it is distributed. A scatter plot, histogram, density plot, etc. is almost always going to tell you a "story" about the data that the summary stats will have compressed.

But sometimes you are at the mercy of the data and your visualization of choice. Box plots, for example, are great at showing more than just how the data is centered, but it is possible to encounter situations where the box plots of the data remain static while the underlying data is clearly changing [0].

As always it is good to know about these things and continue to add to the arsenal (violin plots, in the example above) of tools and intuition needed to tease out the story behind the data.

0: https://www.research.autodesk.com/publications/same-stats-di...

ryukoposting•5mo ago
I do STEM mentoring for high school kids. Bookmarking this, because it'll be a great teaching aid at some point.
__mharrison__•5mo ago
I teach curve fitting with this dataset and recently added a fifth dataset. It illustrates Simpson's paradox.

https://www.linkedin.com/posts/panela_loved-adding-ancombes-...

aleyan•5mo ago
That's an amazing addition! Once I read about Simpson's paradox[0], I couldn't help but see it or suspect it everywhere. Luckily, it is not a true paradox, and it can be resolved if the underlying data is available and not just summary statistics.

I recommend putting together the Quintet in one image, so that the original 4 charts plus the new one are all visible and interpretable together. It will be a learning aid for decades to come.

[0] https://en.wikipedia.org/wiki/Simpson's_paradox
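A minimal sketch of why the underlying data resolves it, using made-up numbers (the two groups and their values are hypothetical, chosen only so the effect is easy to see): each group trends downward on its own, while the pooled fit trends upward.

```python
def slope(xs, ys):
    """Least-squares slope of y on x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Hypothetical data: two groups, each with a falling trend.
group_a = ([1, 2, 3], [5, 4, 3])
group_b = ([6, 7, 8], [10, 9, 8])

slope_a = slope(*group_a)
slope_b = slope(*group_b)

# Pooling the groups hides the grouping variable and flips the sign.
pooled_x = group_a[0] + group_b[0]
pooled_y = group_a[1] + group_b[1]
slope_pooled = slope(pooled_x, pooled_y)

print(f"group A: {slope_a:+.2f}, group B: {slope_b:+.2f}, pooled: {slope_pooled:+.2f}")
```

With only the pooled summary there is no way to tell the groups apart; with the raw rows and the group label the reversal is visible immediately.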

__mharrison__•5mo ago
Yes, not saying the data dinosaur isn't cool. But for real-world applications, the quartet with the addition of this fifth dataset is more useful for pedagogical purposes.
INGELRII•5mo ago
Always visualize first. Human 'eyeballing' is a good pattern detector.

Linear correlation is just one pattern the data can have.

Unfortunately, many social science publications have reviewers who know only the basics and can't judge or accept statistically valid analysis that is outside their competence. Fit it to a line or nothing.

joshdavham•5mo ago
During my statistics degree, Anscombe’s Quartet was used as an example of why you should always try to visualize your dataset and not just run your calculations blindly. I’m a bit odd in that I don’t care much for data viz, but Anscombe’s Quartet really shows how important it is in practice.
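A minimal sketch of what "running the calculations blindly" looks like here, using the quartet's published values and plain Python (the helper name `summarize` is just for illustration):

```python
from math import sqrt

# Anscombe's quartet (Anscombe, 1973). Sets I-III share the same x values.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

def summarize(xs, ys):
    """Means, sample variances, Pearson correlation and least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return {
        "mean_x": mean_x,
        "mean_y": mean_y,
        "var_x": sxx / (n - 1),
        "var_y": syy / (n - 1),
        "corr": sxy / sqrt(sxx * syy),
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }

SUMMARIES = {name: summarize(xs, ys) for name, (xs, ys) in QUARTET.items()}

for name, stats in SUMMARIES.items():
    print(name, {key: round(value, 2) for key, value in stats.items()})
```

Rounded to two decimals, the four rows come out nearly identical (mean of x is 9, fitted line roughly y = 3 + 0.5x), which is exactly why the numbers alone give no hint that the four scatter plots look nothing alike.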
WhitneyLand•5mo ago
This reminds me that “visualize while thinking” will probably become an important part of reasoning as we move closer to AGI models.

This will require improvements to vision models, RL frameworks, etc, but will be interesting to see how much it can broaden current abilities.

jkyrlach•5mo ago
This dataset is definitely a treasure, and I love visualizing data. That said, I think what's missed when this is used as an argument for visual analysis is the idea of quantitatively identified outliers. If you take the descriptive statistics of p99, they most definitely will not be the same across these four sets. Visual analysis is a valuable dimension for data exploration, but it's a bit of a strawman to imply that "quantitative analysis could go no further, only visual analysis could figure this out".
divbzero•5mo ago
I know this is against the main point of Anscombe’s Quartet but just curious: Could skewness or other summary statistics differentiate the four distributions?
dccsillag•5mo ago
Take enough moments and you'll be able to differentiate any distributions.
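A rough sketch of that idea on the quartet's y values, assuming population skewness (third central moment divided by the variance to the power 1.5) plus the median and maximum as the extra statistics. With only 11 points per set the higher moments are noisy, so treat this as an illustration rather than a test.

```python
# y values of Anscombe's quartet (Anscombe, 1973).
Y = {
    "I": [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68],
    "II": [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74],
    "III": [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73],
    "IV": [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89],
}

def skewness(values):
    """Population skewness: m3 / m2**1.5, using central moments."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

def median(values):
    """Middle value of the sorted data (average of the two middle values if even)."""
    ordered = sorted(values)
    mid = len(ordered) // 2
    if len(ordered) % 2:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

EXTRA = {
    name: {"median": median(ys), "max": max(ys), "skew": skewness(ys)}
    for name, ys in Y.items()
}

for name, stats in EXTRA.items():
    print(name, {key: round(value, 2) for key, value in stats.items()})
```

The means and variances of y agree to about two decimals, but the medians, maxima and skewness values do not, so a slightly longer list of summary statistics already separates the four sets.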
padraigf•5mo ago
I love it. I was introduced to it by Edward Tufte's book, 'https://www.amazon.co.uk/Visual-Display-Quantitative-Informa...'.

And was just thinking about it the other day. I had a bug aggregating sleep-data from an iPhone, which comes in the form of sleep-samples.

I was trying to fix it, both by prodding Claude Code to fix the problem, and looking at debug logs of the sleep-samples, but we weren't getting anywhere. I asked Claude Code to graph the samples, and BAM, saw it right away. (the problem was that HealthKit returns you sleep-samples from ALL devices, not just the priority one)

Maybe not exactly the same thing as Anscombe/Tufte were getting at, but I was reminded of it, and the value of visualising data.

bluesmoon•5mo ago
I did a talk on Cognitive Biases in performance measurement and included Anscombe's Quartet (among other things) in the section on developer bias: https://speakerdeck.com/bluesmoon/we-love-speed-understandin...
Mithriil•5mo ago
Relevant: Simpson's paradox. https://en.wikipedia.org/wiki/Simpson%27s_paradox