frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

What is happening to the Internet in Venezuela? Did U.S. use cyber capabilities?

https://securityaffairs.com/186509/intelligence/what-is-happening-to-the-internet-in-venezuela.html
1•pamcake•55s ago•0 comments

China's Russian Town Has Log Cabins and Cyrillic Signs, but No Russians

https://www.nytimes.com/2026/01/04/world/asia/china-russia-xi-putin.html
1•duxup•1m ago•1 comments

Zoom network data identifies foreign imposters (like Amazon keystroke lag)

https://beacon.security/resources/using-zoom-qos-data-to-find-location-anomalies
1•elkindj•4m ago•0 comments

Elon Musk– and Mark Zuckerberg–faced robotic dogs shown at Art Basel Miami

https://scienceclock.com/robot-dogs-elon-musk-mark-zuckerberg-faces-ai-art/
1•akg130522•4m ago•0 comments

Software Acceleration and Desynchronization

https://ferd.ca/software-acceleration-and-desynchronization.html
1•vrnvu•5m ago•0 comments

Show HN: Pyscn-bot – Automated Python code audits and reviews

https://pyscn.ludo-tech.org/
1•d-yoda•7m ago•1 comments

Show HN: TinySolvers – Personalized Math Word Problems for Kids

https://tinysolvers.com/
1•qedlab•8m ago•0 comments

Introduction to Obsidian

https://bryanhogan.com/blog/obsidian-introduction
1•bryanhogan•9m ago•0 comments

The Computer Language Benchmarks Game

https://benchmarksgame-team.pages.debian.net/benchmarksgame/index.html
1•smartmic•9m ago•0 comments

Ned Block on Whether Consciousness Requires Biology

https://www.preposterousuniverse.com/podcast/2026/01/05/339-ned-block-on-whether-consciousness-re...
1•consumer451•12m ago•0 comments

Evangeline Lilly reveals she has brain damage after hitting her head in fall

https://www.theguardian.com/film/2026/jan/05/evangeline-lilly-brain-damage-beach-fall
1•DustinEchoes•12m ago•0 comments

Did I just accidentally find the cheapest mobile app vibe coding tool?

https://mobilable.dev/
1•vadimen•13m ago•2 comments

Falcon-H1R: Hybrid Model for Efficient Test-Time Scaling

https://falcon-lm.github.io/blog/falcon-h1r-7b/
2•simonpure•14m ago•0 comments

Functors, Applicatives, and Monads: The Scary Words You Understand

https://cekrem.github.io/posts/functors-applicatives-monads-elm/
1•todsacerdoti•14m ago•0 comments

Ask HN: Would you fly on a Boeing 737 MAX?

2•g4zj•16m ago•1 comments

Sega's Dave Rosen Passes on Christmas Day in L.A.

https://www.replaymag.com/segas-dave-rosen-passes-on-christmas-day-in-l-a/
1•wicket•16m ago•0 comments

Which Entrepreneurs Boost Productivity?

https://libertystreeteconomics.newyorkfed.org/2026/01/which-entrepreneurs-boost-productivity/
1•Bostonian•18m ago•0 comments

Most LLM conversations are noise: a cheap way to decide what to remember

https://github.com/zachseven/two-room-memory
1•zachseven•19m ago•1 comments

Hospitals Are a Proving Ground for What AI Can Do, and What It Can't

https://www.wsj.com/tech/ai/hospitals-are-a-proving-ground-for-what-ai-can-do-and-what-it-cant-60...
3•Brajeshwar•19m ago•0 comments

Mysterious Voynich manuscript may be a cipher, a new study suggests

https://www.livescience.com/archaeology/mysterious-voynich-manuscript-may-be-a-cipher-a-new-study...
1•Brajeshwar•20m ago•0 comments

Reading Is a Vice

https://www.theatlantic.com/ideas/2026/01/reading-crisis-solution-literature-personal-passion/685...
2•voxleone•20m ago•0 comments

Wheat Prices Help Predict Baseball Averages

https://www.codesota.com/explainers/steins-paradox
3•Brosper•21m ago•1 comments

Denmark Tells Trump to 'Stop the Threats' About Greenland

https://www.nytimes.com/2026/01/05/world/europe/trump-greenland-denmark.html
4•saubeidl•22m ago•0 comments

Cat meows and what they mean

https://meowscope.vercel.app
2•yksanjo•23m ago•2 comments

How GitHub monopoly is destroying the open source ecosystem

https://ploum.net/2026-01-05-unteaching_github.html
5•toastal•25m ago•0 comments

Don't feel bad; even the inventor of the -vibe coding- term is overwhelmed

https://www.globalnerdy.com/2026/01/04/dont-feel-bad-even-the-inventor-of-the-term-vibe-coding-is...
1•cumo•25m ago•0 comments

The Bitter Lessons

https://www.hyperdimensional.co/p/the-bitter-lessons
1•mooreds•26m ago•0 comments

What Stress Feels Like After Leaving a Salaried Job

https://vinitvr.pages.dev/personal-blog/pb-post10/
2•vcool07•26m ago•1 comments

I came back from Cursor to VS Code

https://pablomarino.com/research_blog/2026/01/05/research4.html
1•pablonm•29m ago•0 comments

"We're Praying Nothing Goes Wrong"–What I Heard in Basel's Secret Meeting

https://www.youtube.com/watch?v=zVZ9gJbYQPI
1•shrubby•30m ago•3 comments
Open in hackernews

Ask HN: What are the main measures of AI progress?

3•ericlamb89•1d ago
I’m interested in how AI progress is currently evaluated and trying to build a list of the major approaches people actually use.

I’m aware that all of these measures have limitations and that many are controversial or imperfect. My goal is discovery and understanding, not to defend or attack any particular framework.

I’d love to hear:

- What measures, benchmarks, or methodologies you think belong on this list

- What you see as their key strengths and failure modes

- How (or whether) you personally use them to interpret AI progress

Comments

johnnyfived•1d ago
There'd first have to be an intense evaluation and standardization process for AI / measuring AGI now. All current benchmarks are tailored to one use case (e.g. SWE) or are evaluations that can be gamed and manipulated.

I think this would take the form of something more abstract instead of concrete with raw numbers, like a revised Turing Test.

kayo_20211030•1d ago
Yeah. I think the Turing Test has passed its sell-by date. As all things inevitably do. I'd be interested in how the "revised Turing Test" you propose looks. I'm not smart enough to know what that'd be, but it'd be interesting as a starting point.
johnnyfived•1d ago
It's a great question that I haven't seen discussed on HN yet (though I'm not that active), I think this crowd is still very attuned to interesting but more deterministic problems technically.

This might sound basic but I keep coming back to this idea again and again. Alex Garland really did have the right idea with Ex-Machina, where in the film Caleb claims that he purposefully designed Ava (the AI robot) to have all the internal mechanisms shown, so people would understand always they are interacting with a machine. The point of his Turing test was to show whether they could see past the machine and still empathize with it as a human.