frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open Source Stem Knowledge Base

https://github.com/Freelunch-AI/lunch-stem
1•BrunoScaglione•1m ago•1 comments

Accelerate video generation through sparse attention

https://svg-project.github.io/v2/
1•hxi-ucb•1m ago•0 comments

Tracing JITs in the Real World CPython Core Dev Sprint

https://antocuni.eu/2025/09/24/tracing-jits-in-the-real-world--cpython-core-dev-sprint/
1•todsacerdoti•3m ago•0 comments

Update on Ongoing Microsoft Review

https://blogs.microsoft.com/on-the-issues/2025/09/25/update-on-ongoing-microsoft-review/
1•hggh•4m ago•0 comments

Million-year-old skull rewrites human evolution, scientists claim

https://www.bbc.co.uk/news/articles/cdx01ve5151o
1•4ndrewl•5m ago•0 comments

AI Agents Get Spending Power While Humans Fail Authenticity Tests at 51.2%

https://syntheticauth.ai/posts/synthetic-auth-report-issue-012
1•zerolayers•6m ago•0 comments

Electron-based apps cause system-wide lag on macOS 26 Tahoe

https://github.com/electron/electron/issues/48311
4•STRML•7m ago•0 comments

Hare, the 100-Year Language

https://www.youtube.com/watch?v=42y2Q9io3Xs
1•fuzztester•7m ago•1 comments

Typst: A Possible LaTeX Replacement

https://lwn.net/Articles/1037577/
2•keks24•9m ago•0 comments

Redox OS Development Priorities for 2025/26

https://www.redox-os.org/news/development-priorities-2025-09/
1•akyuu•13m ago•0 comments

Dual-scale chemical ordering for cryogenic properties in CoNiV-based alloys

https://www.nature.com/articles/s41586-025-09458-1
1•PaulHoule•14m ago•0 comments

New tool makes generative AI models more likely to create breakthrough materials

https://news.mit.edu/2025/new-tool-makes-generative-ai-models-likely-create-breakthrough-material...
1•gmays•16m ago•0 comments

Data Centers Are Driving Up Your Electricity Costs

https://substack.perfectunion.us/p/how-data-centers-are-driving-up-your
1•doener•16m ago•0 comments

We're giving 40% off global eSIM data (200 countries, instant activation)

https://x.com/travelyesim
1•travelyesim•16m ago•1 comments

What Google Doesn't Want You to Know [video]

https://www.youtube.com/watch?v=GvaOUFwXjf4
2•doener•16m ago•0 comments

Walking Around the Compiler

https://bernsteinbear.com/blog/walking-around/
1•azhenley•17m ago•0 comments

Car Insurers Found a New Way to Rip You Off [video]

https://www.youtube.com/watch?v=X6UW4CFz71s
1•doener•17m ago•0 comments

Three in four European companies are hooked on US tech

https://www.theregister.com/2025/09/25/three_four_european_companies/
2•rntn•17m ago•0 comments

RVV benchmark Tenstorrent Ascalon X

https://camel-cdr.github.io/rvv-bench-results/tt_asc_x/index.html
2•fidotron•18m ago•0 comments

List of predictions for autonomous Tesla vehicles by Elon Musk

https://en.wikipedia.org/wiki/List_of_predictions_for_autonomous_Tesla_vehicles_by_Elon_Musk
2•jampekka•18m ago•0 comments

Spotify Announces New AI Safeguards, Says It's Removed 75M 'Spammy' Tracks

https://variety.com/2025/digital/news/spotify-new-ai-safeguards-1236528493/
3•c420•18m ago•0 comments

Gridap.jl – Grid-based approximation of partial differential equations in Julia

https://github.com/gridap/Gridap.jl
1•TheWiggles•18m ago•0 comments

3 Prototypes: A Design and Engineering Exploration to Control Doom Scrolling

https://medium.com/@SoCohesive/just-features-series-1-can-we-engineer-healthier-scrolling-c9f830c...
1•socohesive•19m ago•1 comments

The Hysteresis of Vibe Coding

https://the-nerve-blog.ghost.io/the-hysteresis-of-vibe-coding/
1•mprast•20m ago•0 comments

Samples note: Use comments to describe what code does, not what you wish the

https://devblogs.microsoft.com/oldnewthing/20250925-00/?p=111627
1•OptionOfT•21m ago•0 comments

DOGE might be storing every American's SSN on an insecure cloud server

https://www.theverge.com/news/785706/doge-insecure-cloud-server-social-security-numbers
5•text0404•21m ago•0 comments

I have never subscribed to receive any marketing emails

https://codelearn.me/2025/08/04/marketing-emails.html
1•TheFreim•22m ago•0 comments

Higher Gemini CLI and Gemini Code Assist Limits

https://blog.google/technology/developers/gemini-cli-code-assist-higher-limits/
1•tosh•25m ago•0 comments

Statically Generated Cellular Automata

https://ternary-totalistic-ca-hub.netlify.app/
1•marcentusch•25m ago•0 comments

Workings of Science – Debunked Software Theories

https://dl.acm.org/doi/pdf/10.1145/3512338
3•waldarbeiter•25m ago•1 comments
Open in hackernews

GDPVal: Measuring the performance of our models on real-world tasks

https://openai.com/index/gdpval/
9•BGyss•1h ago

Comments

westurner•1h ago
"GDPVal: Measuring AI model performance on real world economically viable tasks" (2025) https://cdn.openai.com/pdf/d5eb7428-c4e9-4a33-bd86-86dd4bcf1...

GDP? GlobalGoals ... The Sustainable Development Goals (SDGs) include 17 goals, 169 targets, and over 230 indicators.

For strategic alignment,

Strategic alignment: https://en.wikipedia.org/wiki/Strategic_alignment

Sustainable Development Goals: https://en.wikipedia.org/wiki/Sustainable_Development_Goals

To produce the SDGs, IIUC they clustered the world's problems as an international collaborative exercise; to succeed the MDGs (2000-2015).

Each country voluntarily produces an annual SDG report on their progress on their Targets according to the Indicators.

IMHO, Priorities should include clean energy and AI efficiency, given the growth projections for energy use of AI (and our electrical bills given continued expected supply shortages of energy)

Which real-word SDG tasks can be AI eval'd?

Snuggly73•54m ago
Apparently producing a react component that returns a piece of html with aria tags set up. Long horizon my ass.
westurner•28m ago
Did the LLM in that case suggest adopting an open-source UI library that already has tests for and implements support for W3C ARIA accessibility features, like React-Aria or other alternatives?

Or did it just do the job as prompted and not mention suggestions for continuous improvement like reusing tested open source components?

Snuggly73•18m ago
Not sure how it went in their tests - I've tried Opus and GPT5 and it was few lines of react + tests, so I guess 'no'
nextworddev•55m ago
Couldn’t find their open source evals dataset
Snuggly73•53m ago
https://huggingface.co/datasets/openai/gdpval/viewer/default...
nextworddev•35m ago
thanks!
esafak•42m ago
They reported the competitors' performance for a change. Especially curious because OpenAI is not first. Kudos?