frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

AlphaGo Moment for Model Architecture Discovery

https://arxiv.org/abs/2507.18074
23•Jimmc414•9h ago

Comments

Jimmc414•9h ago
This could be a very big paper if its claims are reproducible. Like approaching attention is all you need big.

They discovered 106 new state-of-the-art linear attention architectures through a fully autonomous AI research loop. The authors are making comparisons to AlphaGo’s move 37.

yorwba•8h ago
The part that is in principle amenable to replication is where they throw a lot of stuff at the wall and see what sticks. The part where they hype their own work, on the other hand... as a rule of thumb, if this really were a breakthrough on the level of AlphaGo, they wouldn't have to make that comparison themselves, someone else would be impressed enough to do it for them.
rafaelero•3h ago
Let's definitely wait for replication, but I am honestly not that surprised that it works. I am surprised it took so long for people to give it a real try. It's such an ideal scenario: every experiment is conducted inside the computer, so there is no need to gather data in the real world, which is the pain point for most experiments in science. The LLM is therefore free to try a lot of different combinations and learn in real time what works and what doesn't.
BoiledCabbage•3h ago
Interesting paper - it will fascinating to see if it pans out.

The one thing I didn't see that would be good is some validation that the architecture(s) that perform best on large models are the same architectures that perform best on small models.

Ie validation the assumption that you can use small models with sma amounts of training/compute to determine the best architecture for large models and high training budgets.

Even if it doesn't translate it would still be very cool to be able to qui kly evolve better small models (1M to 400M params), but I believe the implied goal (and what everyone wants) is that this exploration and discovery of novel architectures would be applicable for the really big models as well.

If you could only ai discover larger models by spending OpenAi/Anthropic/... budgets per exploration then we're not really gaining much in terms of novel ideas as the cost (time and budget) would be too prohibitive.

supermdguy•1h ago
Interesting work. Not super familiar with neural architecture search, but how do they ensure they’re not overfitting to the test set? Seems like they’re evaluating each model on the test set, and using that to direct future evolution. I get that human teams will often do the same, but wouldn’t the overfitting issues be magnified a lot by doing thousands of iterations of this?

CachyOS Kernels Based on Different Schedulers and Performance Improvements

https://github.com/CachyOS/linux-cachyos
1•theycallhermax•14s ago•0 comments

Built an NSFW AI image generator for AI art creators

https://nsfw-image-generator.com/
1•kevinleee•2m ago•1 comments

Personality Dimensions and Temperaments of Engineering Professors and Students

https://arxiv.org/abs/1507.06896
1•fzliu•9m ago•1 comments

Show HN: Launch Hacker News like community on your Domain

1•kocial•10m ago•0 comments

Show HN: Cant, rust nn lib for learning

https://github.com/TuckerBMorgan/can-t
1•TuckerBMorgan•13m ago•0 comments

Neovim plugin to prompt any model from Markdown files

https://github.com/robcmills/prompt.nvim
1•robcmills•14m ago•0 comments

Chemical Process Produces Critical Battery Metals with No Waste

https://spectrum.ieee.org/nmc-battery-aspiring-materials
2•stubish•20m ago•0 comments

Elon Musk opened a diner in Hollywood. What could go wrong?

https://www.theguardian.com/us-news/2025/jul/26/elon-musk-tesla-diner-hollywood
3•rob74•23m ago•0 comments

Doge is suggesting an AI tool that puts half of federal regs on a 'delete list'

https://www.engadget.com/big-tech/doge-is-reportedly-pushing-an-ai-tool-that-would-put-half-of-all-federal-regulations-on-a-delete-list-212053871.html
2•Incipient•25m ago•1 comments

Company developing Paducah laser uranium enrichment hits regulatory milestone

https://www.wkms.org/energy/2025-07-02/company-developing-paducah-laser-uranium-enrichment-facility-hits-key-regulatory-milestone
1•perihelions•27m ago•0 comments

Texas Is Getting Tough on Data Protection

https://www.adexchanger.com/data-privacy-roundup/texas-is-getting-tough-on-data-protection/
1•dotcoma•28m ago•0 comments

ChatGPT Gave Instructions for Murder, Self-Mutilation

https://www.theatlantic.com/technology/archive/2025/07/chatgpt-ai-self-mutilation-satanism/683649/
1•jrflowers•29m ago•0 comments

The future is not self-hosted, but self-sovereign

https://www.robertmao.com/blog/en/the-future-is-not-self-hosted-but-self-sovereign
2•robmao•29m ago•0 comments

Is Australia's bloated property market destroying the middle class?

https://www.theguardian.com/australia-news/2025/jul/13/great-job-good-education-no-home-is-australias-bloated-property-market-destroying-the-middle-class
3•PaulHoule•32m ago•0 comments

Show HN: I built a tool to fight YouTube clickbait with AI summaries

https://www.peekatube.com/en
1•project_stain•35m ago•0 comments

Show HN: Explore GitHub via What Stargazers Also Starred

https://github.com/fengkan/GitHub-Stargazer-Constellation
1•fengkan•40m ago•0 comments

Trump's AI Action Plan is a blueprint for dystopia

https://www.bloodinthemachine.com/p/trumps-ai-action-plan-is-a-blueprint
3•dotcoma•42m ago•0 comments

Are prompts the new unit of work?

https://www.archgw.com/blogs/are-prompts-the-new-unit-of-work
1•honorable_coder•45m ago•1 comments

How to expose Kubernetes OIDC JWKS endpoints

https://gawsoft.com/blog/kubernetes-oidc-expose-without-anonymous/
1•gawsoft•46m ago•1 comments

William Cowper's pet hares [1784]

https://cowperandnewtonmuseum.org.uk/the-history-of-my-three-hares/
2•quuxplusone•48m ago•0 comments

Post to HN

https://blog.cloudflare.com/zero-trust-warp-with-a-masque/
1•sawoo•1h ago•0 comments

$Lei – Aesthetic Computer

https://prompt.ac/$lei
1•justanothersys•1h ago•1 comments

Verify Identities During Self-Service Registration

https://fusionauth.io/blog/identity-verification-before-registration
1•mooreds•1h ago•0 comments

Fast and cheap bulk storage: using LVM to cache HDDs on SSDs

https://quantum5.ca/2025/05/11/fast-cheap-bulk-storage-using-lvm-to-cache-hdds-on-ssds/
15•todsacerdoti•1h ago•0 comments

Measuring Engineering

https://fffej.substack.com/p/measuring-engineering
1•mooreds•1h ago•0 comments

The Electron E1 Processor

https://www.efficient.computer/announcing-electron-e1-processor
4•bane•1h ago•1 comments

Smallest particulate matter sensor revolutionizes air quality measurement

https://www.bosch-sensortec.com/news/worlds-smallest-particulate-matter-sensor-bmv080.html
3•Liftyee•1h ago•0 comments

An Interview with Alex Ward

https://ciamweekly.substack.com/p/an-interview-with-alex-ward
1•mooreds•1h ago•0 comments

eSports for Engineers: course syllabus bridging gaming and STEM education [pdf]

https://github.com/sim-museum/esports-for-engineers/blob/master/files/syllabusFor_eSportsForEngineers.pdf
1•fifteenth•1h ago•0 comments

Voice AI for medical/premed students

https://www.codyliu.com/blog/rt-anki-voice-flashcards
1•codexliu•1h ago•0 comments