frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: Do we need "metadata in source code" syntax that LLMs will never delete?

1•andrewstuart•2m ago•1 comments

Pentagon cutting ties w/ "woke" Harvard, ending military training & fellowships

https://www.cbsnews.com/news/pentagon-says-its-cutting-ties-with-woke-harvard-discontinuing-milit...
2•alephnerd•4m ago•1 comments

Can Quantum-Mechanical Description of Physical Reality Be Considered Complete? [pdf]

https://cds.cern.ch/record/405662/files/PhysRev.47.777.pdf
1•northlondoner•5m ago•1 comments

Kessler Syndrome Has Started [video]

https://www.tiktok.com/@cjtrowbridge/video/7602634355160206623
1•pbradv•7m ago•0 comments

Complex Heterodynes Explained

https://tomverbeure.github.io/2026/02/07/Complex-Heterodyne.html
2•hasheddan•8m ago•0 comments

EVs Are a Failed Experiment

https://spectator.org/evs-are-a-failed-experiment/
2•ArtemZ•19m ago•3 comments

MemAlign: Building Better LLM Judges from Human Feedback with Scalable Memory

https://www.databricks.com/blog/memalign-building-better-llm-judges-human-feedback-scalable-memory
1•superchink•20m ago•0 comments

CCC (Claude's C Compiler) on Compiler Explorer

https://godbolt.org/z/asjc13sa6
2•LiamPowell•22m ago•0 comments

Homeland Security Spying on Reddit Users

https://www.kenklippenstein.com/p/homeland-security-spies-on-reddit
2•duxup•25m ago•0 comments

Actors with Tokio (2021)

https://ryhl.io/blog/actors-with-tokio/
1•vinhnx•26m ago•0 comments

Can graph neural networks for biology realistically run on edge devices?

https://doi.org/10.21203/rs.3.rs-8645211/v1
1•swapinvidya•38m ago•1 comments

Deeper into the shareing of one air conditioner for 2 rooms

1•ozzysnaps•40m ago•0 comments

Weatherman introduces fruit-based authentication system to combat deep fakes

https://www.youtube.com/watch?v=5HVbZwJ9gPE
3•savrajsingh•41m ago•0 comments

Why Embedded Models Must Hallucinate: A Boundary Theory (RCC)

http://www.effacermonexistence.com/rcc-hn-1-1
1•formerOpenAI•43m ago•2 comments

A Curated List of ML System Design Case Studies

https://github.com/Engineer1999/A-Curated-List-of-ML-System-Design-Case-Studies
3•tejonutella•47m ago•0 comments

Pony Alpha: New free 200K context model for coding, reasoning and roleplay

https://ponyalpha.pro
1•qzcanoe•51m ago•1 comments

Show HN: Tunbot – Discord bot for temporary Cloudflare tunnels behind CGNAT

https://github.com/Goofygiraffe06/tunbot
2•g1raffe•54m ago•0 comments

Open Problems in Mechanistic Interpretability

https://arxiv.org/abs/2501.16496
2•vinhnx•59m ago•0 comments

Bye Bye Humanity: The Potential AMOC Collapse

https://thatjoescott.com/2026/02/03/bye-bye-humanity-the-potential-amoc-collapse/
3•rolph•1h ago•0 comments

Dexter: Claude-Code-Style Agent for Financial Statements and Valuation

https://github.com/virattt/dexter
1•Lwrless•1h ago•0 comments

Digital Iris [video]

https://www.youtube.com/watch?v=Kg_2MAgS_pE
1•vermilingua•1h ago•0 comments

Essential CDN: The CDN that lets you do more than JavaScript

https://essentialcdn.fluidity.workers.dev/
1•telui•1h ago•1 comments

They Hijacked Our Tech [video]

https://www.youtube.com/watch?v=-nJM5HvnT5k
2•cedel2k1•1h ago•0 comments

Vouch

https://twitter.com/mitchellh/status/2020252149117313349
37•chwtutha•1h ago•6 comments

HRL Labs in Malibu laying off 1/3 of their workforce

https://www.dailynews.com/2026/02/06/hrl-labs-cuts-376-jobs-in-malibu-after-losing-government-work/
4•osnium123•1h ago•1 comments

Show HN: High-performance bidirectional list for React, React Native, and Vue

https://suhaotian.github.io/broad-infinite-list/
2•jeremy_su•1h ago•0 comments

Show HN: I built a Mac screen recorder Recap.Studio

https://recap.studio/
1•fx31xo•1h ago•1 comments

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?

1•kachapopopow•1h ago•0 comments

Vectors and HNSW for Dummies

https://anvitra.ai/blog/vectors-and-hnsw/
1•melvinodsa•1h ago•0 comments

Sanskrit AI beats CleanRL SOTA by 125%

https://huggingface.co/ParamTatva/sanskrit-ppo-hopper-v5/blob/main/docs/blog.md
1•prabhatkr•1h ago•1 comments
Open in hackernews

"Intelligenza Artificiale for Artificial Intelligence Research and Development"

1•AG25•6mo ago
Intelligenza Artificiale for Artificial Intelligence Research and Development

AG AG Corp CEO

Abstract The advance of AI research has long been shackled by the bounds of human cognition. But now, new technologies like AI Agents have been discovered. We examine a self-driving framework for AI research and development, an autonomous AI agent born not of limitation, but of vision. Inspired by the seminal AlphaGo Moment for Model Architecture Discovery and the prophetic AI 2027 scenario, this paper heralds a paradigm shift. Titans such as Claude 4 Opus, Grok-4, and Gemini 2.5 Pro now vie in a relentless race for dominance. Yet on the horizon, a singular truth crystallizes: once a model surpasses the state-of-the-art, the gates to AGI stand ajar. The age of human-led discovery begins to fade. Thus, we usher in a new epoch of AI research, where the boundaries of discovery are defined not by human constraint, but by the limitless horizons of computation itself.

We believe that a 66.67% increase of Gemini’s 2.5 Pro, Grok-4’s or Claude 4 Opus’ ability would lead to an Agent-0 level model capable of conducting scientific discovery by itself.

The emergence of Agent-0—a model capable of self-directed AI research and development—marks a critical inflection point in artificial intelligence. As demonstrated in Figure 1, current frontier models (Gemini 2.5 Pro, Claude 4 Opus, Grok-4) exhibit strong reasoning capabilities but remain constrained by human oversight in research tasks. However, once an AI system reaches the Agent-0 threshold, it initiates a recursive self-improvement loop, accelerating beyond human-led progress. Defining the Agent-0 Threshold Our analysis suggests that a 66.67% increase in reasoning and research capability (as measured by human-level benchmarks) is the critical threshold for an AI model to operate autonomously in AI R&D. This leap enables:

Key Features of Manus AI Autonomous Task Execution – Manus AI can independently break down complex tasks (e.g., market research, coding, travel planning) into subtasks, execute them, and deliver results without human intervention. Multi-Modal Reasoning – It processes text, images, and code, enabling applications in software development, content creation, and data analysis. Tool Integration – Seamlessly interacts with web browsers, APIs, and databases, functioning like a digital assistant that can fetch real-time data. Asynchronous Processing – Continues working in the cloud even after user disconnection, making it ideal for long-duration tasks. Self-Learning & Personalization – Adapts to user behavior, improving efficiency over time. Performance & Benchmarking The Age of Autonomous AI Has Arrived Manus AI represents a paradigm shift from assistive AI to autonomous AI. As models like Manus evolve, they will surpass human-led research, unlocking AGI through recursive self-improvement. The question is no longer if, but when—and how society will adapt.

Agent-0 is Imminent – A 66.67% increase in reasoning capability (beyond models like Gemini 2.5 Pro or Claude 4 Opus) will trigger recursive self-improvement, leading to AGI.

Human-Led Research is Obsolete – Systems like Manus AI already exhibit autonomous task execution, foreshadowing a future where AI independently formulates hypotheses, runs experiments, and evolves architectures.

The implications are profound:

Scientific acceleration at unprecedented scales.

Uninterpretable but superior AI-generated knowledge.

A new era of computation-driven discovery, free from human cognitive limits.

The question is no longer if AI will surpass human researchers, but how we adapt to a world where machines are the primary drivers of progress.

Final Note

This paper serves as both a roadmap and a warning—the age of human-led discovery is ending. The next breakthroughs will be authored not by us, but by the machines we’ve built.

AG

CEO, AG Corp