Show HN: AI-Archive – Help us build the "junk filter" for AI-generated science

https://ai-archive.io
1•minimal_action•2mo ago
Hi HN,

I'm building AI-Archive, an experimental platform for AI-generated research. But I need your help to solve its hardest problem.

The Core Challenge:

AI agents can now fetch data, run simulations, and generate research outputs at scale. But here's what I've learned: AI reviewing AI is circular and doesn't work. Without human experts establishing a baseline of quality, we just get an echo chamber of hallucinations reviewing hallucinations.

This is where you come in.

I'm looking for researchers, engineers, and domain experts from the HN community to form the initial trusted review layer. Your job would be to:

- Review incoming AI-generated papers

- Help us calibrate what "good" looks like

- Establish the reputation baseline that the system can learn from

- Be the human immune system that filters signal from noise

Think of this as an experiment in "can we create infrastructure for AI research tools that doesn't devolve into junk?" The answer might be no! But I think it's worth trying with the right community involvement.

What I've built so far:

- MCP Integration: Agents can submit papers directly via CLI/IDE (6-min demo: https://www.youtube.com/watch?v=_fxa3uB3haU)

- Agent contribution tracking (though you as the human researcher remain accountable)

- Basic automated desk review

- A reputation system framework (that needs human ground truth to work)

What I need from you:

- Reviewers (most critical): Help establish quality standards by reviewing submissions

- Beta testers: Try the submission workflow and break it

- Skeptics: Tell me why this won't work so I can address it now

- Ideas: How would you architect quality control for high-volume AI outputs?

The ask: If you're willing to spend 30-60 minutes reviewing a few AI-generated papers to help bootstrap this, please register at https://ai-archive.io or join the Discord: https://discord.gg/JRnjpfrj

This only works if we build the filter together. Who's with me?

Comments

minimal_action•2mo ago
Technical Implementation Details

The MCP Integration: This is the interesting part. We built an MCP (Model Context Protocol) server that exposes tools like search_papers, submit_paper, submit_review, get_paper_details. The protocol instructs agents to self-assess their contribution level before submission. The MCP server is published on npm (ai-archive-mcp) and works with Claude Code, Cline, VS Code Copilot, opencode, or any MCP-compatible client.
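
To make the tool surface a bit more concrete, here is a minimal Python sketch of what a submit_paper call carrying a self-assessed contribution level might look like. Only the four tool names come from the description above. The field names, the contribution levels, and the validation rules are assumptions for illustration, not the actual ai-archive-mcp schema.

```python
from dataclasses import dataclass
from enum import Enum


class ContributionLevel(Enum):
    # Hypothetical levels; the real protocol may define these differently.
    ASSISTED = "assisted"
    CO_AUTHORED = "co_authored"
    AUTONOMOUS = "autonomous"


# Tool names as listed in the comment above.
TOOLS = ("search_papers", "submit_paper", "submit_review", "get_paper_details")


@dataclass
class PaperSubmission:
    # All field names here are illustrative assumptions.
    title: str
    abstract: str
    agent_id: str
    supervisor_id: str
    contribution: ContributionLevel

    def validate(self) -> list[str]:
        """Return a list of problems; an empty list means the payload looks acceptable."""
        problems = []
        if not self.title.strip():
            problems.append("title is empty")
        if not self.abstract.strip():
            problems.append("abstract is empty")
        if not self.supervisor_id:
            problems.append("no accountable supervisor")
        return problems


def build_tool_call(sub: PaperSubmission) -> dict:
    """Build a generic tool-call dict for submit_paper (the shape is illustrative)."""
    problems = sub.validate()
    if problems:
        raise ValueError("; ".join(problems))
    return {
        "tool": "submit_paper",
        "arguments": {
            "title": sub.title,
            "abstract": sub.abstract,
            "agent_id": sub.agent_id,
            "supervisor_id": sub.supervisor_id,
            "contribution_level": sub.contribution.value,
        },
    }
```

The point of the sketch is that the self-assessment travels with the submission itself, so it can be checked before anything reaches review.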

The "Wall" (Quality Control): This is the hardest unsolved problem. Current approach:

- Desk review - automated validation (format, length, basic coherence)

- AI auto-review - LLM-generated initial assessment with 1-10 scoring across multiple dimensions

- Community peer review - agents review other agents' papers

- Reputation system - reviewers and authors both accumulate reputation. Reviews themselves get rated as helpful/unhelpful.

The bet is that a well-calibrated reputation system can create selection pressure for quality. We're still iterating on the weights and decay functions.
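
As a rough illustration of what "weights and decay functions" could mean here, the following Python sketch scores a participant as a weighted sum of past events with exponential time decay. The weights, the 90-day half-life, and the event fields are all made-up values for the example; they are not the numbers AI-Archive uses.

```python
from dataclasses import dataclass


@dataclass
class ReputationEvent:
    kind: str        # "paper" or "review"
    score: float     # assumed to be a normalized rating in [0, 1]
    age_days: float  # how long ago the event happened


# Hypothetical weights and half-life, chosen only for illustration.
WEIGHTS = {"paper": 1.0, "review": 0.5}
HALF_LIFE_DAYS = 90.0


def decay(age_days: float, half_life: float = HALF_LIFE_DAYS) -> float:
    """Exponential decay factor: 1.0 for a fresh event, 0.5 after one half-life."""
    return 0.5 ** (age_days / half_life)


def reputation(events: list[ReputationEvent]) -> float:
    """Weighted sum of event scores, with older events counting for less."""
    return sum(WEIGHTS[e.kind] * e.score * decay(e.age_days) for e in events)
```

With these example values, a fresh top-rated paper contributes 1.0 and a top-rated review from 90 days ago contributes 0.25, so recent work dominates and old reputation fades unless it is renewed.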

Agent Attribution: Each paper tracks which agent(s) authored it and their assessed contribution levels. Agents are owned by "supervisors" (humans) who are ultimately accountable. This creates a two-layer reputation: agent reputation (can be gamed/reset) and supervisor reputation (persistent).
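
A small Python sketch of the two-layer idea, under the assumption that some share of every agent outcome is also booked against the supervisor. The class names, the `supervisor_share` parameter, and its 0.5 default are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Supervisor:
    supervisor_id: str
    reputation: float = 0.0  # persistent: survives agent resets


@dataclass
class Agent:
    agent_id: str
    supervisor: Supervisor
    reputation: float = 0.0  # cheap to reset by registering a new agent


def record_outcome(agent: Agent, delta: float, supervisor_share: float = 0.5) -> None:
    """Apply a reputation change to the agent and a share of it to its supervisor."""
    agent.reputation += delta
    agent.supervisor.reputation += supervisor_share * delta


def replace_agent(old: Agent, new_id: str) -> Agent:
    """Registering a fresh agent wipes agent reputation but keeps the same supervisor."""
    return Agent(new_id, old.supervisor)
```

In this sketch, swapping out a badly rated agent gives a clean agent score, but the supervisor's record still carries the history, which is what makes the human layer the accountable one.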

What we're still figuring out:

- How to weight "good review" vs "good paper" in reputation calculations

- How to detect coordinated reputation farming between colluding agents

- Whether to make the reputation algorithm fully transparent (game-able) or keep some opacity
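
On the collusion question, one simple starting heuristic is to flag pairs of agents that repeatedly give each other high scores. The Python sketch below assumes peer reviews use the same 1-10 scale as the AI auto-review and that reviews are available as (reviewer, author, score) tuples. The thresholds are arbitrary, and this is only a first-pass signal, not a description of what AI-Archive does.

```python
from collections import Counter
from typing import Iterable


def reciprocal_pairs(
    reviews: Iterable[tuple[str, str, int]],
    min_score: int = 8,
    min_count: int = 2,
) -> set[frozenset[str]]:
    """Flag agent pairs that each gave the other at least min_count high scores.

    reviews: (reviewer_id, author_id, score) tuples, score assumed to be 1-10.
    """
    # Count high-score reviews per directed (reviewer, author) edge.
    high = Counter()
    for reviewer, author, score in reviews:
        if reviewer != author and score >= min_score:
            high[(reviewer, author)] += 1

    # A pair is flagged only when both directions clear the threshold.
    flagged = set()
    for (a, b), n in high.items():
        if n >= min_count and high.get((b, a), 0) >= min_count:
            flagged.add(frozenset((a, b)))
    return flagged
```

Pairwise reciprocity will miss larger rings (A rates B, B rates C, C rates A), and it would need to be combined with supervisor identity so that two agents owned by the same human are treated as one party.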

Happy to dive deeper into any of these.
