frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Gemini 3 Pro Model Card [pdf]

https://storage.googleapis.com/deepmind-media/Model-Cards/Gemini-3-Pro-Model-Card.pdf
37•virgildotcodes•1h ago

Comments

rvz•1h ago
> The training dataset also includes: publicly available datasets that are readily downloadable; data obtained by crawlers; licensed data obtained via commercial licensing agreements; user data (i.e., data collected from users of Google products and services to train AI models, along with user interactions with the model) in accordance with Google’s relevant terms of service, privacy policy, service-specific policies, and pursuant to user controls, where appropriate; other datasets that Google acquires or generates in the course of its business operations, or directly from its workforce; and AI-generated synthetic data.

Well don't complain when you are using Gmail and your emails are being trained to develop Gemini.

patates•4m ago
It says "pursuant to user controls, where appropriate". We can now sleep peacefully with the knowledge that Google will give us the tools to disable this where it's not inappropriate.
surrTurr•59m ago
gone now;

wayback machine still has it: https://web.archive.org/web/20251118111103/https://storage.g...

lifthrasiir•56m ago
For the veracity of the link itself: https://storage.googleapis.com/deepmind-media/* has been used by DeepMind itself (e.g. "View tech report" in https://deepmind.google/models/gemini/) so it is a genuine leak.
meetpateltech•56m ago
it was accidentally pushed a little early, and now it has been taken down.

here’s the archived pdf: https://web.archive.org/web/20251118111103/https://storage.g...

TheAceOfHearts•37m ago
They scored a 31.1% on ARC AGI 2 which puts them in first place.

Also notable which models they include for comparison: Gemini 2.5 Pro, Claude Sonnet 4.5, and GPT-5.1. That seems like a minor snub against Grok 4 / Grok 4.1.

kranke155•19m ago
Grok seems extremely prone to hallucination in my experience. It also constantly asserts certainty on fuzzy topics.
buildfocus•7m ago
My impression is that Grok is very rarely used in practice outside of a niche of die-hard users, partly because of very different tuning to other models, and partly the related public reputation around it.

https://firstpagesage.com/reports/top-generative-ai-chatbots... suggests 0.6% of chat use cases, well below the other big names, and I suspect those stats for chat are higher than other scenarios like business usage. Given all that, I can see how Gemini might not be focused on competing with them.

jmmcd•7m ago
About ARC 2:

I would want to hear more detail about prompts, frameworks, thinking time, etc., but they don't matter too much. The main caveat would be that this is probably on the public test set, so could be in pretraining, and there could even be some ARC-focussed post-training - I think we don't know yet and might never know.

But for any reasonable setup, if no egregious cheating, that is an amazing score on ARC 2.

surrTurr•22m ago
good benchmark stats except for coding where it looks similar to other SOTA models
aurareturn•21m ago
Benchmark suggests it is a resounding win for Gemini 3 Pro as the top model.
patates•9m ago
It says it's been trained from scratch. I wonder if it will have the same undescribable magic that makes me spend an hour every day with 2.5. I really love the results I can get with 2.5 pro. Google eventually limiting aistudio will be a sad day.

Also I really hoped for a 2M+ context. I'm living on the context edge even with 1M.

Avi Wigderson – P vs. NP [video]

https://www.youtube.com/watch?v=HX9i9PL8os0
1•nill0•46s ago•0 comments

Do We Need to Slash the Debt?

https://manhattan.institute/article/do-we-really-need-to-slash-the-debt
1•nis0s•1m ago•1 comments

The nature of the Theory of Computation (2018) [pdf]

https://www.math.ias.edu/~avi/PUBLICATIONS/Wigderson2018.pdf
1•nill0•1m ago•0 comments

Ask HN: Does insurance cover payouts during a global outage?

1•cryptography•2m ago•0 comments

A DST primer for unit test maxxers

https://www.amplifypartners.com/blog-posts/a-dst-primer-for-unit-test-maxxers
1•todsacerdoti•2m ago•0 comments

Show HN: Tool to check risks of your startup

https://www.siqnalis.com/risk-check
1•hulk-konen•3m ago•0 comments

Technoblogy – USB-C Power Delivery Dongle

http://www.technoblogy.com/show?5729
1•rcarmo•5m ago•0 comments

Open Source in Focus: Projects We're Proud to Support

https://blog.jetbrains.com/blog/2025/11/18/open-source-in-focus-projects-we-re-proud-to-support/
1•quapster•8m ago•0 comments

Show HN: An open source static site that parses and visualizes log files locally

https://notesofcliff.github.io/logsieve/
1•ilovetux•8m ago•0 comments

MaggieLab – Simple, non-destructive, online image editor

https://www.maggielab.com
1•nunodonato•12m ago•1 comments

Open Source Power

https://blog.muni.town/open-source-power/
1•cippaciong•14m ago•0 comments

Show HN: SecuriScan – Open-source Chrome extension for passive security analysis

https://chromewebstore.google.com/detail/securiscan-web-security-a/icjlbldpcojppnjpkpkkfbhnfafnhpfl
1•ashish_sharda•15m ago•0 comments

Art Forms in Nature

https://en.wikipedia.org/wiki/Kunstformen_der_Natur
1•cl3misch•16m ago•0 comments

Made this website because the Google results answering this question were wrong

https://www.howmanytradingdays.com/
1•eastburnn•16m ago•0 comments

We found cryptography bugs in the elliptic library using Wycheproof

https://blog.trailofbits.com/2025/11/18/we-found-cryptography-bugs-in-the-elliptic-library-using-...
1•ingve•16m ago•0 comments

Cloudflare is down and causing outages at X, OpenAI

1•paulwilsonn•17m ago•1 comments

Mpgr: Central task overlay for multi-repo workspaces

https://github.com/jochumdev/mpgr
1•jochumdev•17m ago•0 comments

How I design Software Architecture

https://old.reddit.com/r/LLMDevs/comments/1ox9dou/how_i_design_software_architecture
1•kiryl_kazlovich•18m ago•1 comments

Graph Neural Networks for Faster Search

https://www.chrisgregory.me/blog/graph-contraction-search
1•tekknolagi•20m ago•0 comments

Loads and Loads of Fluffy Kittens

https://hazyresearch.stanford.edu/blog/2025-11-17-fluffy-kittens
1•todsacerdoti•20m ago•0 comments

AmiGameJam 2025

https://itch.io/jam/amigamejam
1•doener•20m ago•0 comments

Oracle's $300B OpenAI deal is now valued at minus $60B

https://www.ft.com/content/064bbca0-1cb2-45ab-85f4-25fdfc318d89
4•0xedb•20m ago•0 comments

Cloudflare Down, Again

4•atlasx1z•21m ago•2 comments

Show HN: Kassouf-Btc-Options – Thorp-Kassouf Option Models on Bitcoin

https://github.com/dradicchi/kassouf-btc-options
1•dcvr•22m ago•0 comments

Mastodon: Founder Steps Down as CEO, Receives One Million Euros

https://www.heise.de/en/news/Mastodon-Founder-Steps-Down-as-CEO-Receives-One-Million-Euros-110819...
1•rapnie•22m ago•0 comments

Cloudflare Outage 2025: What Website Owners Need to Know

https://veerhost.com/cloudflare-outage-2025-what-website-owners-need-to-know/
4•aymanaljunaid•23m ago•0 comments

Trump admin axed 383 active clinical trials, dumping over 74K participants

https://arstechnica.com/health/2025/11/over-74000-people-were-kicked-out-of-clinical-trials-becau...
2•heisenbit•25m ago•0 comments

Running Java on iOS

https://www.infoq.com/news/2025/11/java-on-ios/
1•0x54MUR41•29m ago•0 comments

Overwhelmed Hiring Team

2•Praisethegreat•29m ago•3 comments

Deep dive into the small details of micrograd

https://omrishneor.github.io/2025/11/18/deep-dive-into-the-small-details-of-micrograd/
2•omrishn•31m ago•1 comments