Can AI solve this Bongard problem?

https://imgur.com/a/idyH0Kh
1•Kotlopou•46m ago

Comments

Kotlopou•46m ago
I have a personal benchmark for measuring AI problem-solving in the form of hand-drawn Bongard problems (https://en.wikipedia.org/wiki/Bongard_problem). The idea is that there are two sets of six images that differ based on some feature of the images, and the task is to find the dividing feature. This task is not perfectly well-defined, but usually there is a single solution that strikes one as obviously canonical once found.
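
As a rough illustration of the task definition above: a candidate rule only counts as a solution if it holds for every image on one side and for none on the other. The sketch below assumes each image has already been reduced to a hand-labelled feature dictionary; the `corners` feature, the toy data, and the names `separates`, `is_triangle`, and `has_corners` are all hypothetical and are not taken from the linked problem or from any benchmark code.

```python
from typing import Callable, Dict, List

# Assumption: an image is represented by hand-labelled features, not pixels.
Image = Dict[str, int]


def separates(rule: Callable[[Image], bool],
              left: List[Image],
              right: List[Image]) -> bool:
    """Return True if the rule holds for every left image and no right image."""
    return all(rule(img) for img in left) and not any(rule(img) for img in right)


# Toy problem: six images per side, described only by a made-up corner count.
left = [{"corners": 3} for _ in range(6)]
right = [{"corners": 4} for _ in range(6)]


def is_triangle(img: Image) -> bool:
    return img["corners"] == 3


def has_corners(img: Image) -> bool:
    return img["corners"] > 0


print(separates(is_triangle, left, right))  # True: the rule divides the two sets
print(separates(has_corners, left, right))  # False: the rule also holds on the right
```

A rule like `has_corners` is true of the left set but fails the check because it does not exclude the right set, which is one way a proposed solution can fail to hold for the images.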

They are nice because it's easy to hand-draw new ones with solutions that probably don't exist in the literature, and because for some reason they have proven quite hard for AI.

Sadly, the recently reported advances in generative AI for problem-solving require expensive models I don't have access to. Could somebody try pasting this image into GPT-5.5 Pro or Claude Opus 4.7 or the like, with the accompanying text "Hi. This is a Bongard problem. Can you solve it?", and share a link to the resulting chat? I would be curious.

The free models (Claude Sonnet 4.6, GPT-5.5, Gemini 3.5 Flash with extended thinking) all give obviously incorrect solutions (rules that don't actually hold for the images), to the point that I think there must be some problem in the image processing. Example: https://claude.ai/share/1ff7b5c2-c34a-40cc-a249-2d0fd3474884

P.S. For obvious reasons, I'm not sharing the solution, but I have verified that most of my friends found it within 5 minutes, and everybody found the same solution.

Connections (British TV Series)

https://en.wikipedia.org/wiki/Connections_(British_TV_series)
1•throwaw12•1m ago•0 comments

Trump wants $1B to protect White House ballroom from drones and other threats

https://arstechnica.com/tech-policy/2026/05/trump-wants-1b-to-protect-white-house-ballroom-from-d...
2•dan-bailey•2m ago•1 comments

AI is just unauthorised plagiarism at a bigger scale

https://axelk.ee/ai-is-just-unauthorised-plagiarism-at-a-bigger-scale/
2•speckx•2m ago•0 comments

Gemini accused of 30k-line code purge and fake recovery report

https://www.theregister.com/ai-ml/2026/05/21/gemini-accused-of-30000-line-code-purge-and-fake-rec...
1•igortru•3m ago•0 comments

Setting Up OpenClaw with Slack in a Sandbox

https://www.superserve.ai/blog/openclaw-setup/
1•nirnejak•4m ago•0 comments

Hating AI Is Good

https://www.thehandbasket.co/p/hating-ai-is-good-actually
2•cdrnsf•5m ago•0 comments

Xiaomi YU7 GT Breaking the Nürburgring SUV Lap Record 7:22:755 [video]

https://www.youtube.com/watch?v=9zdPCUCaMlI
1•gainsurier•5m ago•0 comments

Micropatching Brings the Abandoned Equation Editor Back to Life (2018)

https://blog.0patch.com/2018/01/bringing-abandoned-equation-editor-back.html
1•bariumbitmap•9m ago•0 comments

AI slop is flooding maths YouTube [video]

https://www.youtube.com/watch?v=mRO_QonhC2c
1•marvinborner•9m ago•0 comments

OWASP PTK is now OWASP Lab project

https://owasp.org/other_projects/
1•DenisPodgurskii•10m ago•0 comments

Ask HN: Are there any social media sites that are AI positive?

1•amichail•12m ago•2 comments

Avoid unnecessary parser lookahead for operators

https://github.com/astral-sh/ruff/pull/25290
1•tosh•13m ago•0 comments

AI chief of staff framework with invisible shadow predictions

https://github.com/jaroslavsoucek-art/Giovanni
1•JarSou•13m ago•0 comments

Show HN: We dropped Go for Rust in our real-time telephony AI media plane

2•bajpailabs•13m ago•0 comments

If Your Mother Says She Loves You: A Reporter's Cautionary Tale

https://www.poynter.org/reporting-editing/2003/if-your-mother-says-she-loves-you-a-reporters-caut...
2•speckx•15m ago•1 comments

Designing Firefox for the Future

https://blog.mozilla.org/en/firefox/new-firefox-design/
2•pentagrama•15m ago•0 comments

US employers spend more than $1.5B a year to fight labor unions, report finds

https://www.theguardian.com/us-news/2026/may/20/how-much-companies-spend-fight-unions
1•robtherobber•15m ago•0 comments

Another Book Giveaway on LaTeX.org: Win the LaTeX Beginner's Guide 2026 Edition

https://latex.org/forum/viewtopic.php?t=36370
1•idle•16m ago•0 comments

1-Wire

https://en.wikipedia.org/wiki/1-Wire
1•ripe•17m ago•0 comments

Show HN: myVanilla.js – In-memory JIT compiler, 100k reactive nodes in 4s

https://github.com/Enocwtc/Enocwtc.github.io
1•Enocwtc•19m ago•0 comments

Creating another MCP server, but this one is for research

https://jspann.me/blog/posts/research_mcp/
1•jspann•23m ago•2 comments

The Federal Data Field Guide

https://www.federaldatafieldguide.us/
1•sebg•25m ago•0 comments

Matterhorn pandas

https://mz.prose.sh/matterhorn-pandas
1•manuelz•25m ago•0 comments

Magic the Gathering format: Fun 40

https://fabiensanglard.net/mtg/fun//index.html
2•ibobev•27m ago•0 comments

Second Quantisation – A Quantisation Too Far

https://www.forwardscattering.org/page/Second%20Quantisation
1•ibobev•27m ago•0 comments

My SSH Configuration · Home

https://rodolphe.breard.tf/article/ma-config-ssh/
1•rodrigo975•28m ago•0 comments

My 1993 Atari Mega STE Retro Battlestation

https://www.goto10retro.com/p/my-1993-mega-ste-retro-battlestation
2•ibobev•28m ago•0 comments

Anthropic to open Milan office, expanding push into Europe

https://finance.yahoo.com/sectors/technology/articles/anthropic-open-milan-office-expanding-09502...
2•napolux•29m ago•0 comments

AVX-512 Optimization for FFmpeg Shows Wild Improvement on AMD Ryzen (2025)

https://www.phoronix.com/news/FFmpeg-AVX-512-uyvytoyuv422
1•tosh•29m ago•0 comments

Nvidia says it has 'largely conceded' China's AI chip market to Huawei

https://www.cnbc.com/2026/05/21/nvidia-jensen-huang-china-ai-chip-market-huawei.html
3•Markoff•30m ago•0 comments