frontpage.

Visual Features Across Modalities: SVG and ASCII Art Cross-Modal Understanding

https://transformer-circuits.pub/2025/october-update/index.html#svg-cross-modal
8•vismit2000•1w ago

Comments

robot-wrangler•4h ago
Generating and displaying diagrams in Mermaid, SVG, or CSS has become one of my go-to tests for reasoning. This seems fair because while SVG is admittedly syntactically difficult and maybe not emphasized in training, CSS is certainly a popular output target, and Mermaid is very simple. It seems like a SOTA model should be able to draw and modify things that it "understands".

I'm much more interested in stuff like Venn diagrams and bipartite graphs than pictures of cats or pelicans riding bikes. It's similar to a code-generation problem in that the output is a new artifact that's one step away from the problem presentation, but it has the advantage that it's simpler than code, is less likely to have exact-match training data, usually has one correct answer, and is easy to check. Try making Venn diagrams on a few circles with "exactly and only the following intersections" and gradually elaborating the spec.

This is a great way to get starter diagram boilerplate if that's what you're looking for. One-shot prompts for simple things are OK, sometimes. But it always completely falls apart when you try to iterate with small modifications, introducing errors in parts that were previously correct or ignoring requested changes. Maybe it's wrong to conclude anything from that, but to me this looks bad for the "they can reason!" argument and very bad for trusting complicated work in other domains that are harder to check. Haven't read TFA yet, but whether it confirms or denies my gut here, hopefully it's going to add some perspective.
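
The "easy to check" point about the Venn diagram test can be made concrete. Below is a minimal sketch, assuming the generated diagram has already been reduced to named circles given as `(x, y, radius)`; the names `pairwise_overlaps` and `matches_spec` and the sample coordinates are hypothetical, not taken from the comment. It only checks pairwise overlaps, so it is a rough stand-in for the "exactly and only the following intersections" spec, not a full region-by-region check.

```python
from itertools import combinations
from math import hypot


def pairwise_overlaps(circles):
    """Return the set of name pairs whose circles share some area.

    `circles` maps a name to an (x, y, radius) tuple (assumed format).
    """
    found = set()
    for (a, (ax, ay, ar)), (b, (bx, by, br)) in combinations(circles.items(), 2):
        # Two discs overlap when their centers are closer than the sum of radii.
        if hypot(ax - bx, ay - by) < ar + br:
            found.add(frozenset((a, b)))
    return found


def matches_spec(circles, required_pairs):
    """True when the overlapping pairs are exactly the required ones."""
    wanted = {frozenset(pair) for pair in required_pairs}
    return pairwise_overlaps(circles) == wanted


# Hypothetical layout: A and B overlap, C sits apart from both.
circles = {"A": (0, 0, 10), "B": (15, 0, 10), "C": (60, 0, 10)}
print(matches_spec(circles, [("A", "B")]))
```

Re-running a check like this after each requested modification would flag the regressions described above, where a previously correct intersection disappears or an unrequested one appears.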

What is a manifold?

https://www.quantamagazine.org/what-is-a-manifold-20251103/
78•isaacfrond•3h ago•22 comments

The Art of Atari (2016)

http://www.artofatari.com
6•ghtbircshotbe•16m ago•0 comments

Chaining FFmpeg with a Browser Agent

https://100x.bot/a/chaining-ffmpeg-with-browser-agent
4•shardullavekar•39m ago•1 comments

You can't cURL a Border

https://drobinin.com/posts/you-cant-curl-a-border/
281•valzevul•12h ago•143 comments

The Farmer Was Replaced [video]

https://www.youtube.com/watch?v=aP2WHQKJVsw
31•surprisetalk•1w ago•5 comments

Bloom filters are good for search that does not scale

https://notpeerreviewed.com/blog/bloom-filters/
48•birdculture•4h ago•6 comments

Customize Nano Text Editor

https://shafi.ddns.net/blog/customize-nano-text-editor
12•shafiemoji•1w ago•0 comments

My Truck Desk

https://www.theparisreview.org/blog/2025/10/29/truck-desk/
188•zdw•10h ago•33 comments

Things you can do with diodes

https://lcamtuf.substack.com/p/things-you-can-do-with-diodes
277•zdw•13h ago•77 comments

AI's Dial-Up Era

https://www.wreflection.com/p/ai-dial-up-era
354•nowflux•16h ago•302 comments

When stick figures fought

https://animationobsessive.substack.com/p/when-stick-figures-fought
229•ani_obsessive•12h ago•71 comments

Reverse-engineered CUPS driver for Phomemo receipt/label printers

https://github.com/vivier/phomemo-tools
33•Curiositry•1w ago•8 comments

Tell HN: X is opening any tweet link in a webview whether you press it or not

104•stillatit•7h ago•48 comments

Former US Vice-President Cheney Dies

https://www.reuters.com/world/us/former-us-vp-dick-cheney-dead-84-punchbowl-news-says-2025-11-04/
30•abawany•1h ago•15 comments

A friendly tour of process memory on Linux

https://www.0xkato.xyz/linux-process-memory/
181•0xkato•14h ago•17 comments

Ask HN: Why are most status pages delayed?

20•2gremlin181•1h ago•15 comments

Ask HN: Who is hiring? (November 2025)

356•whoishiring•21h ago•395 comments

Tenacity – a multi-track audio editor/recorder

https://tenacityaudio.org
58•smartmic•1w ago•18 comments

Lessons from interviews on deploying AI Agents in production

https://mmc.vc/research/state-of-agentic-ai-founders-edition/
74•advikipedia•6h ago•64 comments

Learning to read Arthur Whitney's C to become smart (2024)

https://needleful.net/blog/2024/01/arthur_whitney.html
306•gudzpoz•21h ago•129 comments

Resolution limit of the eye – how many pixels can we see?

https://www.nature.com/articles/s41467-025-64679-2
54•bookofjoe•1w ago•35 comments

This Month in Ladybird – October 2025

https://ladybird.org/newsletter/2025-10-31/
108•exploraz•1h ago•16 comments

The Mack Super Pumper was a locomotive engined fire fighter (2018)

https://bangshift.com/bangshiftxl/mack-super-pumper-system-locomotive-engine-powered-pumper-extin...
146•mstngl•16h ago•102 comments

Some software bloat is OK

https://waspdev.com/articles/2025-11-04/some-software-bloat-is-ok
27•senfiaj•5h ago•54 comments

Pain Points of OCaml

https://quamserena.com/2025-11-03/pain-points-of-ocaml
44•quamserena•7h ago•34 comments

The Case That A.I. Is Thinking

https://www.newyorker.com/magazine/2025/11/10/the-case-that-ai-is-thinking
204•ascertain•19h ago•639 comments

The Case Against PGVector

https://alex-jacobs.com/posts/the-case-against-pgvector/
341•tacoooooooo•1d ago•129 comments

Guideline has been acquired by Gusto

https://help.guideline.com/en/articles/12694322-guideline-has-joined-gusto-faqs-about-our-recent-...
116•surprisetalk•14h ago•94 comments

Show HN: Yourshoesmells.com – Find the most smelly boulder gym

https://yourshoesmells.com
18•boshenz•4h ago•11 comments

Ask HN: How to deal with long vibe-coded PRs?

138•philippta•6d ago•257 comments